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REMARKS 

The Office is authorized to charge the fee for a three month extension of time to 
Deposit Accoimt No. 50-091 1 . Any fees that may be due in coimection with the fiUng of this 
paper or with this application may be charged to Deposit Account No. 50-091 1 . If a Petition 
for Extension of time is needed, this paper is to be considered such Petition. 

A supplemental Information Disclosure Statement accompanies this response. 

A change in Power of Attorney for the undersigned is has been filed. A Change in 
Correspondence Address accompanies this response. 

Claims 1, 3-7, 9, 1 1-16, 45, 48, 51-53, 57-59, 61-63, and 65-78 are pending. Claims 
1, 5, 59, 63 and 66 are amended and claims 2 and 56 are cancelled without prejudice or 
disclaimer. The claims are amended for clarity. For example, the claim is reorganized so 
that it is clear that recited limitations are not part of the preamble. Further basis for the claim 
amendments can be found in the specification as originally filed, for example, at page 20, 
lines 3-7; at page 25, lines 3-16, and at page 28, lines 1-3; and Table 3, which recite particular 
specificity determinants S1-S4 and methods of mutating the specificity determinants to 
generate protese muteins. 

The arguments and response filed August 7, 2008, June 11, 2009 and July 20, 2010 
are each incorporated by reference herein. 

I. CLAIM OBJECTIONS 

Claim 66 is objected to because it depends upon cancelled claim 64. This objection is 
addressed by amendment of the claims herein. 

II. THE REJECTION OF CLAIMS 1-7, 9, 11-16, 45, 48, 51-53, 56-59, 61-63 AND 
65-78 UNDER 35 U.S.C. §112, FIRST PARAGRAPH - NEW MATTER 

Claims 1-7, 9, 1 1-16, 45, 48, 51-53, 56-59, 61-63 and 65- 78 are rejected under 35 
U.S.C. §112, first paragraph, because it is alleged that the claimed subject matter is not 
described in the specification. Specifically, the Examiner states that in Claim 1 , step (e) is 
not supported by the as-filed specification. Pursuant to MPEP 2163.06, the Examiner 
requests that applicants point out where in the original disclosure the amendments to the 
claims appear. 

It appears that the same rejection that applies to claim 1 , step (e) also applies to the 
other independent claims 53, 59 and 63. In particular, in each of these claims, the 
amendment that was made to step e) is underlined in the portion of step e) reproduced below, 
which recites: 
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(e) testing the identified prote£ise(s) or biologically active portion thereof for 
cleavage and inactivation of an activity of the target protein that contains the 
substrate sequence, t hereby identifying a protease mutein or a 
catalytically biologically active portion thereof that inactivates an activity of a 
target protein that is involved with the disease or pathology. 

Hence, the amended portion of the claims is directed to testing the activity of the target 
protein upon cleavage by an identified proteases to verify that the identified mutein protease 
inactivates an activity of the target protein. Basis for this amendment can be found at 
paragraph [0050] and [0059], which describe general methods of engineering a protease to 
inactivate an activity of a target protein, and paragraph [0123], which describes testing or 
assaying the target protein upon cleavage by the protease for inactivation or destruction of a 
function or activity. 



[0050] The present invention is drawn to methods for generating and screening proteases to 
cleave target proteins at a given substrate sequence. Proteases are protein-degrading enzymes 
that recognize an amino acid or an amino acid substrate sequence within a target protein. 
Upon recognition of the substrate sequence, proteases catalyze the hydrolysis or cleavage of a 
peptide bond within a target protein. Such hydrolysis of the target protein may inactivate 
it, depending on the location of peptide bond within the context of the full-length 
sequence of the target sequence. The specificity of proteases can be altered through protein 
engineering. If a protease is engineered to recognize a substrate sequence within a target 
protein or proteins that would (i.) alter the function Le, by inactivation of the target 
protein(s) upon catalysis of peptide bond hydrolysis and, (ii.) the target protein(s) are 
recognized or unrecognized as points of molecular intervention for a particular disease 
or diseases, then the engineered protease has a therapeutic effect via a proteolysis- 
mediated inactivation event, [emphasis added] 

[0059] In some examples, the engineered protease is designed to cleave any of the target 
proteins in Table 1, thereby inactivating the activity of the protein. The protease can be 
used to treat a pathology associated with that protein, by inactivating one of the target 
proteins, [emphasis added] 

[0123] Mutant proteases that match the desired specificity profiles, as determined by substrate 
libraries, are then assayed using individual peptide substrates corresponding to the desired 
cleavage sequence. Variant proteases are also assayed to ascertain that they will cleave the 
desired sequence when presented in the context of the full-length protein. The activity of the 
target protein is also assayed to verify that its function has been destroyed by the 
cleavage event The cleavage event is monitored by SDS-PAGE after incubating the purified 
full-length protein with the variant protease, [emphasis added] 

Therefore, the specification as originally filed describes a method for generating and 
screening proteases as claimed that includes a step of testing the identified mutant proteases 
for activity in cleaving the target protein. 

III. THE REJECTION OF CLAIMS 1-7, 9, 11-16, 45, 48, 51-53, 56-59, 61-63 AND 
65-78 UNDER 35 U.S.C. §112, FIRST PARAGRAPH -Lack of Support 

Claims 1-7, 9, 1 1-16, 45, 48, 51-53, 56-59, 61-63 and 65- 78 are rejected under 35 
U.S.C. §1 12, first paragraph, because 1) the Examiner states that the specification fails to 
provide written description for candidate proteases that inactivate an activity of the target 
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protein, the inactivation of which can ameliorate a disease or pathology and 2) the Examiner 

also states the specification fails to describe method step (e). With respect to this latter 

rejection, the Examiner states: 

Furthermore, the specification fails to describe the claim method of 
step (e) testing the identified protease(s) or biologically active portions thereof 
for cleavage and inactivation of an activity of the target protein that contains 
the substrate sequence, thereby identifying a protease mutein or a biologically 
active portion thereof that inactivates an activity of a target protein that is 
involved with the disease or pathology. It is not apparent from the disclosure 
description as to type or kind of testing of any identified protease, if any, that 
has been applied to any or all kinds of target protein involved in any type of 
disease or pathology. 

This rejection respectfully is traversed. 
Relevant Law 

The case law related to the written description requirement is set forth in previous 
responses, incorporated by reference herein. 
The Rejected Claims 

There are four independent claims, claims 1, 53, 59 and 63, that differ with respect to 
the particular target proteins or scaffold proteases used in the method. The basic steps of the 
method are the same for all independent claims. For example, independent claim 1 , as 
amended herein, recites: 

A method of producing and identifying a mammalian protease mutein that inactivates 
an activity of a target protein involved with a disease or pathology in a mammal, comprising 
the steps of: 

(a) producing a library comprising protease muteins comprising mutation(s) of a 
serine protease scafTold and/or biologically active portions thereof at an amino acid position 
selected from among amino acid residues between 58 and 64, 97, 98, 99, 100, 171, 174, 180, 
189, 190, 191, 192, 215, 217, 218 and 226 by chymotrypsin numbering, wherein: 

each different mutein protease in the library is a member of the library; 

each member of the library has N mutations relative to a wild-type mammalian 
protease scaffold or a biologically active portion thereof; and 

N is a positive integer; 

(b) contacting members of the library with a target protein or with a polypeptide 
comprising a substrate sequence that is present in the target protein, wherein: 

the target protein is selected from among a cell surface molecule that 
transmits an extracellular signal for cell proliferation, a cytokine, a cytokine receptor and a 
signaling protein that regulates apoptosis; and 

cleavage of a substrate sequence in the target protein inactivates an activity 
of the target protein; and 

(c) measuring a cleavage activity and/or substrate specificity of at least two members 
of the library for the target protein or substrate sequence; 

(d) based on the measured activity and/or specificity, identifying members of the 
library that have an increased cleavage activity and/or altered substrate specificity for cleaving 
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the substrate sequence or the target protein, relative to the wild-type mammalian protease 
scaffold; and 

(e) testing the identified protease(s) or biologically active portion thereof for 
cleavage and inactivation of an activity of the target protein that contains the substrate 
sequence,.thereby identifying a protease mutein or a biologically active portion thereof that 
inactivates an activity of a target protein that is involved with the disease or pathology. 

Dependent claims further recite particulars of the number of mutations, the target 
protein, the scaffold protease, and steps of measuring cleavage activity and inactivation of an 
activity of the target protein. 

Analysis 

First, for clarity of the claims, the language "inactivation of the target protein can 
ameliorate a disease or pathology" is deleted from the claims. As stated in previous 
responses, the instant claims are directed to a method of screening for mutant proteases 
that inactivate an activity of a target protein. While the specification renders it clear that 
the identified proteases can serve as candidate therapeutics for treating a disease or pathology 
(see e.g. at paragraph [0005], which states that "the resultant proteins may be used as agents 
for in vivo therapy"), the instant claims are not directed to a method of treatment of diseases 
or pathologies, nor to methods of ameliorating a disease or pathology. Hence, the fact that 
the inactivation of the target protein can ameliorate a disease or pathology is not required by 
the instant claims, and is not a limitation of the method of the instant claims. Applicant's 
inclusion of this limitation w£is only meant to render it clear that the target protein is involved 
in a disease or pathology by virtue of the fact that destruction or inactivation of an activity of 
the target protein can ameliorate a disease or pathology. Accordingly, by deletion of this 
phrase, the portion of the rejection that relates to written description for candidate proteases 
that inactivate an activity of the target protein, the inactivation of which can ameliorate a 
disease or pathology, is rendered moot. 

Second, with respect to the rejection as to the description of step e) of the claims, 
Applicant respectfully submits that the specification provides detailed description, including 
in Working Examples, sufficient to show that Applicant has possession of the claimed subject 
matter. In particular, the specification describes exemplary target proteins for use in the 
method and also describes assays for testing the activity of a target protein in order to assay 
whether a function or activity of the target protein has been destroyed or inactivated 
following cleavage by protease. Further, as set forth in MPEP 2163, Applicant is not 
required to describe information that is well knovm in the art (stating that "[ijnformation 
which is well known in the art need not be described in detail in the specification. See, e,g,^ 
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HybritecK Inc. v. Monoclonal Antibodies, Inc., 802 F.2d 1367, 1379-80, 231 USPQ 81, 90 
(Fed. Cir. 1 986)). In this instance, as described further below, assays to assess or test the 
function or activity of a target protein is well known in the art. 

The specification describes that exemplary of target proteins are cell surface 
molecules that transmit an extracellular signal, cj^okines, cytokine receptors, and proteins 
involved in apoptosis. For example, a cell surface molecule that transmits an extracellular 
signal is a G protein coupled receptor (GPCR) or a kinase. One of skill in the art is familiar 
with target proteins that are dell surface molecules that transmit an extracellular signal, 
cytokines, cytokine receptors, and proteins involved in apoptosis. The specification describes 
at least 45 exemplary target proteins that are representative of these classes and that are 
involved with a disease or pathology {see e.g. Table 1 on page 9). Table 1 is reproduced 
below: 
Table 1 
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IL-4/IL-4R 


Asthma 


Cytokine 


TNF/TNFR 


Asthma, Crohn's disease. 


Cytokine 




HIV Infection, inflammation. 






psoriasis, rheumatoid 






arthritis 




CCR5/CXCR4 


HIV infection 


GPCR 


gp120/gp41 


HIV infection 


Fusion protein 


CD4 


HIV infection 


Receptor 


Hem aglutinin 


Influenza Infection 


Fusion protein 


RSV fusion protein 


RSV Infection 


Fusion protein 


B7/CD28 


Graft-v.-host disorder. 


Receptor 




rheumatoid arthritis. 






transplant rejection, diabetes 






meliitus 




IgE/lgER 


Graft-v.-host disorder, 


Receptor 




transplant rejection 




CD2,CD3.CD4.CD40 


Graft-v.-host disorder. 


Receptor 




transplant rejection, psoriasis. 




IL-2/IL-2R 


Autoimmune disorders. 


Cytokine 




graft-v.-host disorder. 






rheumatoid arthritis 




VEGF.FGF.EGF.TGF 


Cancer 


Cytokine 


Her2/neu 


Cancer 


Receptor 


CCR1 


Multiple sclerosis 


GPCR 


CXCR3 


Multiple sclerosis, rheumatoid 


GPCR 




arthritis 




CCR2 


Atherosclerosis, rheumatoid 


GPCR 




arthritis 




Src 


Cancer, osteoporosis 


Kinase 


Akt 


Cancer 


Kinase 


Bcl-2 


Cancer 


Protein-protein 


BCR-AbI 


Cancer 


Kinase 


GSK-3 


Diabetes 


Kinase 


cdk-2/cdk-4 


Cancer 


Kinase 
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Other exemplary proteins involved in apoptosis also are provided in the specification at page 
39, lines 19-24, i.e. human cytochrome c, human Apaf-1, human caspase-9, himian caspase-7, 
human caspase-6, human caspase-2, human BAD, human BID, human BAX, human PARP, 
or hvmian p53. These target proteins are not new, but are proteins that are well known in the 
art and have activities that are well known in the art. The instant claims explicitly recite 
these classes of target proteins or these target proteins. Further, the description of these 
target proteins is representative of a class of target proteins, such that one of skill in the art 
would understand and be able to identify a target protein involved in a pathology or disease. 

In this case the modified proteases are screened for cleavage and inactivation of an 
activity of the target protein. The specification describes that in the method, proteases are 
engineered to alter the function of a target protein to inactivate it (see e,g, at page 9, lines 16- 
19). Thus, the specification describes that in the method "[t]he activity of the target 
protein is also assayed to verify that its function has been destroyed by the cleavage 
event'* (see e.g, at page 32, lines 23-25). 

The specification exemplifies activities or functions of exemplary proteins that can be 
inactivated or destroyed upon incubation and cleavage by a protease. For example, the 
specification describes that cleavage and inactivation of a TNF receptor can inactivate the 
receptor such that the action of tumor necrosis factor is inhibited {see e.g. at page 9, lines 29- 
32). In another example, the specification describes that cleavage and inactivation of a target 
that is the same as activated protein C, which is involved in promoting blood coagulation, can 
attenuate the blood coagulation cascade (see e.g. at page 10, lines 1-3). In a further example, 
the specification describes that cleavage and inactivation of a cell surface molecule involved 
in tumorigenicity can inactivate their ability to transmit extracellulsir signals, especially for 
cell proliferation (see e.g. at page 10, lines 4-6). In an additional example, the specification 
describes that cleavage and inactivation of a protein involved in cell cycle progression, can 
inactivate the ability of the protein to allow the cell cycle to go forward (see e.g. at page 10, 
lines 9-12). In other examples, the specification describes that cleavage and inactivation of 
viral membrane fusion proteins can inhibit the ability of the virus to infect cells (see e.g. at 
page 10, lines 13-17). In further examples, the specification describes that cleavage and 
inactivation of the same target protein as plasminogen activator can result in inactivation of 
the thrombolytic cascade (see e.g. at page 10, lines 18-21). In additional examples, the 
specification describes that cleavage and inactivation of cytokines or receptors can inactivate 
the associated signaling cascade (see e.g. at page 10, lines 22-26). In other examples, the 
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specification describes that cleavage and inactivation of proteins involved in apoptosis inhibit 
the associated signaling cascade for apoptosis and inhibit apoptosis {see e,g. at page 10, lines 
1 7-3 1 and at page 39, lines 1 8-24). 

Accordingly, there is extensive descriptions of functions that can be altered by 
inactivation of a target protein. It is within the level of one of skill in the art to assay for such 
functions depending on the specific target protein. For example, as described in the 
specification, following incubation with a protease, various functions of a target protein 
including, for example, cell proliferation, coagulant activity, binding of a ligand to a receptor 
(e.g. TNF binding to TNF receptor), cell proliferation, other assays for cell cycle activity, 
viral infection of cells, and apoptosis can be assayed. As set forth below, in vitro and in vivo 
assays to assess various functions of target proteins, including target proteins as exemplified 
in the instant specification, were well known to one of skill in the art at the time of the 
priority date of the instant application: 



Target 


Assay 


Reference 


Caspase 3 


ApoAlert 
CPP32/Caspase-3 
colorimetric assay 
kit (Clontech) 


Izban et al., "Characterization of the interleukin- 
1 beta-converting enzyme/ced-3 -family protease, 
caspase-3/CPP32, in Hodgkin's disease: lack of 
caspase-3 expression in nodular lymphocyte 
predominance Hodgkin's disease," Am J Path 
154(5):1439-1447(1999) 


tumor necrosis 
factor 


In vitro TNF assay 
with TNF- 
sensitive L cells; 
In vivo TNF assay 
- standard Meth A 
sarcoma assay. 


Rubin et al., "Purification and characterization of a 
human tumor necrosis factor from the LuKII cell 
line," Proc Natl Acad Sci USA 82:6637-6641 (1985); 
Williamson et aL, Proc Natl Acad Sci USA 80:5397- 
5401 (1983) 


interleukin-l 


Thymocyte 
proliferation 
assay; Ability to 
stimulate IL-2 
production 


Lachman et al., J. Immunol. 1 19:2019 (1977); 
Howard et al., "Role of interleukin 1 in anti- 
immunoglobulin-induced B cell proliferation," J. Exp 
Med 157:1529-1543 (1983); 
Lowenthal and MacDonald, "Binding and 
intemalization of interleukin 1 by T cells," J. Exp 
Med 164:1060-1074 (1986) 


interleukin-2 


IL-2 dependent 
exponential 
proliferation of 
murine cytotoxic 
T-cell line 
(CTLL) 


Gillis and Mizel, "T-Cell lymphoma model for the 
analysis of interleukin 1 -mediated T-cell activation," 
Proc Natl Acad Sci USA 78(2): 1 133-1 137 (1981) 


interleukin-4 


Cellular Assay — 
stimulate 
proliferation of 
purified B cells 


Grabstein et al., "Purification to homogeneity of B 
cell stimulating factor. A molecule that stimulates 
proliferation of multiple lymphokine-dependent cell 
lines," J Exp Med, 163:1405-1414 (1986) 
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interleukin-4 
receptor 


Induction oi 
intracellular 
signaling 


Kammer et al., Homodimenzation oi interleukin-4 
receptor a chain can induce intracellular signaling," J 
Biol Chem 271(39):23634-23637 (1996) 


interleukin-5 


cytotoxicity in 
LAK cells 


Aoki et aL, "Interleukin 5 enhances interleukin 2- 
mediated lymphokine-activated killer activity, J Exp 
Med, 170:583-588(1989) 


interleukin-12 


chemotactic 
activity 


AUavena et al., "Interleukin-12 is chemotactic for 
natural killer cells and stimulates their interaction 
With vascular endothelium. Blood 84:2261-2268 
(1994) 


interleukin- 1 3 


IL- 1 3 induced IL- 
6 secretion in 
Caki-1 cells 


Caput et al., "Cloning and Characterization of a 
Specific Interleukin (IL)- 1 3 Binding Protein 
Structurally Related to the IL-5 Receptor a Chain," J 
Biol Chem 271(28): 16921-16926 (1996) 


p-selectin 


Induction of tissue 
factor expression 


Celi et aL, "P-selectin induces the expression of 
tissue factor on monocytes, Proc Natl Acad Sci USA 
91:8767-8771 (1994) 


p-selectin 
glycoprotein 
ligand (PSGL) 


Binding to 
immobilized p- 
selectin 


Moore et al., "The P-selectin Glycoprotein Ligand 
from Human Neutrophils Displays Sialylated, 
Fucosylated, O -Linked Poly-N-acetyllactosamine," J 
Biol Chem 269:22318-23327 (1994) 


Substance P 


Substance P with 
human blood T- 
lymphocytes 
stimulates T- 
lympocyte 
proliferation, 
quantified by flow 
cytometric and 
direct binding 
assays. 


Payan et al., "Substance P recognition by a subset of 
human T lymphocytes," J Clin Invest. 74:1532-1539 
(1984) 


bradykinin 


Effect on rat 
dorsal root 
ganglion neurons 
in vitro 


Thayer et al., "Regulation of calcium homeostasis in 
sensory neurons by bradykinin, J Neuroscience 
8(ll):4089-4097 (1988) 


Factor IX 


coagulation assay 


Chung et al., "purification and characterization of an 
abnormal Factor IX (Christmas factor) molecule, J 

Clin Invest 62:1078-1085 (1978) 


CCR5 


Fusion assay in 
HeLa-P4 cells 


Picard et aL, "Multiple extracellular domains of 
CCR-5 contribute to human immunodeficiency virus 
type 1 entry and fusion, J Virology 71:5003-501 1 
(1997) 


CXCR4 


Functional assay 
in U373MG-CD4 
cells 


Brelot et al., "Role of the first and third extracellular 
domains of CXCR-4 in human immunodeficiency 
virus coreceptor activity," J Virology 71:4744-4751 
(1997) 


gpl20 


CD4 binding 


Willey et al., "In vitro mutagenesis identifies a region 
within the envelope gene of the human 
immunodeficiency virus that is critical for 
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infectivity, J Virology 62:139-147 (1988) 


gp41 


HIV replication in 
Jurkat 

lymphocytes; 
efiiciency oi viral 
entry 


Gabuzda et al., "Effects of deletions in the 
cytoplasmic domain on biological functions of human 
immunodeficiency virus type 1 envelope 
glycoproteins, J Virology 66:3306-3315 (1992) 


hemagglutinin 


Agglutination 


Sekikawa and Lai, "Defects in functional expression 
of an influenza virus hemagglutinin lacking the signal 
peptide sequences, Proc Natl Acad Sci USA 
80:3563-3567(1983) 


B7 


CD2- and CD3- 
dependent T-cell 
proliferation 


Azuma et aL, "CD28 interaction with B7 
costimvilates primary allogeneic proliferative 

responses and cytotoxicity mediated by small, resting 
T lymphocytes;' J Exp Med 175:353-360 (1992) 


CD28 


PI3 -kinase activity 


Pages et aL, "Two distinct intracytoplasmic regions 
of the T-cell adhesion molecule CD28 participate in 
phosphatidylinositol 3 -kinase association, J Biol 
Chem 271:9403-9409 (1996) 


CD2 


IL-2 production 


Chang et aL, "Dissection of the human CD2 
intracellular domain. Identification of a segment 
required for signal transduction and interleukin 2 
production," J Exp Med 169:2073-2083 (1989) 


CD4 


Inhibition of 
syncytium 
formation and 
virus replication 


Thali et aL, "Effects of changes in gpl20-CD4 
binding affinity on human immunodeficiency virus 
type 1 envelope glycoprotein function and soluble 
CD4 sensitivity, J Virology 65:5007-5012 (1991) 


VEGF 


Induction of 

expression of 
connective tissue 
growth factor 


Suzuma et aL, "Vascular endothelial growth factor 
induces expression of connective tissue growth factor 
via KDR, Fltl, and phosphatidylinositol 3-kinase-akt- 
dependent pathways in retinal vascular cells," J Biol 
Chem 275:40725-70731 (2000) 


fibroblast 
growth factor 


Prolfieration of 
capillary 
endothelial cells 


Feige and Baird "Basic fibroblast growth factor is a 
substrate for protein phosphorylation and is 
phosphorylated by capillary endothelial cells in 
culture, Proc Natl Acad Sci USA 86:3174-3178 
(1989) 


EGF receptor 


Cell proliferation 


Moriai et aL, "A variant epidermal growth factor 
receptor exhibits altered type alpha transforming 
growth lactor binding and transmembrane signaling, 
Proc Natl Acad Sci USA 91:10217-10221 (1994) 


transforming 
growth factor 


Inhibition of TNF- 
a production 


Ranges et aL, "' Inhibition of cytotoxic T cell 
development by trsmsforming growth factor beta and 
reversal by recombinant tumor necrosis factor alpha," 
J Exp Med 166:991-998 (1987) 



Hence, based on the description in the specification, it is within the level of one of 
skill in the art to test for inactivation of an activity of a target protein. As an example, the 
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specification describes numerous assays that can be used to assess apoptosis including, for 
example, DNA ladder formation by gel electrophoresis, morphologic examination by electron 
microscopy, flow cytometry including with the use of various DNA dyes, Annexin V or 
TUNNEL staining (see e,g, at page 39, lines 3-17). 

Further, Working Examples are provided that evidence assays to assess inactivation of 
an activity of a target protein following incubation with a protease. For example, Example 4 
is directed to proteolytic cleavage and inactivation of tumor necrosis factor and txmior 
necrosis factor receptor (TNFR) and describes various assays to assess cleavage and 
inactivation of the cytokine receptor or cytokine. For example, the Example describes 
assessing inactivation of TNFR by cleavage of its stalk region such that it becomes free from 
the membrane and released into the media; the released protein can be detected in the media 
by ELIS A. Also, the Example describes an assay to assess cleavage and inactivation of TNF 
by looking for degradation products of a radioactive protein. The Example also describes 
assays of further testing for cleavage and inactivation of the ligand or receptor by incubation 
of the radioactive cytokine ligand or neutrophils (PMNs; expressing the TNFR) with 
protease. Following incubation with protease the radioactive ligand is incubated with the 
neutrophils, and binding to the TNFR thereon is determined. 

Example 8 further describes various assays to screen for cleavage of target proteins 
and inactivation or destruction of a function of the target protein. As described in the 
Example, following incubation of a purified target protein or cells expressing the target 
protein, cleavage can be assessed by assays such as SDS-PAGE and visualization of protein 
by Coomasie blue staining, autoradiography using a radiolabeled probe or by Western blot. 
ELISA also can be used. The Example further exempHfies this with respect to cleavage of 
tumor necrosis factors, TNFRl and TNFR2. 

Example 9 further describes functions associated with various cytokines and cytokine 
receptors, which can be assayed for to determine if cleavage by a protease destroyed the 
function. For example. Example 9 describes that TNF its receptor TNFR (TNFRl or 
TNFR2) are associated with various signal transduction events such as the production of pro- 
inflammatory cytokines (IL-1, IL-6 and GM-CSF), induction of metalloproteinases such as 
collagenase and stromelysisn, and increased in proliferation and activity of osteoclasts. Any 
of these functions can be assessed to determine if they are destroyed or inactivated upon 
incubation of the cytokine or receptor with protease. The Example further describes that 
VEGFA^EGFR is associated with angiogenesis and that the receptors are tyrosine kinases 
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involving in initiating signaling cascades. Hence, such functions can be assessed to 
determine if they are destroyed or inactivated upon incubation of a VEGF or receptor with 
protease. The Example further describes functions and activities of the ligands EGFR and 
HER-2 and other related ErbB family of receptor tyrosinse kinases. The Example describes 
that activation of the ErbB receptors through binding of ligand results in downstream 
signaling through the mitogen-activated protein kinase (MAPK) and the 
AKT/phosphoinositide 3-kinase (PI3-kinase) pathways. Also, the Example describes that 
these receptors are involved in cell proliferation, differentiation £ind angiogenesis. Hence, 
any of these functions can be assessed to determine if they are destroyed or inactivated upon 
incubation of the ligand or receptor with protease. 

Finally, Example 1 1 exemplifies cleavage and inactivation of an activity of the target 
protein caspase-3 by an exemplified granzyme B mutant (I99A/N218A). It is noted that 
granzyme B normally cleaves caspase-3 to activate it. This Example, however, describes 
cleavage of a distinct substrate sequence in caspase-3 by the mutant protease such that 
activity of caspase-3 is inactivated or destroyed (see e.g. at page 52 lines 1-5 and Figure 1, 
which demonstrate the altered specificity of the mutant protease). This inactivation cleavage 
sequence is only recognized by the mutant granzyme B and not by wildtype granzyme B (see 
e,g. at page 52, lines 22-27). In particular, the Example and accompanying Figure 7B 
exemplify that granzyme B cleaved and inactivated full-length caspase-3 as assessed in 
assays for apoptosis activity. For example, the Example states "[a]s shown in Figure 7B, the 
mutant granzyme B antagonized the effect of wildtype granzyme B to induce apoptosis by 
inactivating caspase-3 

In simi, it respectfully is submitted that the specification renders it clear that the 
method of identifying a mutant protease that inactivates a target protein, includes a step of 
testing or assaying a function or activity of the target protein to verify that it has been 
destroyed following incubation by the protease. The specification provides numerous 
examples of exemplary target proteins that are associated with a disease or pathology, and in 
particular numerous examples that are representative of the class of target proteins that 
provide an extracellular signal, a cytokine, a cytokine receptor or a protein involved in 
apoptosis. The specification describes the function of representative of such target proteins, 
and also describes the consequences of inactivating an activity of the protein. The 
specification, including in Working Examples, describes several assays to assess the function 
of target proteins to determine if an activity has been destroyed or inactivated, such as cell 



22- 




Applicant : Nguyen e/ a/. ^ Attorney Docl^No.: 33328.04905.US01/4905 

Serial No. : 10/677,977 Amendment and Response 

Filed : October 02, 2003 

signaling assays, binding assays, proliferation assays and apoptosis assays. Further, functions 

of target proteins, including the exemplary classes of target proteins described in the 

specification, and assays to assess the function of the target proteins are well known to one of 

skill in the art. Accordingly, it respectfully is submitted that Applicant has possession of a 

method that includes as a step "testing the identified protease(s) or biologically active portion 

thereof for cleavage and inactivation of an activity of the target protein." 

IV. THE REJECTION OF CLAIMS 1-7, 9, 11-16, 45, 48, 51-53, 56-59, 61-63 
AND 65-78 UNDER 35 U.S.C. §112, FIRST PARAGRAPH - Scope 

Claims 1-7, 9, 1 1-16, 45, 48, 51-53, 56-59, 61-63 and 65- 78 are rejected under 35 

U.S.C. §112, first paragraph, as lacking written description because the Examiner states that 

the specification describes granzyme as the sole cleaving enzyme used in the method, but that 

the claims encompass an enormous variation of enzyme muteins, even for granzyme, which 

contains numerous allelic variants. The Examiner states that the genus of proteases 

encompassed by the claims as employed in the method is too large and structurally diverse. 

Hence, the Examiner asserts that: 

[T]he instant claims are not adequately described in the specification to 
claim the broad and structurally diverse genus of the compounds used in the 
method. The claims lack written description and a skilled artisan carmot, as 
one can do with a fiilly described genus, visualize or recognize the identity of 
the members of the genus that are allelic variants. This is true even for the 
single granzyme, let alone for any enzymes of no claim distinguishing features 
as use in the broad methods steps. 

Applicant respectfully traverses this rejections. The arguments and response filed 
August 7, 2008, June 11, 2009 and July 20, 2010 are incorporated by reference. 
Relevant Law 

The case law related to the written description requirement is set forth in previous 
responses, incorporated by reference herein. 
The Rejected Claims 

Claim 1 is set forth above. Independent claims 53, 59 and 63 also are directed to a 
method of screening to produce and identify a protease that inactivates an activity of a target 
protein involved in a disease or pathology, but differ in their scope. Each of the claims 
include steps of a) producing a library of protease muteins; b) contacting mutein protease 
members of the libreiry with a substrate sequence in a target protein or a target protein 
involved in a disease or pathology; c) measuring a cleavage activity and/or substrate 
specificity of at least two members; d) based on the measured activity and/or specificity 
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identifying members that have increased cleavage activity or specificity compared to the 
starting protease scaffold; and e) testing the identified proteases or biologically active portion 
thereof for cleavage and inactivation of an activity of the target protein, thereby identifying a 
mutein protease that cleaves and inactivates an activity of a target protein involved in a 
disease or pathology. 

As amended, all claims recite that in the step of producing a library of protease 
mutein of protease scaffolds, the protease muteins include mutation of a serine protease or a 
particular recited serine protease at an amino acid position selected from among amino 
acid residue between 58 and 64, 97, 98, 99, 100, 171, 174, 180, 189, 190, 191, 192, 215, 
217, 218 and 226 by chymotrypsin numbering. Hence, all claims provide the structure of 
the protease muteins included in the library that is used in the method. 

The independent claims 1, 53, 59 and 63 differ in their recitation of the particular 
protease scaffold used in the method, or in the target protein that is a target of the method. 
For example, independent claim 1 is directed to specific target proteins such as a cell 
surface molecule that transmits an extracellular signal for cell proliferation, a cytokine, a 
cytokine receptor and a signaling protein that regulates apoptosis and also recites that the 
target protein is a serine protease scaffold. Dependent claim 7 further recites specific serine 
protease scaffolds. Independent claim 53 is directed to specific mammalian protease 
scaffolds such as a granzyme A, granzyme B, granzyme M, cathepsin, trypsin, chymotrypsin, 
subtilisin, MTSP-1, elastase, chymase, tryptase, collagenase, papain, neutrophil elastase, 
complement factor serine proteases, ADAMTS13, neural endopeptidase/neprilysin, furin, 
cruzain, and urokinase plasminogen activator (uPA), and also recites further steps of the 
method f) -i) Independent claim 59 is directed to specific target proteins such as a cell 
surface molecule that transmits an extracellular signal for cell proliferation, a cytokine, a 
cytokine receptor and a signaling protein that regulates apoptosis and also directed to specific 
protease scaffolds such as granzyme A, granzyme B, granzyme M, cathepsin, trypsin, 
chymotrypsin, subtilisin, MTSP-1, elastase, chymase, tryptase, collagenase, papain, 
neutrophil elastase, complement factor serine proteases, ADAMTS13, neural 
endopeptidase/neprilysin, furin, cruzain, and urokinase plasminogen activator (uPA). 
Independent claim 63 is directed to specific recited target proteins and also is directed to 
specific human protease scaffolds. 

Analysis 

As discussed in the Guidelines for Examination of Patent Applications Under the 35 
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U.S.C. 1 12, para. 1, Written Description" Requirement (hereinafter the Guidelines), the 
written description requirement for a claimed genus may be satisfied through 1) sufficient 
description of a representative number of species by actual reduction to practice, 2) reduction 
to drawings, or 3) by disclosure of relevant, identifying characteristics, le., structure or other 
physical and/or chemical properties, by functional characteristics coupled with a known or 
disclosed correlation between function and structure, or by a combination of such identifying 
characteristics, sufficient to show the applicant was in possession of the claimed genus. 

In this instance. Applicant has more than complied with the written description 
requirement by providing sufficient description of a representative nimiber of species and by 
providing relevant identifying characteristics of all members of the genus. 

With respect to the protease scaffolds to which the protease muteins contain mutations 
of, it respectfully is submitted that the instant claims are directed to methods of screening 
libraries that contain muteins of proteases. Applicant is not claiming the protease scaffolds or 
the mutant proteases, but methods in which proteases are used as scaffolds to create mixtures. 
The skilled artisan is capable of preparing mixtures of mutant proteases (or of catalytically 
active portions thereof). Screening methods and libraries by their nature rely on diversity, not 
knowledge of any structure/function relationships; the screening methods ferret out products 
that have a desired activity. Notwithstanding this, the specification teaches and exemplifies 
a wide variety of protease scaffolds and also teaches residues that are specificity determinants 
into which mutations can be made, and how to make mutations. By virtue of mutation of 
amino acid residues that are associated with specificity of proteases, the structvire of the 
protease muteins in the library is correlated with function. 

For example, the specification describes exemplary scaffold proteases for use in the 
method {see e,g. Table 2). These are all known proteases in the art, including known allelic 
and species variants. Applicant is not inventing these proteases; they are known. As 
stated above and in previous responses. Applicant is not required to describe information that 
is well known in the art (see, e.g., Hybritech, Inc. v. Monoclonal Antibodies, Inc.y 802 F.2d 
1367, 1379-80, 231 USPQ 81, 90 (Fed. Cir. 1986)). Also, there is no per se rule to provide 
sequence information when that sequence is already known in the field. Capon v. Eshhar v. 
Dudas 418 F,3d 1349 (Fed. Cir. 2005): Falkner v. Inglis 448 F.3d 1357, 1367-68 (Fed. Cir. 
2006). As evidence of the knowledge of one of skill in the art with regard to the sequence 
and stmcture of various serine proteases, a list of references and accession nimibers 
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evidencing the knowledge of one of skill in the art with respect to the sequences of known 
proteases is provided: 



PROTEASE (^sneciesl 


Seauence Accession Number 

li>%^XX%'W xm%r%r^00M^^XX X ^ M-SX ^ A 


T'l'vnciii 






P00761 

A V/V/ / V/ X 




P07477 Trvnsin-1 

X \J / ~ 1 / XXJf^OXll X 


l-ToTTin <jar)ien^ 


P07478 Trvnsin-2 


X XV/XAXV/ OC*L/XWXXO 


P35030 Trvnsin-3 


Homo saniens 

X xvfxxxv^ ovALyxwxxo 


AAA61231 

A XX XX XV/ X X 


Homo saniens 

X Xx^XXXV^ OMXyXX/XXtJ 


AAG30943 

X XX X XmJ ftp*/ V/ ^ i ^ 


Homo saniens 


AAC50728 


Homo saniens 


AAA61232 

X XX XX XV/XA'ap'A' 


Streotomvces eriseus 


P00775 


Fusarium oxvsDorum 


P35049 


ChvmotrvDsin 




Homo saniens 

X xv/xxxvy iJKx^xwxxtj 


099895 Chvmotrvnsin-C 


Homo saniens 

X xv/xix\y OCXX/XwXXO 


P17S'^8 r^hvmotrvnsino&en B 

XX/ ^ mJ \f ^xii y xxiv/Lx y l/ jxxxvy&wxx 


Homo sanipns 

X X\JIH\J idCIXy x^xxo 


AAAS^l^R Chvmotrvnsino&en B 

xxxxxXw/^X^O >m/A1 Y xixvu y UOXXXV/^wXX 


Homo saniens 

X XV7XX1V/ ^vXL/XV'XXO 


A API 6020 Chvmotrvnsinoaen B 

XXXXX m^\J\J^\J V^ll y XXJ.V tX Y L/OXXXWgwXX 1-9 


Rattiis norvefficus 

XxXt'Ll'UO XXV/X VV'gXV/lXO 


P07338 Chvmotrvnsin B 

X \J 1 mj \j v^iA y xxxv/LX y L/^xxi u 


Cradiis morhiia ( Atlantic coH'^ 

V_f UVXtXO XXXV^X XXvXCX 1 xm.LXClXX^XV' VVVX f 


P80646 Chvmotrvnsin B 

X %J\J\J~\J V^H y IxXw IX Y Lf OXXl u 


Vesna orientalis ^^Oriental hornet^ 

V wOL/CX V/X XWXXlCXXXi^ I V/X XwXXbCXX XXV/XXXwVy 


P00768 Chvmotrvnsin-2 

X \J\J 1 \J\J V^ll'Y XXXWtX y LyOXXl ^ 


Vesna crahro ^Ruronean hornet^ 

V V/i3L/CX wXCXI^XVy 1 X-/VU. V/L/%i/ClXX XXV/XXX^t f 


P00769 Chvmotrvnsin-2 


Phaedon cochlear! ae r\4iistarfl heetle^ 

X XXCXwUvfXX WV^WXXXV'CtX XmV' I XTXvXOLCU. i^wwLXWf 


097398 


Rattiis nor&efficus 


P55091 Chvmotrvnsin C 

X >j>j\My\. yxxiV7 tx y Lyoxxi 


Skuhtilisin 




Racillus siihtilis 

X^UV'XXXUO OvXL/VXXXi3 


P04189 Subtilisin E 

X \j~ i.\yy oui^txxi^xij x^ 


Hacillus suhtilis 

X^tXWXXXlXk^ kJ «Xliy trXXXO 


AAA22742 

XXXXXl rirA* / ~Jm» 


Bacillus subtil is 

X^VfrWXXX U^lJ VC^I^ VXXXkJ 


AAA22814 

XB.XXXX.^>^ X 1^ 


Racillus suhtilis 

X^CX\/XXXU'i3 ii7U'L/bXXXii9 


AAA22744 


Bacillus licheniformis 

MmJr^m^^RK X fc_r XX^^XX^^XXXX^^X XXXXlJ 


P0078 1 Subtilisin D Y 

X Vfvr / U X LXL/ ^XXXOXXX X 


Bacillus lentus 

^^^AA^^XXX %1 iT 1 %r 1 1 V VI i J 


P29599 SubtiHsn BL 

X ^y^y y vxiyixxxoxx 


Granzvme A 




Homo saniens 


PI 2544 

X X I I 


Homo saniens 

X XiV/xxxx<r ij%xxyx^^xxw 


AAA52647 

X XX XX x«,/ vy 1 / 


Homo saniens 

X Xx^XXXx^ lJ4X|^XWXXO 


AAD00009 

xxxx_x»./ V/ V/ 


N4us musciilus 

XTXvXO XXXlXOVflXXU^ 


PI 1032 

XXX 






Homo saniens 


P10144 

X X XX XII 


Homo sapiens 


AAA52118 


Homo sapiens 


AAB59528 


Homo sapiens 




Homo sapiens 


AAA67124 


Mus musculus 


P04187 


Granzyme M 




Homo sapiens 


P51124 


Homo sapiens 


AAA59582 
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riomo sapiens 


/\/\/\j/ZOZ 


Katrus norgevicus 




jVLus muscuius 








OaGus moriiua \^/\iianiic cou^ 


poo 1 07 


■ V%V T'Mk^ fl 

i^nyiiidse 




xiomo sapiens 




riomo sapiens 


/VAx>ZOoZo 


riomo sapiens 


AAA ^OAOn 


riomo sapiens 


AAA ^ono 1 

/V/\/\3ZUZ 1 


nomo sapiens 


AAA Q 


\^anis lamiiiaris 


r^Z 1 ohZ 


ivius muscuius 


r Z 1 o^'f 


iviacaca lascicuians 




r apio nomaaryas 


rjz 1 


ivaiius norvegicus 








i^anca papaya 


PAA7Q/1 


iNeutropnii eiastase 




riomo sapiens 


r^UoZ^O 


riomo sapiens 




riomo sapiens 


A A A 1 A 1 '71 
AAAJOl / J 


riquus caoaiius 


r J / Dj / iNeutropnu eiastase z/\ 


ij/quus caoaiius 


rj / JDo JNeuiropnii eiastase zo 


ivius muscuius 


i^yz^zo^ 


^ompieineiii lacior serine profeases 




riomo sapiens 


P/lCT/in A/f AQP1 


IV yi 1 1 1 1 ^41 1 1 1 1 fi 

ivius muscuius 


POOHA/l A/f ACPI 


riomo Sapiens 


A A 0/1 n^y 1 


AUAJVl 1 013 




riooio sapiens 




riomo sapiens 


r>Al30y4o / 


iieurai enaopeptiaase/nepriiysin 




riomo sapiens 




nomo sapiens 


AAA^IOI^ 


nomo sapiens 


\^/\/\.Oo /3Z 


nomo sapiens 


AAA ^OOO/i 


nomo sapiens 


o A A 'ini ^7 


l\ yl 1 1 f*^ 4*^^ 1 1 f> />i 1 1 1 1 

ivius muscuius 


i^oijyi 


wrycioiagus cunicuius 


rUoU4y 


ivauus norvegicus 


rU/oOl 


ous scrota 


PI QAO 1 

r lyozi 


f 111 lu 




Homo sapiens 


P09958 


Mus muscuius 


P23188 


Bos taurus 


Q28193 


Rattus norvegicus 


P23377 


Xenopus laevis 


P29119 
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i^ruzain 




1 rypaiiosoma cruzi 




1 rypanosonia cruzi 




1 rypanosoina cruzi 


A A A'^n97n 


X rypanosoiiia cruzi 


OA A "^0970 
J oZ / o 


1 rypanosoma cruzi 


AAA'^OIRO 


uroKinase plasminogen acnvafor (ux^a^j 




riomo sapiens 




riomo sapiens 


A A A Al 


riomo sapiens 




nomo sapiens 




ivius muscuius 








Rattus norvegicus 


P29598 


Bos Taurus 


Q05589 


Papio cjoiocephalus 


PI 6227 


Sus scrofa 


P04185 


Gallus gallus 


P15120 



In addition, the specification and claims provide the relevant identifying 
characteristics of the protease muteins used in the method that correlate structure and 
function. In particular, the specification describes that amino acid residues that contribute to 
the substrate specificity can be targeted for mutagenesis, /. e. those amino acids that are 
specificity determinants. These are amino acid residues in proteases that are identified in the 
application conferring substrate specificity to the protease; hence, mutating these residues can 
alter or change the substrate specificity. The specification describes exemplary specificity 
determinants for serine proteases (see e.g. at page 20, lines 3-7; and at page 28, lines 1-3; and 
Table 3). These specificity determinants include, for example, amino acid residues between 
58 and 64, 97, 98, 99, 100, 171, 174, 180, 189, 190, 191, 192, 215, 217, 218 and 226 by 
chymotrypsin numbering. The chymotrypsin numbering scheme was devised by Jonathan 
Greer (PROTEINS: Structure, Function and Genetics, 7:317-334 (1990)) and is based on 
sequence alignment between and among various serine proteases with numbering based on 
the numbering scheme of bovine chymotrypsin A. Alignment of exemplary serine proteases, 
including tryspin, and chymotrypsin and identification of corresponding residues by 
chymotrypsin numbering is exemplified in see, e.g, Friedrich et al J Biol Chem. 2002 Jan 
18;277(3):2 160-8. The claims, as amended, recite that the proteases contain mutations at 
amino acid residues between 58 and 64, 97, 98, 99, 100, 171, 174, 180, 189, 190, 191, 192, 
215, 217, 218 and 226 by chymotrypsin numbering. The specification fiirther describes 
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exemplary granzyme B mutants containing mutations in these specificity determinants (see 
e,g. Table 4), 

Under the heading, "Mutagenesis of the Scaffold Protease," the specification 
describes that mutation of specificity determinants can be effected as follows (see e.g., page 
25, lines 3-16): 

In order to change the substrate preference of a given subsite (S1-S4) for a 
given amino acid, the specificity determinants that line the binding pocket are 
mutated, either individually or in combination. In one embodiment of the 
invention, a saturation mutagenesis technique is used which the residue(s) 
lining the pocket is mutated to each of the 20 possible amino acids. . . 
Alternatively, single amino acid changes are made using standard, 
commercially available site-directed mutagenesis kits such as QuikChange 
(Stratagene). In another embodiment, any method commonly known in the art 
for specific amino acid mutation could be used.. . 

For example, the specification describes that each member of the library is a protease 
scaffold with a number (N) of mutations made to each member of the library. The 
specification describes that the resulting members in the library can contain a number of 
mutations such as, for example, 1, 2-5 (e.g. 2, 3, 4 or 5), 5t10 (e.g. 5, 6, 7, 8, 9 or 10), or 10- 
20 (e.g. 10, 11, 12, 13, 14, 15, 16, 17, 18, 19 or 20) (see e.g. at page 3, lines 709). 

Hence, the specification describes the scaffold proteases (which are known proteases 
well known to one of skill in the art), describes residues thereof that are specificity 
determinants and that are targeted for mutagenesis in the protease muteins, describes methods 
of effecting mutations, and exemplifies mutation of specificity determinants in granzyme B. 
Hence, one of skill in the art can easily envision the full scope of the claimed subject matter 
by knowing the sequence of the protease scaffold and the mutant residues contained in the 
library of protease muteins. Accordingly, it respectfully is submitted that because Applicant 
adequately describes the scaffold to be mutagenized, how to effect mutations in order to 
produce a library of protease muteins and what residues to mutate. Applicant had possession 
of the claimed methods, which are for identifying mutein proteases, at the time of filing the 
application. 

Rebuttal to Examiner's Comments 

J) The Examiner Statement that "it is known that in biotechnology art one species 
cannot be extrapolated to another due to the numerous unforeseeable reaction or interactions 
or protease-substrate " is misplaced 
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The Examiner appears to be stating that the genus of proteases is not described 

because there is not any correlation of function between the proteases in the method. For 

example, the Examiner states as an example citing Rosen et al (2002008681 1) that: 

caspases are among the most specific of proteases, with an unusual and 
absolute requirement for cleavage after aspartic acid. (The only other 
eukaryotic protease known to have a similar specificity is the serine protease 
granzyme B, a mediator of granule-dependent cytotoxic 1' lymphocyte- 
mediated apoptosis). Recognition of at least four amino acids NH2-terminal 
to the cleavage site is also a necessary requirement for efficient catalysis. The 
preferred tetrapeptide recognitions motif differs significantly among caspases 
and explains the diversity of their biological functions. Their specificity is 
even more stringent: not all proteins that contain the optimal tetrapeptide 
sequence are cleaved, implying that tertiary structural elements may influence 
substrate recognition. Cleavage of proteins by caspases is not only specific, 
but also highly efficient. The strict specificity of caspases is consistent with 
the observation that apoptosis is not accompanied by indiscriminate protein 
digestion; rather, a select set of proteins is cleaved in a coordinated manner, 
usually at a single site, resulting in a los or change in function. 

The Examiner is reminded that the claims are directed to a method of screening by 

engineering proteases to alter or change their substrate specificity. Hence, the proteases 

muteins identified by practice of the method have new specificities and activities, and hence 

extrapolation to any known or existing protease misses the point of the method . The 

method is a method to identify proteases with altered substrate specificity such that the 

protease cleaves and inactivates a target protein involved in a disease or pathology. By virtue 

of the change in specificity compared to the scaffold protease, the protease mutein can be 

used as a candidate therapeutic to destroy or inactivate the activity of the target protein. The 

substrate specificity of the scaffold protease is irrelevant to the claimed method, since the 

method is a method to identify protease muteins with new and altered substrate specificities. 

Hence, knowledge of the substrate specificity of the protease scaffold used in the method is 

not required. As discussed further below in rebuttal 2) there is no requirement in the claims 

that the protease scaffold have any cleavage activity and/or specificity for the target protein. 

Further, as stated above, the claims provide the relevant identifying characteristics of the 

mutein proteases by identifying residues that are specificity determinants for mutations; 

mutation of a residue associated with the specificity of a protease is necessarily correlated to 

the function of protease specificity, which is the function that is being engineered by practice 

of the method. 
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2) The Examiner 's Statement on page 15 of the Response that ''[t]he mutated 
protease has to be screened against a target protein specific only for said protease in order 
to identify mutant(s) with increased activity relative to the parent protease is not correct 

The Examiner appears to believe that the claimed method is limited to assessing 
cleavage activity and specificity of a protease mutein against a known target substrate of the 
corresponding protease scaffold. This is not correct. There is no limitation in the claims 
that recites that the target protein involved in a disease or pathology is a target protein of the 
protease scaffold. The point of the method is to engineer new and different specificities for 
protease mutants. Indeed, cleavage activity and/or substrate specificity for a target protein 
can be "increased" whether or not the corresponding protease scaffold exhibits any cleavage 
activity and/or substrate specificity for the target protein. For example, if the corresponding 
unmodified protease scaffold does not exhibit any cleavage activity and/or substrate 
specificity for a substrate sequence in a target protein or a target protein, than necessarily any 
protease mutein that does exhibit cleavage activity and/or substrate specificity for a substrate 
sequence in a target protein or a target protein exhibits "increased" cleavage activity and/or 
substrate specificity compared to the protease scaffold. Hence, there absolutely is no 
requirement in the claims that the protease scaffold exhibit any activity towards the target 
protein used in the method. 

Accordingly, as stated in previous responses, the structure/function of the starting 
protease is not relevant to practice of the method; the method identifies those proteases with 
the desired function, /. e, increased cleavage and/or substrate specificity for a target protein 
compared to the corresponding scaffold protease. Yet, the structure/function of the protease 
muteins is clearly set forth in the specification, and in the claims as amended, by reciting that 
the mutations in the protease occur at residues that are specificity determinants, /. e. between 
residue 58 and 64, 97, 98, 99, 100, 171, 174, 180, 189, 190, 191, 192, 215, 217, 218 and 226 
by chymotrypsin numbering, Hence, mutating the structure of the protease at these known 
identified residues is correlated to the function of altered substrate specificity and/or cleavage 
activity. 

3) Reliance on In re Ruschig 379 R 2d 990, 995, 154 USPQ 118, 23 (CCPA 
1967) is inapt. 

The Examiner rebuts Applicant's argimients that the specification describes in detail 
the protease scaffold for use in the method {e.g. as set forth in Table 2), residues that can be 
mutated in generating the protease mutein libraries and methods of mutagenesis by stating 
that: 




Applicant : Nguyen a/, ^ Attorney Do^li^No.: 33328.04905. USO 1/4905 

Serial No. : \QI611y911 Amendment and Response 

Filed : October 02, 2003 

...general statements in the specification and/or definitions do not replace the 
requirement of the law that applicants describe in detail that which is claimed 
to show possession of the claims being sought. Furthermore, a listing or 
selection of every possible protease scaffold or target does not constitute a 
written description of every species in a genus. It would not "reasonably lead" 
those skilled in the art to any particular species. In re Ruschig, 379 F, 2d 990, 
995, 1 54 USPQ 1 1 8, 23 (CCPA 1 967). 

The Examiner's attention is directed to pages 25-26 of the response filed August 07, 
2008 in which Applicant rebutted this exact same point. In that response. Applicant stated 
that the issue of the Court in In re Rushig was whether the compound of claim 1 3 was 
supported by the disclosure, whether it was new matter. The disclosure did not disclose the 
specific compound of claim 13, but instead disclosed a general formula of a compound with 
specified R groups. It was argued that one of skill in the art could arrive at the claimed 
compound from the general formula. The Court, however, affirmed the Board of Appeals 
ruling that the compound was not supported. 

The claims and disclosure at issue are not analogous to In re Ruschig. In the instant 
case, Applicant is not claiming a product by a general formula of proteases. Rather, 
Applicant is claiming a method that can employ any protease as a scaffold, regardless of its 
function or substrate specificity, in order to generate a library of protease muteins that contain 
mutations at amino acid residues between 58 and 64, 97, 98, 99, 100, 171, 174, 180, 189, 
190, 191, 192, 215, 217, 218 and 226 by chymotrypsin numbering. The specification 
provides an extensive list of exemplary scaffold proteases that can be used in the method, 
which are proteases that are well known to one of skill in the art. Thus, this is not a "laundry 
list disclosure of every moiety," but rather a disclosure of the actual scaffold proteases 
contemplated for mutagenesis for use in the method. By virtue of practice of the method, a 
protease mutein is identified that has increased cleavage activity and/or substrate specificity 
for a target protein involved in a disease or pathology and inactivates an activity of the 
disease or pathology. 

4) The Examiner 's statement that Written Description is Not Satisfied Because 
''only a single protease, granzyme modified at specific positions for mutein identification is 
described in the working example " does not comport with the Written Description 
Requirement 

The Examiner is reminded that satisfaction of the written description requirement 
does not require Working Examples. This basic dogma of written description jurisprudence 
was recently affirmed in the Federal Circuit's decision in Centocor v. Abbots whereby the 
Court stated "we have repeatedly indicated that the written description requirement does 
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not demand either examples or an actual reduction to practice" Centocor Ortho Biotech, 
Inc. V. Abbot Labs., No. 2010-1 14, 20-21 (Fed. Cir. 2/23/201 1). 

As described in detail above, and in previous responses, the specification provides 
extensive description of all steps of the method as claimed, including description of the step 
of producing a protease mutein library that contains mutations compared to a scaffold 
protease- The specification, and claims, describe the particular serine protease scaffolds that 
are mutagenized, describe the specificity determinant residues that are mutated {ue. between 
58 and 64, 97, 98, 99, 100, 171, 174, 180, 189, 190, 191, 192, 215, 217, 218 and 226 by 
chymotrypsin nxmibering), and describes methods of mutagenesis. The protease scaffold is 
merely being used as a scaffold to generate libraries of protease mutants. Indeed, the 
specificity determinants that are mutated in the library of protease muteins correlate the 
structure and function of the protease muteins to altered cleavage activity and/or specificity, 
since changing residues that are known specificity determinants is associated with altering the 
activity and specificity of the protease. Accordingly, the specification clearly evidences 
possession of the claimed subject matter since provision of numerous serine protease 
scaffolds and identification of specificity determinant residues to be mutated, permits one of 
skill in the art to clearly envision and visualize the library of protease muteins used in the 
practice of the method. 

V. THE REJECTION OF CLAIMS 1-7, 9, 11-16, 45, 48, 51-53, 56-59, 61-63 AND 
65-78 UNDER 35 U.S.C. §102 

Claims 1-7, 9, 11-16, 45, 48, 51-53, 56-59, 61-63 and 65-78 are rejected under 35 
U.S.C. § 102(b) as being anticipated by Lien et al (Combinatorial Chemistry and High 
Throughput Screening, 1999)(as evidenced by Shi et al, USP 20020197701), The Examiner 
states that Lien et al discloses a method of identifying serine proteases using targeted 
combinatorial mutagenesis of serine proteases with N mutations by generating a library of 
mutant proteases, contacting the library with a substrate and identifying the mutant. For 
example, the Examiner states that Lien et aL discloses "quantitative assessments of cleavage 
made by monitoring the hydrolysis (inactivation as claim). . and further states that Lien et 
al discloses other various limitations of the claimed subject matter. The Examiner further 
states that Lien et al discloses the same effect achieved by the specific proteases, le. 
inactivation or cleavage of the target protein, and that Lien al at page 73, col. 1 discloses 
that it is useful to generate proteases v^th new or desirable cleavage specificities, such as for 
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use in practical applications in biotechnology such as in therapy for the modulation of 
zymogen cascades, such as blood coagulation. 

This rejection is respectfully traversed. The arguments at pages 3 1-39 provided with 
the response mailed July 20, 2010 are incorporated by reference; herein. 

Relevant Law 

The case law related to anticipation is set forth in previous responses, incorporated by 
reference herein. 

The Rejected Claims 

The rejected claims are set forth above. It is noted that, as amended, all claims recite 
that in the step of producing a library of protease muteins containing mutations of a serine 
protease scaffold or a particular recited serine protease scaffold, a mutation is at an amino 
acid position selected from among amino acid residue between 58 and 64, 97, 98, 99, 
100, 171, 174, 180, 189, 190, 191, 192, 215, 217, 218 and 226 by chymotrypsin 
numbering. 

Analysis 

The disclosure of Lien et al , and the distinction of the claims from the disclosure of 
Lien et al, has been discussed in the previous response mailed July 20, 2010, incorporated by 
reference herein. The Examiner is reminded that anticipation requires the disclosure in a 
single prior art reference of each element of the claim under consideration. In this instance. 
Lien et al does not disclose a method using a protease library, whereby the library contains 
protease muteins having mutations in a serine protease scaffold at £in amino acid residue 
between 58 and 64, 97, 98, 99, 100, 171, 174, 180, 189, 190, 191, 192, 215, 217, 218 and 
226 by chymotrypsin numbering. Hence, Lien et al does not anticipate the claimed subject 
matter at least for this reason. 

Notwithstanding this, in addition, it respectfully is submitted that there is no 
disclosure in Lien et al , alone or as evidenced by Shi et al , of a method of identifying a 
protease mutein that cleaves and inactivates an activity of a target protein involved in a 
disease or pathology. The Examiner appears to be equating cleavage and hydrolysis with 
inactivation of an activity of a protein. This is not correct . All proteases hydrolyze peptide 
bonds upon cleavage of a target protein, yet not all proteases inactivate an activity of the 
target protein. Hydrolysis of peptide bonds results in cleavage of a target; it does not describe 
whether the cleaved protein is inactive or active. In fact, cleavage or products can include 
activation cleavage or inactivation cleavage. In biological processes many proteases activate 
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an activity of a target protein. This occurs, for example, in proteolytic cascades, such as is 
involved in complement activation, coagulation and apoptosis, whereby the protease acts as a 
catalyst to activate downstream targets. For example, thrombin cleaves by hydrolysis the 
target protein substrates involved in the coagulation pathway . In particular, the downstream 
activated targets of thrombin are described in Bode et al (1992) Protein Science, 1:426:471 
as follows: 

a-Thrombin converts fibrinogen into fibrin, which consequently aggregates 
and forms the intercormecting network of thrombi. Furthermore, thrombin is 
able to activate several coagulation and plasma factors, such as Factor V, 
Factor VII, XIII, and protein C. [emphasis added] 

With respect to Lien et al , Lien et al discloses the use of targeted combinatorial 
mutagenesis (TCM) in the screening methods to engineer proteases. In particular. Lien et al. 
reviews the results of various methods in terms of elucidating the roles of various residues in 
altering the cleavage activity or substrate specificity of the protease. Lien et al summarizes 
one study (Tsiang et al (Biochemistry (1996) 3:16449) that used random substitution, instead 
of TCM, to alter the substrate preference of thrombin from fibrinogen to protein C (see e.g, 
last sentence of first paragraph on page 73). As noted above, naturally, activation of protein 
C and fibrinogen is mediated by cleavage by thrombin. Hence, according to Lien et al, the 
study identified a mutant protease with preferential protein C activation over fibrinogen 
activation. Thus, while Lien et al discloses that such a thrombin mutant can be used in 
therapy for modulation of zymogen cascades, such as blood coagulation, the resulting mutant 
summarized in the review article of Lien et al has the opposite effect of the protease 
muteins identified by the instant method. There is no disclosure in Lien et al of any method 
of identifying a protease that inactivates an activity of a target protein, and in particular a 
protein involved in a disease or pathology. Hence, Lien et al also fails to disclose this 
limitation of the claims, and for this reason, also does not anticipate the instant claims. 

Rebuttal to Examiner's Comments 

1 ) The Examiner states that the limitation in claim 1 of the recitation of target 
proteins, wherein "the target protein is selected from among a cell surface molecule that 
transmits an extracellular signal for cell proliferation, a cytokine, a cytokine receptor and a 
signaling protein that regulates apoptosis" is not accorded any weight because it is in the 
preamble. The Examiner reasons that the preamble is used only to state a purpose or intended 
use for the target protein. This is not correct. First, the limitation of the target protein is not 
in the preamble, since it follows the transition clause "wherein" in the claim. Hence, the 
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limitation is part of the claim. Nevertheless, in the interest of advancing this case to 
allowance and for clarity, the independent claim 1, 53, 59 and 63 are amended herein to move 
the limitations in the claim that are recited above "comprises" to below the transition phrase 
"comprises" under the particular steps of the method. Hence, the recitation of the particular 
target protein or the protease used in the method are explicit limitations in the claims and 
should be considered and given weight when reviewing the claim in view of cited references. 

Second, the limitation is not made to state a purpose or intended use for the target 
protein but rather limits the particular target protein or substrate sequence of the target protein 
that is used in the method. In particular, the limitation of "a cell surface molecule that 
transmits an extracellular signal for cell proliferation, a cytokine, a cytokine receptor and a 
signaling protein that regulates apoptosis" are known classes or proteins. Table 1 in the 
specification at page 9, and as reproduced above, provides exemplary known target proteins 
that are representative of these classes of proteins. 

2) The Examiner states that Lien et al. discloses a truncated human growth hormone 
substrate for proteolytic cleavage, which is included in claim 63, and cites to the paragraph 
bridging pages 86-87 of Lien et al First, Applicant respectfully submits that growth 
hormone is not recited as a target protein in claim 63, and hence Lien et al cannot anticipate 
claim 63 for this reason. Also, notwithstanding the fact that Lien et al. does not disclose the 
other limitations of the claims. Applicant can find no reference in Lien et al of a method of 
screening a protease mutein for cleavage activity and/or specificity for a target substrate that 
is a growth hormone. The only reference to growth hormone oh page 87 of Lien et al. refers 
to the use of phage display to engineer human growth hormone (citing to Lowman et al 
(1993) J Mol Biol, 234:564-78). This study is completely different from the instant method 
because it is directed to the affinity maturation of growth hormone for binding to its receptor. 
It has nothing to do with a method of screening protease muteins against a target substrate, in 
particular a target substrate involved in a disease or pathology. 

VI, THE REJECTION OF CLAIMS 1-5, 7, 9, 11, 13-16, 48, 51-53, 58-59, 61-63, 65 
AND 67 UNDER 35 U.S.C. §103 

Claims 1-5, 7, 9, 11, 13-16, 48, 51-53, 58-59, 61-63, 65 and 67 are rejected under 35 
U.S.C. § 103(a) as being unpatenable over Guinto et al (Proc. Natl. Acad. Sci. USA, 96, 
pp. 1852- 1857, March 1999). The Examiner states that Guinto et al teaches method of 
identifying a mammalian protease from a database a total of 284 enzymes. In particular, the 
Examiner states that the variability of amino acid residue at position 225 is assessed, and that 
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mutation of this residue is identified as influencing the catalytic activity of the enzyme. 

Further, the Examiner states: 

The claim ability of the candidate mutant as a therapeutic for treatment 
of the disease associated with the target substrate is a property considered 
inherent to the prior art mutant serine proteases. The ability of the mutant 
protease to catalyze the target protein such as fibrinogen or protein C will 
result in the inactivation of the target hence its amelioration of the disease 
associated with fibrinogen or protein C. 

Applicant respectfully traverses this rejection. It is noted that claim 6, 12, 45, 56 (now 
cancelled), 57, 66 and 68-79 are not rejected on this ground. 
Relevant Law 

The case law related to obviousness is set forth in previous responses filed August 7, 
2008, June 11, 2009 and July 20, 2010, each incorporated by reference herein. 



In addition, the concept of inherency is not applicable to the question of obviousness. 
In re Sporman, 363 F.2d 444, 150 USPQ 449 (CCPA 1965). To state that a reference 
inherently teaches an element of a claim rebuts any finding of prima facie obviousness. The 
concept of inherency is not properly applicable to the question of obviousness (see, In re 
Sporman, 363 F.2d 444, 150 USPQ 449 (CCPA 1965)). 

Obviousness and inherency are entirely different questions; that which may be 
inherent is not necessarily known and, therefore, is an indication of unobviousness (In re 
Sporman, 363 F.2d 444, 449, 150 USPQ 449, 452 (CCPA 1965; see, also In re Naylor, 360 
F.2d 765, 152 USPQ 106 (CCPA 1966); In re Adams, 356 F.2d 998, 148 USPQ 742 (CCPA 
1966); and In re Shetty, 566 F.2d 81, 195 USPQ 753 (CCPA 1977)). 

Reference to an unexpected property as inherent begs the question of whether an 
unexpected property rebuts prima facie obviousness. The unexpected property is part of the 
invention as a whole, and, therefore, evidence of unobviousness of the claimed subject 
matter. In In re Naylor a process for preparing a polybutadiene polymer having unexpected 
properties was at issue. The CCPA held that the fact that a rubbery polybutadiene having 
high 1 ,2-addition might be inherent in following the combined teachings of the prior art is 
immaterial, if one of ordinary skill in the art would not appreciate or recognize that inherent 
result. In In re Adams, the CCPA held that since properties of a claimed structure are always 
inherent, it is "transparently erroneous" to state that subject matter cannot be patented on the 
basis of an inherent property. In re Shetty, the court held that "inherency is quite immaterial 
if, as record established here, one of ordinary skill in the art would not appreciate or 
recognize that inherent result." 
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The Rejected Claims 

The rejected claims are set forth above. It is noted that, as amended, all claims recite 
that in the step of producing a library comprising protesise mutein, the protease scaffold, 
which in all claims is a serine protease or a particular recited serine protease, comprise a 
mutation at an amino acid position selected from among amino acid residue between 58 
and 64, 97, 98, 99, 100, 171, 174, 180, 189, 190, 191, 192, 215, 217, 218 and 226 by 
chymotrypsin numbering. 



Guinto et al teaches the role of the amino acid residue 225, numbered based on 
chymotrypsin numbering like the instant application, in serine proteases. Guinto et al 
teaches studies involving saturation mutagenesis of thrombin at position 225. The resulting 
mutants were then tested for their ability to hydrolyze a chromogenic substrate, fibrinogen 
and protein C, as well as their ability to be inhibited by the inhibitor antitrhombin II. Guinto 
et al teaches that the mutants have reduced catalytic activity. 

First, as noted above, the Examiner's reasoning that inactivation of the target is 
inherent to the mutant is flawed. First, the Examiner, who states that the ability of the 
candidate mutant as a therapeutic for treatment of the disease associated with the target 
substrate is a property considered inherent to the prior art mutant serine proteases, is 
reminded (see discussion of case law above) that inherency has no place in a consideration of 
obviousness. Inherent properties are relevant in the context of anticipation, but in the context 
of an obviousness rejection, such properties can evidence unexpected results. As discussed 
below and in responses of record, no prior art of record before the earliest priority date of the 
instant application teaches or suggests any method of engineering or identifying a protease 
mutein that inactivates an activity of a target protein involved in a disease or pathology, nor 
that protease muteins identified by such a method could be candidate therapeutics. 

More importantly, the Examiner's reasoning is flawed, since hydrolysis of a target 
protein is not synonymous with inactivation of an activity of the target protein. As evidenced 
by Bode et al ((1992) Protein Science, 1 :426:471) discussed above, thrombin hydrolyzes and 
cleaves its target substrates to activate them. Hence, inactivation of the target protein 
substrates by thrombin is not an inherent activity of the protease. Rather, the inherent 
activity of thrombin is that is activates its protein target substrates. Applicant notes that the 
instant claims include an affirmative step of testing the protease mutein for inactivation of an 
activity of the protease. It respectfully is submitted that a method of engineering or screening 



Analysis 
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for a protease mutein, including a protease mutein of thrombin, that inactivates an activity of 
a target protein substrate is not taught or suggested in Guinto et al There is no evidence in 



substrate, nor any teaching or suggestion of a method of screening to identify new protease 
muteins engineered to have this activity towards particular target proteins involved in a 
disease or pathology. 

Notwithstanding this, Gvdnto et al only is directed to elucidation and mutagenesis of 
the amino acid residue 225. There is no teaching or suggestion of any other residues that are 
specificity determinants in serine proteases that, when mutated, can alter the cleavage activity 
and/or substrate specificity towards a target protein involved in a disease or pathology. In 
particular, there is no teaching or suggestion that any of amino acid residues 58 and 64, 97, 
98, 99, 100, 171, 174, 180, 189, 190, 191, 192, 215, 217, 218 and 226 by chymotrypsin 
numbering, are specificity determinants. Hence, there is no teaching or suggestion in Guinto 
et al. that a method of screening a library of protease muteins containing mutations at any of 
the above residues could be used to identify a protease mutein that has an increased cleavage 
activity and/or substrate specificity for a target protein involved in a disease or pathology, 
and that by virtue of that is identified as inactivating an activity of the target protein. 

In sum, Guinto et al. fails to render the claims obvious because it does not teach or 
suggest all limitations of the claims. For example, Guinto et al. does not teach or suggest 
producing a library of protease muteins that have mutations at amino acid residues 58 and 64, 
97, 98, 99, 100, 171, 174, 180, 189, 190, 191, 192, 215, 217, 218 and 226 by chymotrypsin 
numbering. Guinto et al also does not teach or suggest a step of testing an identified 
protease mutein for inactivation of an activity of the target protein. Since a prima facie case 
of obviousness requires that the reference teach or suggest all elements as claimed, the 
Examiner has failed to set forth a case of prima facie obviousness. 

Rebuttal to Examiner^s Comments 

1) In reply to Applicant's argument in the last response that the target proteins cleaved 

by the mutant thrombins are activated and not inactivated, the Examiner state in reply: 

Guinto teaches above inhibitors, /.e., inactivation of the target protein as 
claimed. The resultant effect of said inhibition would inherently ameliorate a 
disease or pathology to which the target protein is involved. The function of the 
target protein involvement with the numerous diseases is irrelevant to the 
screening protease that binds or inhibits said target protein. 



Guinto et al of any protease that hydrolyzes and inactivates an activity of a protein target 
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It is respectfully submitted that it is not clear to Applicant what "inhibitors" the Examiner 
is referring to in the teachings of Guinto et aL Nevertheless, in the arguments above and in the 
last response mailed July 20, 2010, Applicant stated that Guinto et al does not teach that the 
protease mutants mutated at position 225 inactivate an activity of any target protein. Rather, 
thrombin, and the thrombin mutants as taught in Guinto, activate the target substrates; Guinto 
et aL teaches that mutation of residue 225 results in reduced catalytic activity and presumably 
reduced ability to activate the target substrates. Since inactivation of a protein target 
substrate is not an inherent activity of hydrolysis by a protease, the Examiner has no basis to 
conclude that Guinto et aL inherently teaches inhibition of target proteins. There is no 
teaching or suggestion in Guinto et aL that hydrolysis and cleavage of a thrombin target 
protein, such as protein C, results in inactivation of an activity of protein C. Guinto et aL 
does not teach the thrombin protease mutants are inhibitors, nor does it teach that the 
thrombin protease mutants inactivate an activity of any of its target substrates. 

2) In reply to Applicant's argument in the last response that Guinto et aL fails to 

disclose any of the recited target proteins in the claims that are a "cell surface molecule that 

transmits an extracellular signal for cell proliferation, a cytokine, a cytokine receptor and a 

signaling protein that regulates apoptosis," the Examiner again refers the Applicant to the 

previous comment about the preamble. In reply to this. Applicant refers the Examiner to the 

Rebuttal 1) above on page 28. 

VIL THE REJECTION OF CLAIMS 1-7, 9, 11-16, 45, 48, 51-53, 56-59, 61-63, 
AND 65-78 UNDER 35 U.S.C. §103 

Claims 1-7, 9, 1 1-16, 45, 48, 51-53, 56-59, 61-63, and 65-78 are rejected under 35 
U.S.C. § 103(a) as being unpatenable over Lien et aL (Combinatorial Chemistry and High 
Throughput Screeing, 1999, 2:73-90) in view of either Harris et aL I (J. Biol. Chem., 
273:27364-27373 (1998) or Harris et aL II (Current Opinion in Chemical Biology, 2:127-132 
(1998)) and Waugh et aL (Nature Structure Biology). The Examiner states that while Lien et 
aL does not disclose the enzyme as granzyme and the substrate as caspase, Harris et aL I 
teaches the substrate specificity of granzyme B and that caspases are likely substrates, and 
teaches that Arginine 192 is a structural determinant of specificity of granzyme B; Harris et 
aL II teaches the same method as Harris et aL I; and Waugh et aL teaches that granzymes are 
a vital component of the ability to induce apoptosis. Thus, the Examiner concludes that it 
would have been obvious to one of ordinary skill to use the serine protease granzyme and the 
target protein caspase in the method of Lien et aL 
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Applicant respectfully traverses this rejection. This rejection is similar to previous 
rejections made of record that have been addressed in responses filed August 7, 2008, June 
1 1 , 2009 and July 20, 20 1 0, each of which are incorporated by reference herein. 

Relevant Law 

• * 

The case law related to obviousness is set forth in previous responses filed August 7, 
2008, June 11, 2009 and July 20, 2010, each incorporated by reference herein. 
The Rejected Claims 

The rejected claims are set forth above. It is noted that, as amended, all claims recite 
that the step of producing a library of protease mutein, the protease muteins in the library 
contain mutations in a serine protease scaffold or in a particular recited serine protease 
scaffold at an amino acid position selected from among amino acid residue between 58 
and 64, 97, 98, 99, 100, 171, 174, 180, 189, 190, 191, 192, 215, 217, 218 and 226 by 
chymotrypsin numbering. 

Analysis 

As discussed above and in previous responses of record. Lien et ah does not teach or 
suggest a method including all limitations as instantly claimed. For example. Lien et al does 
not teach or suggest a method using a protease library, whereby the library contains protease 
muteins having mutations in a serine protease scaffold at an amino acid residue between 58 
and 64, 97, 98, 99, 100, 171, 174, 180, 189, 190, 191, 192, 215, 217, 218 and 226 by 
chymotrypsin numbering. Lien et al also does not teach or suggest any method of 
identifying a protease that inactivates an activity of a target protein involved in a disease or 
pathology, since contrary to the Examiner's comments throughout this response, inactivation 
of an activity of a target protein is not inherent to hydrolysis of that protein by a protease. 
None of Harris et aL I, Harris et al II or Waugh et al, , singly or in any combination, cures 
these defects. 

Harris et al I is directed to a study of the mechanism by which granzyme B mediates 
apoptosis of target cells and demonstrates that granzyme B displays extended substrate 
specificity and proposes a model by which granzyme B acts. In this context, for example, 
while Harris et al I teaches two mutants of granzyme B at amino acid residue 192, Harris et 
al teaches that mutation of amino acid residue 1 92 in granzyme B reduces its hydrolysis of 
an optimal tetrapeptide substrate. Harris et al does not teach or suggest employing 
granzyme B nor any protease as a scaffold in a method for generating and identifying 
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proteases that inactivate an activity of a target protein to produce a candidate therapeutic. 
Thus, Harris et al is of no relevance to the instant claims. 

Further, there is no teaching or suggestion in Harris et al I of any protease mutants 
that exhibit increased cleavage activity and/or substrate specificity for a target protein 
involved in a disease or pathology as required by the instant claims. Indeed, not only does 
Harris et al teach that the granzyme B mutants exhibit reduced hydrolysis of the synthetic 
substrate, the synthetic substrate is not a target protein involved in a disease or pathology, in 
particular any target protein recited in the instant claims. Also, Harris et al I does not teach 
or suggest any method that includes as a step testing the identified protease mutant for 
inactivation of an activity of the target protein. In this regard, Harris et al teaches that 
caspase is a target substrate of granzyme B. Like the coagulation pathway, the apoptosis 
pathway involves a proteolytic cascade, whereby proteases act as catalysts to activate 
downstream molecules. For example, granzymes such as granzyme B cleave caspases, 
thereby altering the conformation and leading to their activation. Specifically, granzyme B 
cleaves procaspase-3 to its activated caspase-3 form, which itself performs further 
downstream functions in the apoptosis pathway. For example, Harris et al I teaches at page 
27364, 2"^ column: 

Although granzyme B is the only known mammalian serine protease to have 
P 1 -proteol5^ic specificity, it is shared with the caspases, a family of cysteine 
proteases that are also activated during apoptosis. The link between granzyme 
B and the caspases heis been strengthened by studies indicating that granzyme 
B can cleave and activate certain members of the caspases, and it has been 
suggested that this is one of the mechanisms by which granzyme B mediates 
apoptosis in vivo, [emphasis added] 

Harris et al II is even of less relevance to the instant claims. Harris et al II broadly 
teaches methods of engineering the substrate specificity of proteases. Harris et al does not 
identify any of residues 58 and 64, 97, 98, 99, 100, 171, 174, 180, 189, 190, 191, 192, 215, 
217, 218 and 226 by chymotrypsin numbering as specificity determinants in serine proteases, 
nor suggest their modification. Hence, Harris et al II does not teach or suggest a method 
using a protease library, whereby the library contains protease muteins having mutations in a 
serine protease scaffold at an amino acid residue between 58 and 64, 97, 98, 99, 100, 171, 
174, 180, 189, 190, 191, 192, 215, 217, 218 and 226 by chymotrypsin numbering. Harris 
et al II also does not teach or suggest any method of identifying a protease that inactivates an 
activity of a target protein involved in a disease or pathology. Thus, Harris et al II, singly or 
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in combination with Lien et al. or Harris et al I, does not cure the deficiency in rendering the 

claims prima facie obvious. 

Finally, Waugh et al also is of little relevance to the instant claims. Waugh et al 

teaches that granzymes are involved in inducing apoptosis by acting on downstream 

substrates such as caspases by activation cleavage. Waugh et al also teaches the elucidation 

of the molecular determinants of substrate specificity in granzyme B. In particular, the first 

paragraph of Waugh et al teaches: 

Granzymes are a vital component of the cytotoxic lymphocyte's ability to 
induce apoptosis, contributing to rapid cell death of a tumor or virally infected 
target cell by the cleavage of downstream substrates and the activating 
cleavage of caspases. [emphasis added] 

Thus, Waugh et al is directed to the normal function of granzyme to activate caspases to 
induce apoptosis. Waugh et al does not teach or suggest any protease that inactivates an 
activity of a target protein, in particular a target protein involved in a disease or pathology, 
nor a method of identifying such a protease mutein as instantly claimed. Further, while 
Waugh et al teaches the elucidation of the molecular determinants of substrate specificity in 
granzyme B, Waugh et al does not teach or suggest that mutation of any of these residues 
would alter substrate specificity, in particular substrate specificity for a target protein 
involved in a disease or pathology. Waugh et al does not teach or suggest any mutants of 
granzyme B. There is no teaching or suggestion in Waugh et al of a method of generating 
mutants of granzyme B, such as a library of protease mutants as claimed that contain 
mutations at an amino acid residue between 58 and 64, 97, 98, 99, 100, 171, 174, 180, 189, 
190, 191, 192, 215, 217, 218 and 226 by chymotrypsin numbering. There is no teaching or 
suggestion in Waught et al of any method of screening for mutant proteases to identify a 
protease that has increased cleavage activity and/or substrate specificity for a target protein 
involved in a disease or pathology that includes a step of testing the identified protease 
mutants for inactivation of an activity of the target protein. Since Waugh et al also does not 
teach or suggest a library of protease mutants at any of the noted residues that are specificity 
determinants, and does not teach or suggest testing any mutants for inactivation of an activity 
of a target protein involved in a disease or pathology, Waugh et al , singly or in combination 
with Lien et al , Harris et al I or Harris et al II, does not teach or suggest the claimed 
method. 

Hence, the Examiner has failed to set forth a prima facie case of obviousness because 
none of Lien et al , Harris et al I, Harris et al II and/or Waugh et al , singly or in any 
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combination thereof, teaches all elements as claimed. Each of independent claims 1, 53, 59 and 
63 is directed to a method of producing and identifying a protease mutein that cleaves and 
inactivates a target protein involved in a disease or pathology, including steps of: 1) producing a 
library of protease muteins each having N mutations at an amino acid residue between 58 and 
64, 97, 98, 99, 100, 171, 174, 180, 189, 190, 191, 192, 215, 217, 218 and 226 by 
chymotrypsin numbering, relative to a particular recited serine protease scaffold; 2) measuring 
the cleavage activity and/or substrate specificity of at least two members in the library for a 
substrate sequence in a target protein involved in a disease or pathology; 3) identifying those 
protease muteins in the library that have increased cleavage activity and/or altered substrate 
specificity relative to a wild-type scaffold protease; and 4) testing the identified proteases for 
cleavage and inactivation of an activity of the target protein that contains the substrate sequence 
to identify a protease mutein or a biologically active portion thereof that inactivates an activity 
of a target protein that is involved with the disease or pathology. Li particular, the cited 
references, singly or in combination, do not teach or suggest a method that includes a step of 
producing a library of protease muteins containing mutations at the recited specificity 
determinants, measuring and identifying protease muteins that have increased cleavage activity 
and/or substrate specificity for a substrate sequence in a target protein or a target protein 
involved in a disease or pathology, or a step of testing identified protease muteins for 
inactivation of an activity of the target protein. Since prima facie obviousness requires that the 
references, singly or in any combination, teach or suggest all elements as claimed, the Examiner 
has failed to set forth a case of prima facie obviousness. 
Rebuttal to Examiner's Comments 

1) In reply to Applicant's argument in the previous response that Harris et al teaches that 
granzyme B activates caspase, and thus does not teach or suggest a method including a step of 
testing for inactivation of an activity of the target protein, the Examiner points towards the 
portion of claim 1 that recites "regulates" apoptosis and states that this encompasses 
"activation." The Examiner continues, however, to state that the limitation in claim 1 of the 
target protein is in the preamble and is not given any weight. 

First, Applicant's argument that Harris et al teaches that granzyme B activates caspase 
goes to the limitation in the claims of step e) "testing the identified protease(s) or biologically 
active portion thereof for cleavage and inactivation of an activity of the target protein that 
contains the substrate sequence, thereby identifying a protease mutein or a biologically active 
portion thereof that inactivates an activity of a target protein that is involved with the disease 
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or pathology." Harris et al I, which only is directed to a study of the function of granzyme B 
to activate caspase, does not teach or suggest any method for screening prtoeases, including 
any method that includes a step of testing or identifying a protease that inactivates an activity 
of a target protein. There is no teaching or suggestion in Harris et al I for preparing a library 
that contains mutant granzyme B enzymes, nor that granzyme B could ever be engineered to 
inactivate an activity of any target protein. 

For this reason, the Examiner's reference to the portion of claim 1 that recites 
"regulates apoptosis" is misplaced. This portion of the claim is a limitation on the particular 
target protein that is used in the method. For example, a caspase is a target protein that 
regulates apoptosis and could be used in the instant method. The limitation on what the 
target protein is, is not relevant to the endpoint of the method of testing and identifying a 
protease mutein for inactivation and activity of that target protein. 

Further, as noted above, the claims are amended herein to render it clear that the 
limitation on the target protein is not part of the preamble, but are explicit limitations of the 
claimed method. Applicant respectfully requests that the Examiner consider and give all 
recited limitations weight. 

2) The Examiner refers to the Examples to state that the findings of the instant 
application are the same as Harris et al , since the instant application also shows that an 
exemplified protease mutein activates caspase-3. For example, the Examiner cites page 53, 
lines 18-20 of the instant application, which states: "at low concentrations the mutant 
activates caspase-3 by cleaving at the activation sequence (SEQ ID NO:4). 



It respectfully is submitted that the Examiner must review the results in the 
Example as a whole. Picking and choosing only portions of the Example is not 
representative of the results depicted in the Example. In this case, a complete review of the 
results in Example 1 1 exemplifies that the exemplified protease granzyme B mutein 
I99A/N2 1 8 A inactivates the activity of caspase by cleavage at an identified inactivation 
sequence at residues 260-265 (SEQ ID NO:2) (see e.g. at page 52, lines 6-7). In contrast to 
cleavage of the activation sequence that is set forth in SEQ ID .NO:4, which is the normal 
sequence cleaved by wild-type granzyme B to activate caspase, cleavage of this inactivation 
sequence in caspase-3 set forth in SEQ ID NO:2 inactivates an activity of caspase-3. Thus, 
the Example compares the activity of wildtype granzyme B, (which normally cleaves that 
activation sequence in caspase-3 set forth in SEQ ID NO:4) and the mutein I99A/N218A for 
inactivation cleavage and inactivation of an activity of caspase-3. 
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The Example states that the mutant granzyme B I99A/N2 1 8 A is able to cleave 
caspase-3 at the inactivation sequence as represented by a shift in cleavage product of the 
appropriate size for the cleaved peptide, while "wild-type granzyme B does not cleave the 
peptide" (see e.g. at page 52, lines 16-27). The Example further describes that the cleavage 
of the inactivation sequence results in cleavage of the small subunit of caspase-3 (see e.g. at 
page 52, line 28 to page 53, line 2). Further, the Example describes that the mutant can 
cleave and inactivate an activity of full-length caspase-3 as assessed by assaying the activity 
of caspase-3 for its substrate Ac-DEVD-AMC (see e.g. at page 53, lines 3-7). The results in 
the Example show that the mutant granzyme B dramatically inactivates the activity of 
caspase-3 with a Vmax in the presence of the mutant granzyme B of approximately zero (see 
e,g. at page 53, lines 7-13). The Example further describes that the mutant granzyme B also 
inhibits an activity of caspase-3 as assessed in assays assessing apoptosis. While the 
Example shows that at low concentrates the mutant activates caspase-3 by cleaving the 
activation sequence, the result states that "at high concentrations it inhibits caspase-3 by 
cleaving at the inactivation sequence" (see e.g, at page 53, lines 14-24). Finally, the 
Example demonstrates that the mutant granzyme B can effectively antagonize caspase- 
induced activation by wildtype granzyme B (see e.g, at page 53, line 25 to page 54, line 3). 
For example, the Example states that "as shown in Figure 7B, the mutant granzyme B 
antagonized the effect of wildtype granzyme B to induce apoptosis by inactivating the 
caspase-3." 

Thus, the Examples clearly demonstrates practice of the method as claimed by 
generating a protease mutant; measuring the cleavage activity and/or substrate specificity for 
a substrate sequence in a target protein (here, the inactivation cleavage sequence in caspase-3 
as set forth in SEQ ID NO:2) and identifying a mutant that has increased cleavage or 
substrate specificity therefor; and testing the identified rriutant for inactivation of an activity 
of the target protein (here, the activity of caspase-3 to cleave is substrate Ac-DEVD-AMC or 
the activity of caspase-3 to induce apoptosis). This is in direct contrast to the findings in 
Harris et al. 

VIIL THE REJECTION OF CLAIMS 1-7, 9, 11-16, 45, 48, 51-53, 56-59, 61-63, 
AND 65-78 FOR NONSTATUTORY DOUBLE PATENTING 

Claims 1-7, 9, 11-16, 45, 48, 51-53, 56-59, 61-63, and 65-78 are provisionally 
rejected on the ground of nonstatutory obviousness-type double patenting as being 
unpatentable over claims 1-10 of copending Application No. 12/005,949 ('949) application. 
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As stated in the response mailed July 20, 2010 and previous responses. Applicant 
requests deferral of resolution of this issue. It is not possible to assess whether claims at 
allowance in each application will overlap requiring a terminal disclaimer until there is an 
indication of allowable subject matter in at least one application. It is premature to file a 
terminal disclaimer at this time. If, when one or both applications is deemed allowed, it is 
determined that a terminal disclaimer is necessary. Applicant will file a terminal disclaimer. 



Consideration of the above remarks, entry of this amendment and continued 
examination of the application on the merits respectfully are requested. 

Respectfully suymitted. 
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Apoptosis (programmed cell death) serves an impor- 
tant role in the normal morphogenesis , immunoregu- 
lation, and homeostatic mechanisms in both normal 
and neoplastic ceUs. Caspase-3/CPP32 , a member of 
the ICE/Ced-3*£Eunily of cysteine proteases, is an im- 
portant downstream mediator of several complex 
proteolytic cascades that result in apoptosis in both 
hematopoietic and nonhematopoietic cells. Previous 
studies have demonstrated that caspase'3 is com- 
monly expressed in classical Hodgkin's disease 
CCHD); however, the biological significance of its ex- 
pression in Hodgkin's disease is unknown. In this 
report, the expression of ca$pase-3 in nodular lym- 
phocyte predominance Hodgkin's disease (NLPHD) 
was evaluated by immunohistochemistry; in addition, 
we investigated the role of caspase-3 in CD95 (Fas)- 
mediated apoptosis in three CHD cell lines. Formalin- 
fixed, paraffin-embedded tissue sections from 11 
cases of NLPHD were immunostained for caspase-3 
using a polyclonal rabbit antibody that detects both 
the 32-kd zymogen and the 20-kd active subunit of the 
caspase-3 protease. Only 1/11 cases of NLPHD dem- 
onstrated caspase-3 immunoposltlvity in lymphocyt- 
ic/histiocytic cells. Caspase-3 expression was also 
evaluated in three CHD cell lines, HS445, L428, and 
KMH2. Whereas caspase-3 expression was detected in 
HS445 and L428 cell lines, no expression was found 
in KMH2 cells by immunohistochemlcal staining. 
Treatment of HS445 and L428 ceU lines for 72 hours 
with agonistic CD95 monoclonal antibody induced 
marked apoptosis that was significandy inhibited by 



pretreatment with the caspase-3 inhibitor, DEVD- 
FMK, as determined by terminal deoxynucleotidyl 
transferase-mediated dUTP nick end-labeling assay 
and flow cytometric analysis of 7-amino-actinomycin 
D staining. In addition, a significant increase in 
caspase-3 activity as determined by an enzyme color- 
imetric assay was detected in HS445 and L428 ceUs 
after 48 hours of CD95 stimulation. In marked con- 
trast, treatment of caspase-3-deficient KMH2 cells 
with anti-CD95 mAb did not demonstrate an increase 
in caspase-3 activity or induce apoptosis. These data 
demonstrate caspase-3 is important for CD95-niedi- 
ated apoptosis in CHD cell lines. In addition, the 
majority of NLPHD cases examined in this study failed 
to express detectable levels of caspase-3 , suggesting 
these tumor cells may be resistant to apoptotic stimuli 
dependent on caspase-3 activity. Furthermore, these 
data suggest the differential expression of caspase-3 
noted between NLPHD and CHD may provide addi- 
tional evidence that each is a unique disease entity. 
(Am J Pathol 1999, 154:1439-1447) 

increased understanding of tlie pliysiological and patho- 
logical processes of programmed cell death, or apopto- 
sis, at the molecular level will provide insights into carci- 
nogenesis and potentially create new opportunities for 
development of novel prognostic markers and therapeu- 
tic tools for the treatment of various neoplasms. One of 
the earliest cell death-regulating genes to be identified 
was the proto-oncogene Bcl-2. an apoptosis inhibitor that 
appears to block a step in an evolutionariiy conserved 
pathway involved in apoptosis. Subsequent investiga- 
tions led to the isolation of a homologue of Bcl-2 in the 
nematode Caenorhabitis elegans. This homologue, called 
Ced-9, is necessary for the survival of all cells in this 
organism.^ Ced-9 opposes the actions of two cell death- 
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promoting genes, Ced-3 and Ced-4, which are critical for 
apoptosis in C. efegans^ The gene product of Ced-3 
demonstrates homology to the mammalian interleukin- 
1 j3-converting enzymes (ICE), a group of cysteine pro- 
teases.^ Ced-4 Is thought to be homologous to Apaf-1, a 
mammalian protein that can associate with several death 
proteases to promote apoptosis.® 

To date at least 13 members of the ICE/Ced-3 family 
(caspases) have been identified, the majority of which, on 
activation, are involved In the induction and execution 
phases of apoptosis/*® Of these cysteine proteases, 
caspase-3 (CPP32. Yama. apopain) is believed to be one 
of the most commonly involved in the execution of apo- 
ptosis in various cell types/ On cleavage by other 
caspases, caspase-3 gives rise to two active subunits 
with molecular masses of 17-20 kd and 10-12 kd/-® 
These subunits assemble to form an enzymatically active 
tetrameric complex.^ Activation of caspase-3 has been 
described in a number of cell types undergoing apopto- 
sis induced by a variety of stimuli, including CD95 (Fas/ 
Apo-1} signaling. ''^•''^ 

CD95, a cell surface protein receptor belonging to the 
tumor necrosis factor (TNF)/nerve growth factor receptor 
family, is an important molecule in the induction of apo- 
ptosis In both hematopoietic and nonhematopoietic 
cells.®"^^ Mutations in the gene that codes for CD95 have 
been linked to the development of autoimmune disease 
and lymphoproliferatlve disorders In both humans and 
animal models.^®"''® Previous studies demonstrated that 
crossllnking of the CD95 receptor on the cell surface by 
agonistic antibody or by its ligand, CD95L, induced ap- 
optosis that was dependent on caspase activation. ''^"^ 
Furthermore, the inhibition of CD95-mediated apoptosis 
by blocking proteolysis of caspase-3 by viral proteins is 
suggested to play a role In the pathogenesis of various 
neoplasms/°*''"''^^'''^ 

The rote of caspases, including caspase-3. applied to 
apoptotic processes in Hodgkin's disease is currently 
undefined. In this report, we demonstrate caspase-3 
plays an important role in CD95-mediated apoptosis in 
classical Hodgkin's disease (CHD) cell lines. Further- 
more, we demonstrate that nodular lymphocyte predom- 
inance Hodgkin's disease (NLPHD) lacks caspase-3 ex- 
pression by Immunophenotypic analysis. The lack of 
caspase-3 expression in NLPHD may contribute to the 
development and pathogenesis of this disease by imbu- 
ing tumor cells with resistance to caspase-3-dependent 
apoptotic pathways. 



Materials and Methods 

Case Selection, Histological Examination, and 
Immunohistochemistry of NLPHD 

Formalin-fixed, paraffin-embedded tissue sections from 
11 cases of NLPHD were selected from the surgical 
pathology files of Loyola University Medical Center and 
the University of Michigan Medical School for immuno- 
hlstochemlcal determination of caspase-3. Diagnosis of 
NLPHD was performed using established criteria on 



Table 1. Antibodies Used in Immunohistochemtcal Staining 
of NLPHD 



Antibody 


Source 


Dilution 


LCA (PD7/26/16 


DAKO (Carpinteria, CA) 


1:50 


and 2B11) 






CD30 (Ber-H2) 


DAKO 


1:40 


CD20 (L26) 


DAKO 


1:100 


EMA (E29) 


DAKO 


1:100 


CD 15 (Leu-Mi) 


Becton Dickinson, (San 


1:50 




Jose. CA) 




CD45RO (A6) 


Zymed Laboratories (San 


1:50 




Francisco, CA) 





lymph node biopsy histology and immunohistochemis- 
try 24.25 NLPHD was diagnosed by the finding of typical 
nodular architecture and lymphocytic/histiocytic (L&H) 
cells with the appropriate CD20- and CD45RB-positlve 
immunophenotype. 

Morphology assessment of NLPHD cases was per- 
formed on 4-;jLm tissue sections with hematoxylin-eosin. 
Immunoperoxidase staining of lymph node sections with 
the antibodies listed in Table 1 was performed using a 
Ventana 320 automated stainer (Ventana Medical Sys- 
tems. Tucson, A2) and a streptavidln/horseradish perox- 
idase detection kit (Ventana), with microwave antigen 
retrieval and trypsin pretreatment used as necessary. 
The chromogen was 3,3'-diaminobenzidine tetrahydro- 
chloride (DAB). 



Cell Lines 

The CHD cell lines KMH2, L428. and HS445 were used in 
this study. KMH2 and L428 cell lines were obtained from 
the German Collection of Microorganisms and Cell Cul- 
tures (Braunschweig, Germany). HS445 and Jurkat cell 
lines were obtained from the American Type Culture Col- 
lection (Manassas, VA), Cell lines were cultured in RPMl 
1640 (Gibco-BRL. Grand Island, NY) supplemented with 
20% (v/v) heat-Inactivated fetal bovine serum (Sigma 
Chemical Co.. St. Louis, MO), 2 mmoI/L L-glutamine 
(GIbco-BRL), 25 mmol/L Hepes (Sigma), and antlbiotic- 
antimycotlc solution (Sigma). All cell lines were main- 
tained at 37°C in a humidified incubator at 5% COg. 



Immunohistochemical Analysis for Caspase-3 
Expression 

Four-micron-thIck formalin-fixed, paraffin-embedded tis- 
sue sections from each case of NLPHD were deparaf- 
finlzed in xylene, hydrated in graded alcohol, and pre- 
treated for antigen retrieval in 10 mmol/L citrate buffer, 
pH 6.0. for 10 minutes. Cytospins from CHD cell lines 
were fixed in a 1 :1 mixture of acetone and methanol for 10 
minutes. Staining was performed using polyclonal rabbit 
anti-human CPP32 (1:200 titer. DAKO Corp.. Carpinteria, 
CA) and a Vectastain ABC peroxidase, rabbit IgG detec- 
tion kit (Vector Laboratories. Burlingame, CA) with 
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3-amino 9-ethyl carbazole (AEC) as the chromogen. The 
chromogen DAB was used for paraffin-embedded spec- 
imens. Formalin-fixed, paraffin-embedded tissue sec- 
tions from three cases of caspase-3-posltive nodular 
sclerosis Hodgkin's disease and a reactive tonsil were 
used as positive controls for caspase-3 staining. 



Apoptosis Induction and Detection 

For apoptosis assays, 1 x 10^ cells from each cell line 
were cultured in 24-well tissue culture plates (Falcon. 
Lincoln Park, NJ) and incubated with 500 ng/ml of ago- 
nistic anti-CD95 monoclonal antibody (mAb) (clone 
CH11. mouse IgM, Upstate Biotechnology. Lake Placid, 
NY) for indicated time periods, with or without 1 hour 
preincubation with 10 /xmol/L caspase-3 peptide inhibitor 
Ac-Asp-Glu-Val-Asp-fluoromethyl ketone (DEVD-FMK, 
Clontech, Palo Alto, CA). 

Detection of apoptosis in CHD cell lines by terminal 
deoxynucleotidyl tranferase-mediated dUTP nick end-la- 
beling (TUNEL) was quantitated using the ApopTag in 
situ apoptosis peroxidase detection kit (Oncor, Gaithers- 
burg, MD). Cytospin preparations of cells were fixed in 
1% formaldehyde for 15 minutes followed by 1 hour fix- 
ation in 70% ethanol at -20'*C. After a brief wash In FA 
buffer (DIfco Laboratories, Detroit, Ml), each slide was 
incubated at room temperature (RT) for 10 minutes with 
equilibration buffer followed by 1 hour incubation at 37''C 
with TdT enzyme (or deionized water (dHsO) for negative 
controls) diluted with the reaction buffer. The TdT reac- 
tion was stopped with stop/wash buffer and each spec- 
imen was briefly washed with FA buffer before 30 minute 
incubation with anti-digoxigenin-peroxldase at RT. After a 
series of washes with FA buffer, each slide was devel- 
oped with DAB/hydrogen peroxide (Sigma) color sub- 
strate for 6 minutes at RT. All slides were counterstained 
with hematoxylin. A CD95-sensitive Jurkat T cell line was 
used as a positive control for apoptosis. A positive reac- 
tion for apoptosis was characterized by brown/black col- 
oration of the nuclear or perinuclear region of the cell. 
Apoptotic cells were quantitated by 1000-cell count at 
400 X magnification. 

The 7-Amino Actinomycin D (7-AAD) staining method 
to measure cell viability was performed per manufactur- 
er's protocol using Via-Probe 7-AAD (PharMingen, San 
Diego, CA). Briefly, anti-CD95 mAb-treated and un- 
treated cells (1x10® cell/ml) were washed twice in cold 
PBS and resuspended in lx binding buffer (10 mmol/L 
Hepes/NaOH (pH 7.4), 140 mmol/L NaCL, and 2.5 
mmol/L CaCy. Resuspended cells were then incubated 
for 20 minutes at 20-25°C in the dark with 5 /ml of 7-AAD. 
Samples (30,000 events per sample) were then quanti- 
tated on an Epics XL-MCL flow cytometer (Coulter, Miami 
Lakes, FL), recorded in LIST mode, and registered on 
logarithmic scales. 7-AAD emission was detected in the 
FL-3 channel (>650 nm). Analysis was performed using 
Coulter System II software. 



Determination of Caspase-S Activity in 
Cell Lines 

Caspase-3 activity was determined using the ApoAlert 
CPP32/Caspase-3 colorimetrlc assay kit (Clontech). After 
a 48-hour Incubation with anti-CD95 mAb, duplicate sam- 
ples of untreated and treated cells (2 x 10® cells) were 
washed in cold PBS, resuspended in 50 /x! cell lysis 
buffer, and incubated on ice for 10 minutes. Cell lysates 
were pelleted, followed by transfer of the supernatants to 
microcentrifuge tubes. Fifty microliters of 2x reaction 
buffer with 5 mmol/L DTT and 5 m' of 1 mmol/L DEVD-p- 
nitroanillde (pNA)-conjugated CPP32 substrate were 
added to each tube, followed by 1 hour incubation in a 
water bath at 37°C. A control reaction of treated cells 
without DEVD-pNA was included. Optical density (OD) 
for each specimen was determined at 405 nm using 
the EL 312e microplate reader (Bio-Tek Instruments. 
WinooskI, VT). For quantification of protease activity, sam- 
ple values were plotted on a calibration curve derived from 
the OD values obtained from each of five standards (range: 
0-20 nmole pNA). For each sample, units of CPP32 activity 
were determined by the following formula: 

Units of CPP32 activity = (AOD) 

X (calib. Curve slope) ~^ x (nmole pNA/OD) 

where AOD is the change in optical density from the 
control reaction without conjugated substrate. 



Results 

Histology and Immunohistochemical 
Characterization of NLPHD Cases 

All 11 specimens demonstrated architectural changes 
consistent with NLPHD and were composed predomi- 
nantly of large nodules with focal areas of diffuse efface- 
ment. Typical nodules contained characteristic L&H cells 
intermixed in a background of small lymphocytes and 
occasional epithelioid histiocytes separated by com- 
pressed intervening paracortical areas composed of 
small lymphocytes and scattered plasma cells. 

In all specimens, L&H cells demonstrated positive 
staining for CD20 and LCA. In seven specimens, L&H 
cells expressed EMA, with one case also expressing 
CD30. In all cases, L&H cells were negative for CD15 and 
CD45RO. 



Most NLPHD Cases Failed to Express 
Caspase-3 by Immunohistochemistry 

In 10 of 11 cases of NLPHD, including the case which 
expressed CD30, L&H cells were negative for caspase-3 
expression by immunohistochemical staining as repre- 
sented in Figure 1 A. In one case, caspase-3 immunopos- 
itivity was detected in the cytoplasm in a few scattered 
L&H cells (Figure IB). In contrast, caspase-3 expression 
was demonstrated both in Hodgkin Reed-Sternberg 
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Figure 1 . lliis L&H cell Carrow), as seen in the majority of NLPHD cases, was immunohistochemically negative for caspase-3, whereas scattered plasma cells and 
l^phocytes expressed ca$pase-3 (A, DAB XIOCX)). A single L&H cell (arrow) from one case of NLPHD displayed cytoplasmic expression of caspase-3 (B, DAB 
X 1000). Control cases of nodular sclerosis Hodgkin's disease demonstrated diffuse caspase-3-immunoposilivity of HRS cells and intense F>osiiive immunosiaining 
of lymphocytes and plasma cells within tlie surrounding infiltrate (C, DAB X400). Reactive follicular centers in tonsil controls also displayed intense positive 
staining for caspase-3 (D, DAB, X200). 



(HRS) cells and in background lymphocytes in three 
cases of nodular sclerosis Hodgkin's disease (Figure 
1C). In addition, tonsil tissue positive controls demon- 
strated caspase-3 immunopositivity concentrated pre- 
dominantly In germinal center cells of secondary follicles 
(Figure 1D). 

Caspase-S Was Detectable in CHD Cell Lines 

Three CHD cell lines (HS445. L428. and KMH2) were 
analyzed for caspase-3 expression by immunohisto- 
chemistry. HS445 and L428 consistently demonstrated 
substantial cytoplasmic immunostalning for caspase-3 
(Figure 2 and data not shown). However, in contrast. 



repeated immunohistochemistry. assays failed to detect 
expression of caspase-3 in the KMH2 cell line (Figure 2). 

Caspase-3 Was Proteolytically Cleaved and 
Activated during CD95'Mediated Apoptosis in 
Caspase-3-Positive CHD Cell Lines 

Activation of the CD95 receptor by ligand or agonistic 
mAb is known to induce apoptosis with concomitant pro- 
teolytic cleavage and activation of caspases, including 
caspase-3, in CD95-positive neoplasms/®"^^ To investi- 
gate the effect of CD95 stimulation with potential activa- 
tion of caspase-3 in Hodgkin's disease, we examined the 
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Figure 2. Immunohisicxrhemical detection of caspase-3 on cytospin preparations of the L428 cell line (A) displayed strong cytoplasmic positive staining for 
caspase-3 (AEC, X400); however, the KMH2 cell line (B) failed to express caspase-3 (AEC, X400). Isotype control antibody staining was negative (data not 
shown). 



effect of agonistic CD95 mAb on CHD cell lines. The 
HS445 and L428 cell lines displayed a significant in- 
crease In apoptosis after 72 hours* treatnnent with anti- 
CD95 mAb as quantitated by both flow cytometric anal- 
ysis with 7-AAD (Figure 3) staining and the TUNEL assay 
(Figure 4, Table 2). CD95-lnduced apoptosis was signif- 
icantly inhibited in these cells by the caspase-3 peptide 
inhibitor. DEVD-FMK, as demonstrated by the decrease 
in number of apoptotic cells to nearly background levels 
(Figures 3 and 4, Table 2). In contrast, the KMH2 cell line 
demonstrated consistent resistance to anti-CD95 mAb 
treatment with no effect by the addition of DEVD-FMK 
(Figure 3 and Table 2). Resistance of KMH2 cells to 
CD95-mediated apoptosis was not due to lack of CD95 
expression, as all three cell lines expressed similarly 
high levels of CD95 as determined by flow cytometric 
staining.^® 

The significant inhibition of apoptosis by a caspase-3 
inhibitor In HS445 and L428 cells, and the lack of apo- 
ptosis Induced by caspase-3-deficient KMH2 cells sug- 
gests caspase-3 is important for CD95-medlated apopto- 
sis in CHD cell lines. To further substantiate caspase-3 
cleavage and activation in CD95-mediated apoptosis in 
CHD, each cell line was evaluated for changes In 
caspase-3 activity before and after treatment with anti- 
CD95 mAb using an enzyme colorimetric assay (Figure 
5). Forty-eight-hour treatment revealed approximately 
tenfold increases in caspase-3 activity in HS445 and 
L428 cells in contrast to no difference detected In treated 
KMH2 cells. Positive control CD95-sensitive Jurkat T cells 
displayed a fivefold increase in caspase-3 activity after 
24 hours of treatment with anti-CD95 mAb. 



Discussion 

Among the caspases identified in humans thus far, 
caspase-3 is probably one of the most relevant and best 
studied as regards to apoptosis in hematopoietic cells. 
Caspase-3 (CPP32, Yama. apopain) has been shown to 
be a key effector molecule in the downstream execution 
of various apoptotic stimuli.®""' Activated 
caspase-3 cleaves and inactivates many vital cellular 
proteins during apoptosis including kinases and proteins 
associated with cellular structure, cell cycle, and DNA 
repair. One such well characterized caspase-3 death 
substrate is poly(ADP-ribose) polymerase (PARP), an en- 
zyme involved in DNA repair, genome surveillance, and 
integrity.^^*^^ In addition, caspase-3 appears to indirectly 
activate endonucleases implicated in internucleosomal 
DNA cleavage by removing the negative regulatory effect 
of PARP.3^ 

The cleavage and activation of caspase-3 during ap- 
optosis has been well documented in neoplastic cells. 
Caspase-3 activation and subsequent cleavage of its 
substrates, protein kinase C-6 (PKC-S) and PARP, was 
demonstrated by chemotherapeutic drug treatment in 
human leukemic cell lines.^^ Cross-resistance to CD95- 
and chemotherapeutic drug-induced apoptosis due to 
lack of caspase activation including caspase-3 was dem- 
onstrated in a human acute T-cell leukemia line, CEM.^"' 
Also, MCF breast carcinoma cells lacking expression of 
caspase-3 were resistant to apoptotic stimuli.^^ Thus, the 
expression and activation of caspase-3 appears to be 
critical for the execution of various apoptotic stimuli in 
neoplasms. 
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Figure 3. Flow cytometric analysis of 7-AAD staining in anti-CD95 (Fas)-treated CHD cell lines. Increased cell death was observed after 72 hours of treatment with 
500 ng/ml agonistic CD95 mAb (CHll) in HS443 and L428 cell lines compared to untreated control cells. Pretreaimeni of cells with caspase-3 F>eplicle inhibitor, 
DEVD-FMK, significantly decreased cell death in ami-CD95-treated cells to near background levels. In contrast, no significant increase in cell death was observed 
after anti-CD95 treatment of KMH2 cells as compared to untreated cells. The xaxis represents fluorescence intensity (log scale) and the axis represents relative 
cell number. These data are representative of at least three separate experiments performed. 



Hodgkin's disease accounts for 14% of malignant lym- 
phomas. Currently, one-third of advanced Hodgkin's dis- 
ease patients are resistant to conventional therapies.^® 
Our knowledge of the expression and function of apop- 
tosis-related proteins such as caspases and how they 
may contribute to the pathogenesis and treatment of this 
malignancy is limited. Previous immunohistochemical 
studies in situ demonstrated that caspase-3 is commonly 
expressed in CHD.^^ However, the examination of 
caspase-3 expression in NLPHD has been limited. Fur- 
thermore, the overall biological significance of caspase-3 
in Hodgkin's disease is unknown. Therefore, in this study 
we examined the expression of caspase-3 in NLPHD and 
determined its functional significance in CHD cell lines. 

We first examined the in situ expression of caspase-3 
in NLPHD. By immunohistochemistry. we identi- 
fied caspase-3 immunoposltivity in scattered L&H cells 
from only 1 of 11 cases of NLPHD. These findings are in 



agreement with the study of Chhanabhai and colleagues, 
who found no expression of caspase-3 in L&H cells from 
6 cases of NLPHD.^^ In addition, these authors observed 
the HRS in the majority of cases of CHD were positive for 
caspase-3 expression.^'' These latter observations differ 
from the immunohistochemical findings of Xerri et a! in 
which only 3 of 16 cases of HRS of CHD (nodular scle- 
rosis and mixed cellularity type) were caspase-3-immu- 
nopositive.^® The reason for the difference in caspase-3 
expression in CHD noted between these groups is pres- 
ently unclear. 

Our Immunohistochemical analysis of HRS cells of three 
CHD cell lines revealed substantial expression of 
caspase-3 in HS445 and L428. but only weak expression in 
KMH2 cells. These findings concur with Western blot anal- 
ysis of these cell lines, which revealed expression of the 
32-kd zymogen form of caspase-3 in HS445 and L428, but 
virtually undetectable expression in KMH2 cells.^ 
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Anti-Fas + DEVD -FMK 




Figure 4. 'ITJNEL assay for apoptosLs. Compared to untreated cells (A), the L428 cell line displayed a considerable increase in apoptosis (dark brown/black cells) 
after 72 hour incubation wiih 500 ng/ml agonistic CD95 (Fas) mAb (B), Pretreatmeni of L428 cells with 10 pM DEVD-FMK decreased the number of apoptotic 
cells to near baseline levels (C). Slimulalion of KMH2 cells with anii-CD95 mAb with or without DEVD-FMK pretreatment showed no increase in apoptosis 
compared to untreated cells (data not shown). 



To address the biological significance of caspase-3 in 
Hodgkin's disease, we investigated the role of caspase-3 
in CD95-mediated apoptosis in CHD fines. After stimula- 
tion of the CD95 receptor by agonistic CD95 nnAb. sig- 
nificant apoptosis was induced in caspase-3 positive cell 
lines HS445 and L428 by TUNEL and 7-AAD assays. 
However, KMH2 cells, which virtually failed to express 
caspase-3 by innmunohistochennistry, were consistently 



Table 2. Apoptosis Rates induced in Anti-CD95 niAb 
Treated or Untreated CHD Cell Lines as 
Determined by TUNEL Assay 



CHD cell line 




Apoptosis* 




Untreated 


Anti-CD95 


Anti-CD95+ 
DEVD-FMK 


HS445 


7.5% 


20.5% 


9.9% 


L428 


7.8% 


30.2% 


10.8% 


KMH2 


1.0% 


1 .3% 


1.0% 



*1 X 10® cells were untreated or treated with 500 ng/ml anti-CD95 
mAb (CH11) for 72 hours with or without pretreatment with 10 /iM 
caspase-3 peptide inhibitor, DEVD-FMK. Apoptosis was quantitated by 
TUNEL staining as described in l\1aterials and Methods. 
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Figure 5. Caspa.se-3 activity in anti-CD93-treated CHD cell lines. Posiiive 
control for caspa.se-3 activity is demonstrated in Jurkat T cells by fivefold 
induction of caspase-3 aaivity after 24 hours of agonistic CD95 niAb treat- 
ment. HS445 and L428 cell lines had approximately ten-fold increases in 
ca.spa.se-3 activity after 48 hours' irealmeni with anii-CD95 mAb compared to 
untreated controls. In contrast, no significanl increase in caspase-3 activity 
was detected in antl-CD95-treated KMH2 cells when compared to untreated 
cells- These data are representative of tluee separate experiments. 



resistant to CD95 stimulation, suggesting that resistance 
to CD95-nriediated apoptosis in this cell line nnay be due to 
a deficiency of caspase-3^® (nnanuscript in preparation). 

To establish caspase-3 as a key mediator in CD95- 
induced apoptosis in CHD cell lines, enzyme assays 
specific for caspase-3 activity were performed. Approx- 
imately tenfold increases in caspase-3 activity were ob- 
served in HS445 and L428 after 48 hours' incubation with 
anti-CD95 mAb. compared to no increase in KMH2 cells. 
In addition, we pretreated each cell line with the 
caspase-3 peptide inhibitor DEVD-FMK before CD95 ac- 
tivation. Previous studies in other experimental systems 
have demonstrated that DEVD inhibitors have specificity 
for caspase-3 by bearing similarities to the cleavage site 
of the caspase-3 substrate, parp.^-^^'2^-3"''39-^i The ad- 
dition of DEVD-FMK to cultures of HS445 and L428 sig- 
nificantly decreased CD95-mediated apoptosis; how- 
ever, there was no effect on KMH2 cells. These findings 
in CHD lines correlate with previous studies which dem- 
onstrated caspase-3 is proteolytically cleaved and acti- 
vated and plays a key role in CD95-mediated apoptosis 
in other experimental systems.®"''^'^^'^''*^ However, it 
should be noted that CD95-induced apoptosis may occur 
without activation of caspase-3, suggesting the existence 
of alternate apoptosis execution pathways in response to 
CD95 signaling.'*^ 

Most investigations related to apoptosis in Hodgkin's 
disease have focused on the expression of mitochondrial 
apoptosis regulatory proteins Bcl-2, Bcl-x, and Bax.^^""*^ 
These studies revealed variable expression of 801-2^"*"^^ 
but frequent expression of the pro-apoptotic protein 
Bax'*'' and the apoptosis antagonist BcI-Xl.'*®"''® Previous 
investigations of CD95 expression by HRS cells have 
been limited; however, these studies revealed that CD95 
is expressed on HRS cells in the majority of cases of 
CHD.^°-^ In this report, we demonstrate that CHD cell 
lines expressing CD95 can undergo apoptosis by CD95 
stimulation. A recent study assayed CD95-lnduced apo- 
ptosis in fresh tissue samples with Hodgkin's disease: 
however, the HRS cells were not specifically analyzed.^^ 
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Because HRS cells usually constitute less than 1% of 
involved tissue, it is difficult to assay CD95 stimulation of 
HRS ceils directly without selective separation. 

The absence of caspase-3 expression In L&H cells is 
similar to that seen in several Indolent B-cell NHLs, most 
notably follicular center lymphoma (FCL). grade | .^a.s^ 
Recent studies have noted clinical similarities between 
NLPHD and indolent B-cell NHL.^^^ However, NLPHD 
differs from the majority of low-grade B-cell NHLs with 
respect to treatment response. Most NLPHD patients are 
cured and rarely show progressive disease, in contrast in 
the majority of FCL patients. Furthermore, although bcl-2 
is commonly overexpressed in FCLs, NLPHD typically 
lacks expression of this protein.^'*''*®'®® The combined 
high and low expression of bcl-2 and caspase-3 protein, 
respectively, in low-grade FCL suggests the incurability 
of many of these lymphomas may be directly related to 
the overexpression of anti-apoptotic proteins (eg, Bcl-2) 
combined with the lack of downstream apoptotic media- 
tors such as caspase-3. Furthermore, overexpression of 
Bcl-2 and Bc1-Xl in cell lines can also block cleavage and 
activation of caspase-3,^^'^'^®~®^ 

The lack of caspase-3 expression in NLPHD may also 
be an important mechanism of resistance to apoptosis. 
Furthermore, the differential expression of caspase-3 be- 
tween CHD and NLPHD suggests that each may be a 
distinct disease entity, and may account for some of the 
clinical differences between these two disorders. Addi- 
tional studies to define the expression and function of 
caspases and their relationship to other apoptosis-re- 
lated proteins may provide novel insights into the patho- 
genesis and treatment resistance of this malignancy. 
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ABSTRACT A factor with tumor necro^ factor (TNF) 
activity produced by tlie LuKII human lymphoblastoid cell line 
[designated TNF(LuKII)] was purified sequentially by using 
controlled-pore glass, lentil lectin-Sepharose, and procion red 
agarose chromatography, 3ie!ding TNF with a specific activity 
of 1.5 X 10^ units per mg of protein and an isoelectric point 
of -6.7. Purified TNF(LuKn) firactionated by NaDod- 
$04/PAGE under reducing as well as noiiredudng conditions 
was found to contain seven protein bands of A#r 80,000, 70,000, 
43,000^ 25»000, 23»000, 21,000, and 19»000. The proteins ^AMr 
80,ci00 and 70»000 could not be dissociated Into lower mdecular 
wdght components. Peptide mapping analysis and immuno- 
blottlng analysis revealed that the seven protein bands In the 
purified TNF(LuKII) preparations are related. After fraction- 
ation of TNF(LuiCID by NaDodS04/PAGE under reducing 
conditions, TNF activity was recovered from the regions of Mr 
IQjm and 19,000-25,000. Purified human TNF(Luiai) (i) 
produces hemorrhagic necrosis of Meth A mouse sarcoma in 
the standard in vivo mouse TNF assay; (u) has the same |iattem 
of reactivity as mouse TNF (cytotoxic/cytostatic/no effect) on 
A pamel of human cancer cell lines; and (iu) has its anticeUular 
effect potentiated by interferon, also a feature of ihouse TNF. 



The presence of a tumor inhibitory factor in the sera of mice 
infected with bacillus Calmette-Gu^rin (BCG) and subse- 
quently ii\jected with endotoxin was reported by Carswell et 
aL (1). Sera from these mice cause necrosis and regression of 
certain tumors in mice and have a cytotoxic effect on tumor 
cells in vitro (1-5). By using similar methods, a factor with the 
same in vivo and in vitro properties can be induced in rats (1) 
and rabbits (1, 6. 7). The antitumor factor present in the sera 
of animals sensitized to BCG or other immuhopOtentiating 
agents, sucli as Corynebacterium parvum or Zyinosan, and 
then challenged with endotoxin has been termed tumor 
necrosis factor (TNF). Biochemical studies have indicated 
that serum TNF activity is associated with both high molec- 
lilar wei^t components (4, 8) and components in the 
range of 40,000 to 70,000 (3-5, 9, 10). 

We have recently reported that human cell lines of hem- 
atopoietic origiii have the capacity to produce a factor with 
TNF activity (11). The product of one of the lines (LuKII) 
was chosen for detailed studies and, according to the fbllow- 
ing criteria. TNF(LuKII) and mouse TNF have identical 
properties: (0 mouse L cells made resistant to mouse TNF 
are resistant to TNF(LuKIl), and L cells made resistant to 
TNF(LuKII) are resistant to mouse TNF; (ii) the anticeUular 
response of a panel of human cell lines to TNF(LuKII) or 
mouse TNF is indistinguishable and can be potentiated in a 
synergistic fashion by interferon; and (iiO .TNF(Lukll) 
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causes hemorrhagic necrosis of Meth A sarcomas in the 
standard in vivo TNF assay (11). 

In the present study, we report a sequential chromato- 
graphic procedure for the purification of TNF(LuKII) and 
describe biochemical, serological, and biological character- 
istics of purified TNF(LuKII). 

MATERIALS AND METHODS 

ProductloD of TNF(LuKID. LuKII cells (8 x ID' cells per 
ml) in RPMI 1640 medium containing 8% fetal calf serum 
were incubated with 10 ng of mezerein per ml (L. C. 
Services, Wobum, MA) for 48 hr. The cells were separated 
by centrifugation, resuspended in firesh RPMI medium lack- 
ing any protein supplement, and incubated for aii additional 
48 hr. Cells were removed by centrifugation, and the culture 
media were used as the source of TNF(LuKII). 

In Vitro TNF Assay. TNF assays were performed in 96-well 
microliter plates. Serially diluted fractions were sterilized by 
ultraviolet radiation and TNF-sensitive L cells were added to 
each well at a density of 2 x 10^ cells per well in 100 /tl. After 
incubation for 2 days at 37^C, the plates were examined 
microscopically and the percentage of dead cells was deter- 
mined. The unitage of the sample was calculated as the 
reciprocal of the highest dilution that killed 50% of the cells. 
All TNF assays were run in parallel with a laboratory 
standard and titers are expressed in laboratory units. 

in Vivo TNF Assay. The standard Meth A sarcoma assay 
was performed as described (li). 

Monoclonal Antibody to Human TNF. BALB/c mice were 
injected with 1600 units of TNF(LuKII), with a specific 
activity of 1.5 x 10^ units/nig. For the initial injection, * 
TNF(LuKII) was mixed with Freund*s complete adjuvant 
(1:1) and was injected subcutaneously. Subsequent injections 
were given intraperitonealiy in the absence of adjuvant. 
Serum antibody to TNF(LuKlI) was determined by an 
ELISA with TNF(LuKII) bound to polystyrene plates. After 
nine immunizations over a period of 7 months, the spleen of 
one mouse with a high titer of TNF(LuKII) antibody was 
removed and fused with cells of the P3U1 mouse plasma- 
cytoma cell line. Resulting clones were screened for their 
abiUty to bind TNF(LuKII) in ELISAs. A hybridoma (des- 
ignated Tl-18) producing antibody reactive with TNF- 
(LuKII) was isolated and subcloned. Media from Tl-18 
hybridoma cultures served as a soiu'ce of TNF(LuKII) 
antibody. 

Affinity Chromatography. Affinity chromatography proce- 
dures were carried out at room temperature and column 
fractions were collected into polypropylene tubes or bottles. 
The column matrices used were controlled-pore glass 350 
(Electro-Nucleonics. Fairfield, NJ). lentil lectin-Sepliarose 
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Fig. 1. {A) ControUed-pore glass column chromatography. LuKII culture medium (8 liters) containing 200 units of TNF activity per ml was 
applied to a controUed-pore glass column (SO ml) equilibrated with phosphate-buffered saline (20 mM sodium phospbate« pH 7.0/0^15 M NaO) 
(PBS). The column was washed with the following buffers in sequence: PBS (75 ml); PBS containing 20% ethylene glycol (vol/vol) (£0 (225 
ml); PBS (120 ml)/20 mM sodium phosphate, pH 7.0/1.15 M NaCI (E2) (175 nU); PBS (50 ml)/5 mM sodium phosphate, pH 6.8 (E3) (225 ml); 
and 5 mM sodium phosphate. pH 6.8/5% triethylamine (vol/vd) (E«) (ISO ml). Eluted fractions were collected in polypropylene bottles. The 
material eluted with the E4 buffer was collected in 50-ml aliquots. (B) Lentil Ifectin Sepharose column chromatbgri^y. Partially purified 
TNF(LuKII) (ISO ml) eluted from the controUed-pore glass column was loaded onto a lentil lectin>Sepharbse column (10 ml) equilibrated with 
PBS. The column was washed sequentially with PBS (40 ml), PBS/1 M NaO (Ei) (24 ml), and PBS/i M NaCl/0.2 M methyl-o-b-mannoside 
(Ej) (60 ml). The material eliited with the methyl-a-D-mannoside-containing buffer was collected in 10-n)l aliquots. (iC) Procion red agarose 
column chromatography. Partially purified TNF(LuKIl) (60 ml) eluted from the lentil lectin-column was diluted 1:1 with PBS aiid loaded onto 
a procion red agarose column (4 ml) equilibrated with 20 mM sodium phosphate. pH 6.8/0.65 M NaCl (PBS/O.S M NaCl). The column was washed 
with the foUowing buffers in sequence: PBS/0.5 M NaQ (E,) (^ nil); PBS/1 M NaCl (E2) (8 ml); PBS (8 ml); PBS/50% ethylene glycol (vol/vol) 
(E3) (8 ml); PBS (8 ml); 0.1 M Tris HCl. pH 9.4/0.1 M NaCl (E4) (8 ml); and 0.1 M Tris HCl, pH 9.4/0.1 M arginine (Ej) (24 ml). The material 
eluted with the 0.1 M Tris-HCl, pH 9.4/0.1 M arginine buffer was collected in 4-m] aliquots. 



(Pharmacia), and procion red agarose (Bethesda Research 
Laboratories). 

Protein Determinations. Protein determinations were car* 
ned out with the Bio-Rad dye reagent (Bio-Rad) using bovine 
seriim albumin as a standi. 

RadiokMUnation (tf TNF(Laiai). TNF(LuKII) was labeled 
with using l,3,4,6-tctrachloro-3a,6a-diphenylglycouril 
(iodo-Gen, Pierce). Polypropylene tubes were coated with 100 
/ig of lodo-Gen (dissolved iii chloroform) by evaporation of the 
solvent. A 2-nil sample of TNF(LuKn) (50,000 units/ml) with 
a specific activity of 1.5 x 10^ uiiits per mg of protein was 
incubated for 25 min at room temperature in an lodo-Gen- 
coated tube containing 2 mCi of ^"I (1 Ci = 37 GBq). The 
labeled protein was then separated from the unbound by 
using a P-4 colunui (Bio-Rad). equilibrated with phosphate- 
buffered saline containing 50 /ig of cytochrome c per ml. The 
iodinated material ehited in the voki vohime of the column was 
divided into aliquots and stored at —WC, 

NaDodS04/PAGE. NaDodS04/PAGE was carried out in 
18-cm slab gels according to published methods (12). 

Isoelectrbfociisiiig. Isoelectrofocusing was performed by 
using Ampholine Pagplates (pH 3.5-9.5) (LKB). The gels 
were run at 30 W for 1 .5 hr, at which time the pH gradient was 
measured and the gel was sliced into 18 equal pieces. The gel 
fractions were incubated for 18 hr in Eagle*s minimum 



essential medium (ME medium) containing 10% fetal calf 
serum and fractions were assayed for TNF in vitro. 

V^fldAt Mapping Analysis. A ^^I<labeled preparation of pu> 
rified TNF(LuKlD was fractionated by NaPodS04/PAGE and 
individual bands localized by autoradiogrsq[>hy were cut from 
the gel and treated with L-l-tosylamido-2-phenylethyl 
chloromethyl ketone-treated trypsin or TV-a-tosyllysine 
chloromethyl ketone-treated chymotrypsin. Digested fractions 
were analyzed according to the methods of Elder et at, (13). 

InununoblottUig Analysis. Immunoblotting was performed 
essentially as described (14). Briefly, preparations of purified 
TNF(LuKII) were transferred to nitrocellulose paper over- 
night at 100 mA. After incubation of the nitrocellulose paper 
in buffer containing bovine senmi albumin, the paper was 
exposed for 2 hr to 40 ml of Tl-18 antibody-containing culture 
medium. The nitrocellulose paper was then washed exten- 
sively and incubated overnight in 10 mM Tris*HC], pH 
7.4/0.9% NaCl/5% bovine senim albumin/^"I-labeled rabbit 
anti-mouse IgG. The mtrocellulose paper was further washed 
and exposed to x-ray film. 

RESULTS 

PurificatkMi inf TNK(LuKII)» ControUed-pore glass beads 
bound all TNF activity from LuKII culture fluids. After 
washing with several buffers in sequence, TNF activity was 



Table 1 . PurificaUon of TNF(LuKlI) 
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Controlled-pore glass 


1.6 X 10* 


5.3 X lO' 


9.6 X 10» 


3.8 X 10* 


60 


72 


Lentil lectin-Sepharose 


9.6 X lO' 


3.8 X 10» 


6.3 X io* 


1.3 X 10* 


39 


245 


Procion red agarose 


6.3 X 10* 


1 X 10* 


6.3 X 10* 


1.5 X 10' 


39 


2830 
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eluted with a 5 mM sodium phosphate buffer (pH 6.8) 
containing 5% triethylamine (Fig. \A). The eluted TNF was 
then applied to a lentil lectin-Sepharose column, which was 
washed first with phosphate-buffered saline and then with 
0.02 M sodium phosphate buffer (pH 6.8) containing 1.15 M 
NaCl (buffer A). THF activity was eluted from this column 
with buffer A containing 0.2 M methyl-o-D-mannoside (Fig. 
\B), All TNF activity bound to the lentil lectin-Sepharose 
column and 39% of the activity was recovered in the 
methyl-ot-D-mannoside-ccntaining buffer. (Further washing 
of the column with buffer containing 50% ethylene glycol 
chutes only a small amount of additional TNF activity.) TNF 
from the lentil lectin column was then diluted 1:1 with 
phosphate-buffered saline and loaded onto a procion red 
agarose column. The column was washed sequentially with 
several buffers that remove protein having no TNF activity. 
The column was then washed with 0.1 M Tris-HCl, pH 
9.4/0.1 M arginine. TNF activity was eluted with this buffer, 
yielding TNF with a specific activity of 1.5 x 10^ units per mg 
of protein. Table 1 summarizes the purification scheme for 
TNF(LuKII) with specific activities of the resulting fractions. 

Biochemical CharacterizatJon of Purified TNF(LuKn). 
Isoelectric focusing of purified TNF(LuKII) indicates an 
isoelectric point of «6.7 (Fig. 2). ^"l-labeled TNF(LuKII) with 
a specific activity of 1.5 x 10^ units per mg of protein was 
ansdyzed by Na£>odS04/PAGE and found to contain seven 
protein bands with Mj values of 80,000, 70,000, 43,000, 25,000, 
23,000, 21,(X)0, and 19,000 (Fig. 3). The same seven protein 
bands were observed when nonlabeled purified TNF(LuKII) 
was fractionated by NaI>odS04/PAGE and examined by silver 
staining. The proteins of 80,000 and 70,000 were eluted from 
the gels and reanalyzed by NaDodS04/PAGE. They migrated 
once again to the Mr 70,000-80,000 r^on, and no lower 
molecular weight components were observed. In further exper- 
iments, purified TNF(LuKII) was boiled in NaDodS04, urea, 
and 2-mercaptoethanol, and the same characteristic seven 
bands were found. 

To determine which bands in TNF(LuKII) showed TNF 
activity, parallel samples of purified TNF(LuKII), one ^^^I- 
labeled and one unlabeled, were treated with 0.1% 
NaDodSO4/0.1 M 2-mercaptoethanol and fractionated by 
NaDodS04/PAGE. After electrophoresis, the lane contain- 
ing the unlabeled material was cut into 4.4-mm slices and the 
proteins were eluted from each slice by overnight incubation 
at 4X in ME medium containing fetal calf serum. The parallel 
lane containing ^I-labeled TNF(LuKII) was dried immedt* 
ately after electrophoresis and protein bands were located by 
autoradiography. As seen in Fig. 4, TNF activity was recov- 
ered fr9m the gel at values of 70,000 and 19,000-25,000, 
corresponding to ^^I-labeled protein bands at these posi- 




Froction 



Fig. 2. Isoelectrofocusing of TNF(LuKll). A 60*^ sample of 
purified TNF(LuKII) containing 1500 units was applied to an 
ampholine gel (pH 3.5-9.5). 
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Fig. 3. NaDodS04/PAGE of puri- 
fied »"l-labcled TNF(LuKlI). TNF(Lu- 
KII) was iodinated and fractionated by 
NaDodS04/PAGE. Auto radiographs 
were developed for 18 hr (lane 1) and 0.5 
hr (lane 2). The following proteins pro> 
vided M, markers: myosin (M,, 200,000), 
0-galactosidase (Mr, 130,000), phospho- 
rylase b (A/r. 94,000), bovine serum albu- 
min (Afr. 67,000), ovalbumin (A/,. 
43,000), a-chymotrypsinogen (Af,, 
25,700), ^-lactoglobulin (Mr. 18,400). 
lysozyme (Mr, 14,300). and cytochrome 



tions. TNF(LuKII) samples that were not exposed to 2- 
mercaptoethanol before NaDodS04/PAGE also showed 
TNF activity at M^ values of 70.000 and 19,000-25,000. 

To examine the relationships among the various protein 
bands in purified TNF(LuKII) preparations, two-dimensional 
chymotryptic and tryptic peptide mapping analyses were per- 
formed. As seen in Fig. 5a, the chymotryptic peptide maps 
denionstrate that the proteins of Mr 43,000, 25,000, 23,000, 
21,000, and 19,000 are related and the proteins of A/r 80,000 and 
70,000 are related. To examine the relationship of the larger 
molecular weight proteins to the snialler proteins, chymotryptic 
digests of the Mr 70,000 and 25,(X)0 proteins containing equal 
amounts of radioactivity were mixed and analyzed. As seen in 
Fig. 5^, three of the fragments (termed A, B, and C) generated 
by digestion of the Mr 25,000 protein migrate to the same 
position as three fragments generated by digestion of the Mr 
70.000 protein. A similar analysis was carried out using trypsin 
as the proteolytic enzyme. The results also indicate that the 
seven distinct forms are closely related. 

Further evidence for the relationships among the various 
proteins in purified TNF(LuKlI) comes from inmiunoblotting 
analysis with Tl-18 monoclonal antibody to TNF(LuKlI). 
Fig. 6 shows that the antibody reacts with the Mr 43,(X)0 and 
the Mr 19,000-25,000 components. 

Biological Characteristics of Purified TNF(LuKII). Limulus 



M, BO 70 43 25-10 




Fraction 



Fig. 4. Recovery of TNF activity after NaDodS04/PAGE frac- 
tionation of TNF(LuKn). A sample of TNF(L.uKIl) containing 6000 
units adjusted to contain 0.1% Na£>odS04 and 0.1 M 2-niercapto- 
ethanol was applied to a 12% polyacrylamide gel. After electropho- 
resis, the gel was sliced and activity was eluted and assayed. In an 
adjacent lane. *"l>labeled TNF(LuKII) was fractionated and auto- 
radiographed to determine the molecular weight of the TNF(LuKlI) 
active fractions. 
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Fig. 5. Chymotryptic peptide mapping of *^Wa- 
beled proteins in purified TNF(LuKll). ^"l-labeled 
TNF(LuKII) was fractionated on NaDodS04/PAGE 
and individual protein bands present in gel slices were 
incubated overnight in the presence of 50 ^ig of N-a- 
tosyllysine chloroniethyl ketone-treated chymotrypsin 
per ml. The individua] gel slices were then washed with 
water and samples (10,000 cpm) of each were lyophilized 
to dryness. These samples were dissolved in a buffer 
containing formic acid and acetic acid and were applied 
to cellulose precoated glass TLC plates at the origin ( x ). 
Electrophoresis (E) was performed from right to left, 
followed by ascending chromatography in a buffer 
containing butanol, pyridine, and acetic acid. Autora- 
diographs of the chymotryptic maps of the M, 80,000, 
70,000, 43,000. 25.000. 23.000, 21.000, and 19,000 pro- 
teins are presented (a); {b) chymotryptic maps of the Af, 
25,000 and 70.000 and a mixture of the Mr 25,000 and 
70.000 proteins. 



tests of purified TNF(LuKII) indicate 25 ng of endotoxin per 
ml. TNF(LuKU) causes hemorrhagic necrosis of Meth A 
sarcoma after intratumoral or intravenous iiyection and total 
tumor regression has been observed in some treated mice. 
L-cell lines made resistant to mouse TNF or to partially 
purified TNF(LuKII) are resistant to purified TNF(LuKII). 
With the panel of human cell lines studied by Williamson and 
coworkers (11), purified TNF(LuKil) showed the same 
pattern of reactivity (cytotoxic/cytostatic/no effect) as 
mouse TNF and partially purified TNF(LuKII). In addition, 
purified TKF(LuKII) and interferon showed synergistic 
cytotoxic activity for human tumor cells, similar to what has 



previously been reported for partially purified TNF(LuKII) 
and mouse TNF (12). 

PISCUSSION 

We have recently described the production, characterization, 
and biological properties of a human factor with TNF activity 
from the LuKII lymphoblastoid ceU line (11). The further 
purification and characterization of this factor, designated 
TNF(LuKII), is the subject of this report. A protocol for the 
purification of TNF(LuKII) has been developed that yields 
both good recoveries of TNF and material with high specific 
activity. This purification protocol allows active fractions 



Immunology: Rubin et ai 

Mr X 10"^ 




Fig. 6. Immunoblottiiig analysis of TNF- 
(LuKII) with Tl-18 mouse moooclonal anti- 
body. A sample of TNFCLuKII) containing 
lO^OOO units was fractionated by NaDod- 
S04/PAGE. Fractionated proteins were trans- 
ferred to a nitrocellulose membrane and pro* 
cessed as described. 



eluted from one colunui to be applied either directly or after 
dilution onto the next column, thereby eliminating any need 
for dialysis and thus avoiding the losses associated with 
dialysis. TNF(LuKII) has a Mr of 70,000 by gel filtration 
under nonreducing conditions and an isoelectric point of 6.7. 
Examination of purified TNF(LuKII) by NaDodS04/PAGE 
under reducing as well as nonreducing conditions revealed 
the presence of seven protein bands ranging from Mr 19,000 
to 80,000. The M, 80,000 and 70,000 proteins could not be 
dissociated into smaller molecular weight components even 
after boiling in NaI>odS04/2-mercaptoethanol/urea. Frac- 
tions from NaDodS04/PAGE were assayed for TNF activity 
and proteins m the Mr 70,000 and 19,000-25,000 region had 
TNF activity. Using peptide mapping analysis, we found that 
the seven proteins present in piuified TNF(LuKII) were 
related. Immunoblotting analysis with monoclonal antibody 
to TNF(LuKII) showed shared determinants on the Mr 
43,000 and 19,000-25,000 proteins. Antibody did not react 
with the higher molecular weight forms, even though these 
have been shown to be related to the Mr 43,000 and lower 
molecular weight components. This could be due to the 
inaccessibility of the determinant on the Mr 70,000 and 80,000 
species. Thus, our analysis indicates that there are a number 
of stnicturally related proteins in purified TNF(LuKII) and 
that TNF activity is associated with nondissociable high 
molecular weight and low molecular weight forms. We 
conclude that the seven proteins in our purified TNF(LuKII) 
are either the products of related genes or products DCof a 
single gene that undergoes extensive processing. 

TNF(LuKlD has the fiiU range of biological activities 
associated with mouse TNF. It produces hemorrhagic ne- 
crosis of Meth A sarcoma in the standard TNF assay and 
cannot be distinguished from mouse TNF in its pattern of 
reactivity on a laiige panel of human cancer cell lines. In 
addition, L cells made resistant to mouse TNF are resistant 
to TNFGLuKII), and cells made resistant to TNF(LuKII) are 
resistant to mouse TNF. Recent work has indicated that there 
are surface receptors for TNF on TNF-sensitive cells (un- 
published data). Competitive binding studies showed that 
mouse TNF and TNF(LuKID compete for the same receptor. 

The relationship between TNF(LuKIl) and lymphotoxin 
(15, 16) is unclear. While they share certain properties, such 
as their ability to kill mouse L cells, their affinity for lentil 
lectin, their multiple forms, and, in some cases, their cellular 
origin, they differ in certain biochemical properties. Ag- 
garwal and coworkers (17, 18) have observed that lympho- 
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toxin from the RPMI 1788 lymphoblastoid cell line has a Mr 
value of 25,000 and 20.000 under reducing conditions, where- 
as TNF(LuKII) exists in several molecular weight forms, 
some of which do not dissociate under reducing conditions. 
Granger et al. (15) have also reported that species of 
lymphotoxin exist in higher molecular weight forms. It seems 
likely that there is a family of cytotoxic factors with TNF 
activity and that lymphotoxin and TNF(LuKII) are members 
of this family. This would be comparable to the interferon 
system in which there are three major species, all having 
antiviral activities, but each t>eing coded for by a different 
gene with varying degrees of homology. As in the interferon 
field, the cloning of factors with TNF activity will provide the 
basis for distinguishing various molecules of this family. The 
recent cloning of human TNF (19-21) and human lympho- 
toxin (22) has clarified the relationship between these two 
molecules. Recombinant TNF and lymphotoxin are distinct 
molecules of Mr 17,000 and 18,600 that share considerable 
sequence homology. Both kill L cells and have tumor 
necrosis activity. In light of the observation that TNF(LuKII) 
exists in higher molecular weight forms, it seems imlikely that 
all forms of TNF have been cloned. 

This research was supported in part by grants from the National 
Cancer Institute (CA-38661 and CA-08748). 
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ABSTRACT Human ceU lines of hematopoietic origin were 
tested for production of tumor necrosis factor (TNF). B-cell lines 
transformed by Epstein-Barr virus release a factor (referred to 
as hTNF) that is cytotnik for mouse L oeib sensitive to nw^ 
but not for L oefls resistant to mouse TNF. Exposure to 4/3-phor- 
bol 12^myristate 13ar-aoetate au|pnented production of hTNF. 
hTNF activity was not found in supematants of cell Imes ci T-ceD, 
monocytic, or promyelocytic origin. Partially purified hTNF has 
a molecular weight of approximately 70,000» has no interferon ac- 
tivity, is add labile, is destroyed by heating at 70^ for I hr, in- 
duces cross-resistance to mouse TNF in oi^no, and causes hem- 
orrhagic necrosis of Meth A mouse sarcoma in the standard m 
utoo mouse TNF assay. Tests with a panel of 23 human cancer cell 
lines showed that hTNF is cytotoxic for 7 cell lines, cytostatic for 
5, and has no effect on 11. Comparative studies with human or, 
P, and Y interferons indicated that sensitivity to hTNF and in- 
terferon can be distinguished. Combined treatment with hTNF 
and a or y interferon resulted in a synergistic cytotoxic effect. 



Tumor necrosis factor (TNF) was recognized by Carswell et al. 

(I) during a study of the antitumor activity of serum from mice 
iniFected with bacillus Calmette-Gu^rin (BCG) and subse- 
quently injected with endotoxin. Serum from these mice con> 
tains a &ctor that induces hemorrhagic necrosis of certain mouse 
sarcomas in vivo and has cytotoxic/cytostatic effects on mouse 
and human tumor cells in vitro (1-5). Serum from mice singly 
treated with BCG or endotoxin did not have these properties. 
Other agents, such as Corynebacterium parvum or zymosan, 
that cause hyperplasia of the reticuloendothelial system and in- 
crease sensitivity to endotoxin lethality, can substitute for BCG 
in priming for TNF release. Endotoxin, however, apptears to be 
unique in its ability to elicit TNF release (1, 4-6). By using these 
me^ods for TNF production, factors with TNF-like activities 
have also been found in rats (1) and rabbits (1, 7, 8). 

Biochemical characterization has shown that mouse serum 
TNF exists in at least two forms: a 150, (XX) form (4, 9) and 
a 40,000^,000 Mr form (S-S). TNF in rabbit senim has been 
reported to have a molecular weight of 39,0(X) (10) and 67, (XX) 

(II) . Our studies have indicated that both in vivo and in vitro 
activities of mouse TNF appear to be a property of the same 
molecule. The cellular source of TNF in the mouse was initially 
assumed to be the macrophage, because the agents used to prime 
for TNF production cause massive hyperplasia of macrophages 
in liver and spleen (1). From studies of macrophage-rich cell 
populations in vitro, Matthews (12) and Mannel et al, (13) reached 
a similar conclusion with regard to the source of mouse and rab- 
bit TNF. Direct evidence that macrophages are at least one cell 
type in the mouse capable of producing TNF comes from stud- 
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ies with cloned lines of mouse histiocytomas (ref. 13; unpub- 
lished data). These cells constitutively produce low levels of 
TNF that are greatly increased after exposure to endotoxin. 

In the present study we have examined the capacity of hu- 
man cell lines of hematopoietic origin to produce a TNF-like 
&ctor. 

MATERIALS AND METHODS 

CeU Lines. See Tables 1 and 2. LuKII cells were obtained 
from W. Stewart (14). B-cell lines transformed by Epstein-Barr 
virus were derived from patients with melanoma (15). These 
and other cell lines were from our cell bank or from the col- 
lection of J0rgen E. Fogh or Peter Ralph of Sloan-Kettering 
Institute. 

In VUro TNF Assay. TNF-sensitive and -resistant L-M cells 
were derived from clone 929 mouse L cells (American Type 
Culture Collection). TNF activity was assayed by adding equal 
volumes of serially diluted samples to wells (24-well Costar plates) 
seeded 2-3 hr previously with 5 X 10* trypsinized L cells and 
determining cell death at 48 hr by phase-contrast microscopy 

(1). 

In Vivo TNF Assay. (BALB/c x C57BL/6)Fi female mice 
were injected inbradermally with 5 X 10^ Meth A BALB/c sar- 
coma cells. After 7 days (tumor size approximately 7 mm av- 
erage diameter), mice received a single intravenous injection 
of the TNF preparation. After 24 hr, tumor hemorrhagic ne- 
crosis was scored according to ref. 1. 

Interferon (IFN) Assays. Human IFN activity was measured 
by inhibition of the cytopathic effect of vesicular stomatitis vi- 
rus on WISH or GM2767 cells (16) and compared against an 
international standard in the case of IFN-a and the laboratory 
standard in the case of IFN-y. 

Mouse TNF. A standard lot of partiallv purified mouse serum 
TNF widi a specific activity of 2 x 10* units of TNF per mg 
of protein was used in these studies. (A unit of TNF is defined 
as the amount of protein causing killing of 50% of the L cells 
in the standard in vitro TNF assay.) This preparation of mouse 
TNF lacked IFN activity. 

Source of IFNs. Human IFNs were obtained from the fol- 
lowing sources: Recombinant IFN-a (Hof?mann-La Roche), 2- 
4 X 10^ units/mg of protein; leukocyte-derived IFN-a (pre- 
pared for the Sloan-Kettering Institute by Kocher Laboratory, 
Beroe, Switzerland), 0.64 X 10® units/mg of protein; leuko- 
cyte-derived IFN-y, 1 X 10^ units/mg of protein; fibroblast- 
derived IFN-^ (Roswell Park Memorial Inst.). 1 x 10^ units/ 
mgof protein. 



Abbreviations: TNF. tumor necrosis iador; hTNF, human TNF; BCG. 
bacillus Calmette-Gu6rin: PMA, 4/3-phorbol 12/3-myristate 13a-ace- 
tate; IFN, interferon. 

*To whom reprint requests should be addressed. 
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Screening of Hunuin CeDs of Hematopoietic Origin for TNF 
Production. Cells were cultured at 5 X 10^ per ml in RFMI 
1640 medium containing 8% fetal calf serum with or without 
4/3-phorbol 12/3-myristate 13a-acetate (PM A) (Sigma) at 10 ng/ 
ml. After incubation for 48 hr at 3T*C» the cells were collected 
by centriiugation and resuspended at 1 X 10^ per ml in medium 
without PMA for an additional 48 hr. Culture supematants were 
freed of cells by centrifugation and frozen at — 20^C before 
screening 1/2 diluted supematants for TNF activity by the L 
cell assay. 

Preparation and Concentration of hTNF from LuKII Cells. 
LuKII cells were cultured at 5 x lO' per ml in RPMI 1640 me- 
dium containing 5% fetal calf serum and PMA at 10 ng/ml and 
incubated for 48 hr at 3T*C. At 48 hr, cells were collected by 
centrifogation and resuspended at 8-10 X lO' per ml in serum- 
free R-rrS PREMIX (insulin, transferrin, selenium serum sub- 
stitute, Collaborative Research, Waltham, MA) and 2 nM eth- 
anolamine and incubated an additional 48 hr. Supematants were 
spun free of cells and frozen at ~20^C. LuKII supematants were 
concentrated by Amicon stirred cells (PM 10 membrane) and 
apphed to a DEAE>Sephadex A-50 column (40 x 2.6 cm) equil- 
ibrated with 0.05 M Tris-HCl/0.15 M NaCl buffer, pH 7.3. 
Fractions (5 ml) were eluted with the same buffer and hTNF- 
active fractions were pooled and concentrated by Amicon cells. 
Molecular weight was estimated by Sephacryl S-200 column 
chromatography. 

Effect of hTNF and IFN on Human CeD Lines. Cells were 
trypsinized, rinsed twice, plated at 4-5 X 10^ per well (24-well 
Costar plates) in minimal essential medium containing 8% fetal 
calf serum and incubated at 3T*C. After 16-24 hr, the medium 
was aspirated and 1-ml dilutions containing DEAE-fractionated 
hTNF, IFN, or hTNF and IFN in medium were added. Total 
cell counts (viable and nonviable) were determined at 3, 5, and 
7 days by phase-contrast microscopy. Cultures were re-fed on 
day 5 with fresh medium containing the same concentrations 
of hTNF and IFN. 

RESULTS 

TNF Production by Human Cell Lines of Hematopoietic 
Origin. Supematants from the ceU hnes listed in Table 1 were 
screened for TNF production by using the L cell assay system 
developed for the detection of mouse TNF (1, 6). Supematants 
were tested on TNF-sensitive L cells [designated L(s}l ^d on 
tfaeTNF-resistant L cell line [designateid L^]. The I^k) line was 
developed by repeated passage of sensitive L cells in medium 
containing mouse TNF. Table 1 shows that cell lines of B-cell 
origin produce a fector with TNF-like activity — e.g., cytotoxic 
for L(Si but not — and that PMA is necessary for efficient 
TNF production. No TNF activity was detected in the super- 
natants of six T-cell or three monocytic cell lines (wi^ or with- 
out PMA stimulation). The LuKII line consistently produced 
high levels of TNF after PMA induction and was therefore se- 
lected as the source for subsequent studies of TNF production 
and characterization. 

Fractionation and Characterization of Human TNF (hTNF). 
Concentrated samples of LuKII supematants were fractionated 
on DEAE-Sephadex A<^ columns and individual fractions were 
assayed on l^s) and I^r) cells. TNF activity was detected in the 
initial flow-through fractions, and peak fractions were pooled 
and concentrated. The specific activity of these preparations 
was 2-5 X 10^ units/mg of protein (a 25-fold increase over the 
initial supematants), and these preparations were used for char- 
acterization. The molecular weight of hTNF was estimated to 
be approximately 70,000. hTNF activity was destroyed by a 12- 
hr exposure to low or high pH: 90% of the activity was lost at 
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Table 1. Screening of human cell lines of hematopoietic origin 
for TNF production 

CeU viability (%) after exposure 
to culture supematants 

Us) oells L|R) cells 



CeU line 


No PMA 


PMA 


No PMA 


PMA 


Bcell 

" ^^9aa 










BE* 


47 


20 


100 


100 


BI* 


57 


43 


100 


98 


CL* 


95 


37 


100 


97 


cw* 


28 


A 


100 


100 


DE* 


45 


18 


100 


100 

AW 


DM* 


18 


11 


100 

AW 


100 

AW 




88 


Oft 


100 

AW 


100 

AW 




27 


16 


100 


100 

AW 


EL* 

nil* 


70 


16 


100 

AW 


100 

AW 


EO* 


15 


11 
A A 


100 

AW 


100 

AW 


ER* 


47 


22 


100 

AW 


100 

AW 


PC* 


19 


ft 


100 

AW 


100 

AW 


FD* 


44 


16 

AV 


97 


100 

AW 


FG* 


80 


30 


100 

AW 


97 


ARH-77 


68 


12 


100 


100 


DAUDI 


97 


97 


NT 

A 


NT 

• ^ A 


LiiKII 


20 


13 

Aw 


100 

AW 


100 

AW 


RPMT 17Aft 


78 


35 


100 

AW 


100 

AW 


RFMI 8226 


96 


84 


100 


86 


RPMT AAfilS 
ivmi oooo 


^9 


in 


inn 




SK-LY-16 


98 


100 


100 


100 


SK-LY-18 


99 


100 


100 


98 


ABA-10 


77 


39 


100 


97 


BALL-1 


100 


100 


100 


100 


TceU 










CCRF-CEM 


100 


100 


100 


100 


P.12 


95 


94 


99 


98 


MOLT-4 


100 


100 


99 


96 


T-45 


100 


100 


100 


100 


HFB-ALL 


100 


97 


100 


100 


TALUl 


96 


94 


96 


96 


Bfonoqrte 










J-111 


100 


100 


99 


98 


THP 


96 


80 


99 


99 


U-937 


100 


100 


99 


99 


PromjreloQrte 










UL-60 


96 


97 


99 


98 



NT, not tested. 



pH 2.0 and 45% of the activity was lost at pH 10. Activity was 
stable over a pH range of 6 to 8. With regard to heat stabihty, 
hTNF was destroyed at 70°C for 60 min but was stable at 56°C 
for 60 min. Activity was preserved over long periods of storage 
at —78 or — 20*C. IFN assays demonstrated the presence of 100 
units of IFN-a per ml in tiie unfractionated supernatant from 
PMA-stimuIated LuKII cells. No IFN activity was detected in 
hTNF preparations after DEAE-Sephadex chromatography. To 
investigate further the relationship of mouse TNF to hTNF, L 
cells were made resistant to hTNF by repeated passage in hTNF- 
containing medium. hTNF-resistant cells showed complete cross- 
resistance to mouse TNF. DEAE-fractionated hTNF was as- 
sayed by the standard in vioo mouse TNF assay (1) and found 
to cause necrosis of Meth A tumors. Three hundred micro- 
grams of the hTNF preparations produced + -I- + necrosis in 
1/5 mice and + + in 4/5 mice; 150 MS produced + + necrosis 
in 5/7 mice and + in 2/7 mice; 75 fig produced + + necrosis 
in 1/5 mice and + in 4/5 mice. No necrosis was observed in 
control mice after injection of comparable DEAE-Sephadex 
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Fio. 1. Effect of hTNF on four human tumor cell 
lines. hTNF units/ml added to cultures: ; control 
cultures; o, 32 units (BT-20); 64 units (ME-180, 
SK>MEL-109} or 126 units (BT-20, T-24); ii, 129 units 
(MB-ISO, SK.MEL-109) or 253 units (BT-20, T-24); 
A, 258 units (ME-180, SK-MEL-109) or 505 units CBT- 
20, T-24); o. 645 units (ME-180, SK-MEL-109) or 
1,263 units (BT-20, T.24). 



fractions of culture medium alone. Limulus assays indicated no 
difference in the amount of endotoxin in TNF-active and TNF- 
inactive (control) fractions. 

Cytotoxic or Cytostatic Effect of HTNF on Human Tumor 
Cell Lines. Fig. 1 illustrates the effect of hTNF on four cell 
lines: BT-20 breast cancer line (cytotoxic effect), ME-180 cerv- 
ical cancer line (cytotoxic effect), SK-MEL-109 melanoma line 
(cytostatic effect), and T-24 bladder cancer line (no effect). Ta- 
ble 2 summarizes the results of tests with 23 tumor cell lines 
and 4 cultures of normal cells. With a 35% or greater reduction 
in cell viability or cell number at 7 days as the criterion of either 
cytotoxicity or cytostasis, hTNF has a cytotoxic effect on 7 cell 
lines, a cytostatic effect on 5 cell lines, and no effect on 15 lines. 
Three of four cell lines of breast cancer origin were sensitive 
to the cytotoxic effect of hTNF, whereas the predominant ef- 
fect of hTNF on melanoma cell lines was cytostasis. None of the 
four cultures derived from normal tissues was hTNF sensitive. 



Table 2. Effect of hTNF on human cell lines 



Qytotozic effect 
SK-M6-4 (astrocytoma) 
MCF-7 (breast cancer) 
BT-20 (breast cancer) 
SK-BR-3 (breast cancer) 
ME-180 (cervix cancer) 
SK-CO-1 (colon cancer) 
RFMI 7931 (melanoma) 

Cytostatic effect 
SK-LU-1 Gung cancer) 
RPMI 4445 (melanoma) 
SK-MELr29 (melanoma) 
SK-MEL-109 (melanoma) 
SK-OV-^ (ovaiy cancer) 



No effect 
T-24 (bladder cancer) 
5637 (bladder cancer) 
MDA-MB-361 (breast cancer) 
S-4S (colon cancer) 
SK-LC-4 (lung cancer) 
SK-LC-6 Gung cancer) 
SK-LC-12 (lung cancer) 
SK-MELrl9 (melanoma) 
SK-UT-1 (uterus cancer) 
SAOS-2 (osteogenic sarcoma) 
U20S (osteogenic sarcoma) 
WI-38 (fetal lung) 
MY (normal kidney epithelium) 
F-136-35-56 (fetal lung) 
F-136-35-66 (fetal skin) 



CaX VIABtUTY 



BT-20 



ME-180 



SK-MEL-109 



T-24 





DAYS 



FtG. 2. Effect of recombinant IFN-a on four hu- 
man tumor cell lines. IFN units/ml added to cul- 
tures: •» control cultures; ■, 781 units; A, 3,125 units; 
A, 12,500 units; o, 50,000 units. 
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Fig. 3. EfTectoflFN-yon four human tumor cell 
lines. IFN units/ml added to cultures: control cul- 
tures; 7 .8 units; A, 31.26 units; 125 unite; 500 
units; o, 2,000 units. 



Effect of Human IFNs on Human Tumor Cell Lines — Syn- 
ergistic Action of TNF and Interferon. Recombinant IFN-a 
and natural IFNs a, fiy or y were tested on a panel oif human 
ceU lines. Figs. 2 and 3 show the response of BT-20, ME-180, 
SK-MEL-109. and T-24 to recombinant IFN-a and natural IFN- 
y and Table 3 summarizes the results of tests with the various 
IFN preparations at 7 days. In general, the number of antiviral 
units of IFN required to cause a 35% or greater cytotoxic effect 
was higher with IFN-a than with IFN-^ or -y. None of the IFN 
preparations showed cytotoxicity or cytostasis on L(s) cells, in- 
dicating absence of TNF-like activity. Fig. 4 shows the influ- 
ence of combined treatment with hTNF and recombinant IFN- 
a or natural IFN-y. Synergistic effects of hTNF and IFN are 
cleariy evident in tests with BT-20, ME-180, and SK-MEL-109, 
in which the combined cytotoxic effect is greater than that seen 
with hTNF or IFN alone. Similar results were obtained with 
mouse TNF and human IFNs a and y. By using the approach 
suggested by Clarke (17), the combined cytotoxic effect of IFN 
and hTNF was clearly shown to be synergistic rather than ad- 
ditive. 

DISCUSSION 

The Bnding diat human cell lines of B-cell origin release a TNF- 
like molecule after stimulation with PMA suggests that normal 
B cells have this capacity as well. Whether other human cell 
types produce TNF remains to be determined, but the fact that 
cloned lines of mouse histiocytoma do so after endotoxin stim- 
ulation indicates the need for further studies of this cell type 
in humans. The identification of the cytotoxic fiictor released 
by the human B-cell lines as the human homolog of mouse TNF 



rests on its differential reactivity with TNF-sensitive and TNF- 
resistant mouse L cells and the similar pattern of response (cy- 
totoxic/cytostatic/no effect) of the human cell panel shown in 
Table I to the human B-cell factor and to mouse TNF (unpub- 
lished data). Further facts relating mouse TNF and hTNF are 
(t) L cells made resistant to hTNF show a cross-resistance to 
mouse TNF, and (it) hTNF preparations cause hemorrhagic ne- 
crosis of Meth A sarcoma, the standard in vivo TNF assay. Al- 
though TNF appears to lack species specificity with regard to 
its antitumor effects, we have found that hTNF has a higher 
specific activity on human cells as compared to mouse cells. 
Other features that are shared by mouse and human TNF are 
acid lability, relative resistance to heat, and augmented toxicity 
in the presence of metabolic inhibitors such as actinomycin. 

Sensitivity to TNF is not an uncommon trait of human cancer 
cell lines. The normal cell types that have been studied to date 
have not shown sensitivity to mouse TNF (18) or hTNF, but 
these tests must be expanded before the apparent tumor spec- 
ificity of TNF can be assessed. The response of TNF-sensitive 
tumor cells can be classified either as cytotoxicity or cytostasis. 
Whether this represents a continuum in response <with cyto- 
toxicity indicating higher sensitivity) or a qualitatively different 
response is unclear. In studies on the influence of T^F on the 
cell cycle in sensitive L cells, the most evident effects are an 
initial accumulation of cells in C2, followed by lysis of cells in 
telophase (Z. Darzynkiewicz, personal communication). Simi- 
lar studies with human cell lines showing a predominant cy- 
totoxic or cytostatic effect may clarify the relationship between 
these two TNF responses. 

Neither mouse nor human partially purified TNF has de- 



Table 3. Cytotoxic and cytostatic efifecto of human IFNs on five human cell lines 



Number of anti-viral units causing 35% cytotoxicity or cytostaaia at day 7 





BT- 


20 


ME-180 


SK-MEL-109 


T-24 




WI.38 


IFN 


CT 


CS 


CT 


CS 


CT 


CS 


CT 


CS 


CT 


CS 


Natural IFN-a 


270 


25 


6,500 


110 


1,250 


26 


>50,000 


1,900 


>50,000 


230 


Recombinant IFN-a 


3,125 


<781 


44,000 


<781 


1,400 


<781 


>50.000 


16,000 


>50,000 


>50,000 


Natural IFN-0 


<49 


<49 


260 


<49 


<49 


<49 


> 12,600 


<49 


>25,000 


160 


Natural IFN-y 


130 


72 


<7.8 


<7.8 


22 


27 


>2,000 


25 


>2,000 


>2,000 



CT. cytotoxic; CS, cytostatic 
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FiQ. 4. Effect of hTNF and IFNsoD four human 
tumor oeU lines. (A ) Recombinant IFN-o. hTNF units/ 
ml and IFN units/ml added to cultures: ; control 
cultures; o, hTNF alone at 4 units (BT-20), 16 units 
(ME-180)» or 33 units (SK-MEL-IOS, T-24); A, IFN 
alone at 7ai units (SK-ld£L-109, T-24) or 3»125 units 
(BT-20, ME-180); a, hTNF and IFN together, in the 
amounts given above for each cell line. (B) Natural 
IFN-y. hTNF units/ml and IFN units/ml added to 
cultures: control ciiltures; O, hTNF alone at 25 units 
(BT'20), 50 units (ME-180), 100 units (SK-MEL-109), 
or 200 units (T-24); IFN alone at 31.25 units (BT- 
20) or 125 units (ME-180, SK-MEL-109, T-24); A, 
hTNF and IFN together, in the amounts given above 
for each cell line. 



tectable IFN activity, and the various human IFNs lack de- 
tectable TNF activity. Hius, TNF and IFN represent two classes 
of antitumor substances produced by cells of the hematopoietic 
system and, in the case of IFN-^, by other ceD types as well. 
There are a number of other lympholdnes that have been de- 
scribed, but these appear to act predominantly by regulating 
immune responses rather than having a direct cytotoxic or cy- 
tostatic effect on cells. Lymphotoxin is an exception in this re- 
gard (19, 20), and further studies on the reactivity of lympho- 
toxin with human cancer cells and its relationship to TNF are 
required. 

In view of the strong anticellular activity of IFNs on a broad 
range of cancer cells in vUro^ as shown in this study and in stud- 
ies by others (21-24), it is surprising that the clinical effects of 
IFN have not been more evident. The in vitro synergistic an- 
titumor action of TNF and IFN observed in the present study 
suggests that combined treatment with both &ctors might re- 
sult in better clinical responses and may also provide clues widi 
regard to requirements for optimal antitumor effects of inter- 
feron in tdvo. If other factors, such as TNF, are necessary for 
maximal IFN action, deficiencies of these Eictors in the tumor 
or in the patient could account for the relatively modest anti- 
tumor effects (tf IFNs observed to date. 

This work was supported In part by Grants CA-06748 and AI-17920 
from the National Institutes of Health and by a grant from the Cancer 
Research Institute. 
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ROLE OF INTERLEUKIN I IN ANTI-IMMUNOGLOBULIN- 

INDUCED B CELL PROLIFERATION* 

By MAUREEN HOWARD, STEVEN B. MIZEL, LAWRENCE LACHMAN, 
JOHN ANSEL, BARBARA JOHNSON, and WILLIAM E. PAUL 

From ike Laboratory of Immunology, National Institute of Allergy and Infectious Diseases, National 

InstituUs of Health, Bethesda, Maryland 20203, The Microbiology Program, Pennsylvania State 
University, University Park, Pennsylvania 16602, and Immunex Corporation, 51 University Buildings 

Seattle, Washington 9SWI 

The use of anti-Ig antibodies to stimulate polyclonal proliferation through Ig 
receptors expressed on the B cell membrane has been a popular model for antigen- 
driven B cell activation (1-7). Initially it was proposed that anti-Ig-induced prolifer- 
ation proceeded independently of T cells and other accessory cells, as vigorous 
depletion of such cell types failed to prevent activation (3, 8), However, certain 
features of anti-Ig-induced B cell activation challenged the view that the response was 
a simple consequence of the interaction of anti-Ig and membrane Ig. In particular, 
the relationship between cell density and magnitude of the proliferative response was 
nonlinear and rapidly declined to background proliferation levels at cell numbers 
below 10^ per microliter well (3, 6). Furthermore, the optimum doses of anti-Ig used 
in such studies far exceeded the amount required to saturate membrane Ig receptors 
and induce capping of membrane components. Precise delineation of the stimuli 
required for anti-Ig-induced B cell activation requires a functional assay in which 
€:ontaminating accessory cells potentially capable of endogenous factor production 
have been excluded. To this end, we have investigated conditions required for 
polyclonal activation of highly purified mouse B lymphocytes cultured at low cell 
density (e.g., 5 X 10* cells/well). Using such an assay, we have shown the role of a T 
cell-derived factor, designated B cell growth factor (BCGF),^ in anti-IgM-induced B 
cell proliferation. (9). In this report we demonstrate that in the presence of optimum 
amounts of BCGF, anti-IgM-induced proliferation of B cells cultured at lower densities 
(1-2 X 10* cells/well) is enhanced by the addition of a second soluble factor. 
Biochemical analyses identify this second cofactor as the previously described mon- 
okine interleukin 1 (IL-1). 

Materials and Methods 

Mice, BALB/cJ mice were obtained firom The Jackson Laboratory, Bar Harbor, ME, and 
used at 8-12 wk of age. CH3/HeJ mice, also obtained from The Jackson Laboratory, were used 
at 5-8 wk of age. 

• Supported in part by grant AI 17559 from the U. S. Public Health Service and grant PCM-8 110370 
from the National Science Foundation, both to S. B. Mizel. 

^Abbreviations used in this paper: BCGF, B cell growth Eactor; lEF, isoclectricfocusmg; IL-l, IL-2, 
interleukins 1 and 2; PAGE, polyacrylamide gel electrophoresis; PMA, phorbo! myristate acetate; UV, 
ultraviolet. 
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Anti-IgM Antibodies. Affinity-purified goat anti-mouse Ig specific for ^ heavy chains (anti* 
IgM) was prepared as described previously (6). 

Culture Medium. The culture medium used throughout was RPMI 1640 (Gibco Laboratories, 
Grand Island, NY) supplemented with 10% fetal calf serum (Reheis, Kankakee, IL), penicillin 
(50 pig/ml), streptomycin (50 ^g/ml), gentamycin (100 fig/m\), L-glutamine (200 mM)» and 2- 
mercaptoethanol (5 X 10"* M) (2ME). 

Growth Factor Preparations. EL4 supernatant was produced by stimulating a cloned subline 
of EL4 thymoma with 10 ng/ml phorbol myristate acetate (PMA) as described elsewhere (10). 
Cell-free supematants were collected after 48 h and depleted of PMA by adsorption on 
activated charcoal (11). For some experiments, BCGF was partially purified from EL4 super- 
natants by phenyJsepharose chromatography or isoelectricfocusing (lEF) (12).^ 

Macrophages were stimulated to produce growth factors by four different procedures. 

(a) Stimulation of the cloned murine monocytic cell line P388Di with 1 /ng/ml PMA, as 
described by Mizcl et al. (13). Cell-free supematants were collected after 5 d and depleted of 
PMA by two successive adsorptions on activated charcoal. For some experiments, the super- 
natants were concentrated —200-fold by Amicon filtration, then fractionated by gel filtration 
on an AcA54 column as described previously (10) to yield material of 10,000-20,000 mol wi. 

(A) Treatment of P388Di cells with 2-4 min ultraviolet (UV) irradiation (200-400 mj/cm*) 
using a bank of four Westinghouse FS20 bulbs) followed by their culture for 24 h in serum-free 
RPMI as described (Ansel, J., T. Luger, and I. Green, manuscript submitted for publication). 

{c) A su peri n duct ion protocol, designed to enhance IL-1 production by P388Di cells for 
factor purification purposes. This protocol involved co-culture of cells with protein and RNA 
synthesis inhibitors in addition to PMA for 5 h, removal of these agents by washing, then 
continued culture of the activated P388Di cells for a further 24 h (14). 

{d) Stimulation of human acute monocytic leukemia cells with endotoxin and collection of 
cell-free supematants 48 h later, as described by Lachman et al. (15). 

B Cell CO'Stimulator Assay. Full details of this assay are given elsewhere (9). Briefly, splenic 
B cells were puriHed by the procedure of Leibson et al. (16), then cultured at densities ranging 
from 10^ to 5 X lOVwell in 200 fil medium in flat-bottomed 96-well microtiter plates (0.32 cm 
growth area). Some cultures contained afluiity-purified goat anti-IgM antibody at 5-10/ig/m], 
and/or dilutions of the various growth factor preparations. Cultures were incubated at 37^ C in 
a humidified atmosphere of air containing 7.5% CQz for 72 h. The proliferative resp>onse of 
these cultures was determined by adding flflthymidine (1 ^Ci, 6.7 Ci/mmol; New England 
Nuclear, Boston, MA) for the last 12-16 h of culture and measuring [^H]thymidine incorpo- 
ration. 

Purification of IL-l. Mouse and human IL-l were purified according to previously published 
procedures (14, 17, 18). Mouse IL-1 in super-induced P388D] supematants was purified to 
apparent homogeneity using the following protocol: gel filtration chromatography, lEF, sulpho- 
propyl Sephadex cationic exchange chromatography, and iris glyeinate polyacrylamide gel 
electrophoresis (PAGE). Human IL-1 was purified to high specific activity using the following 
procedures in sequence: hollow fiber diafiltration and ultrafiltration, lEF, and tris glyeinate 
PAGE (18, 19). Criteria for purity of the preparations yielded by these protocols are outlined 
elsewhere (14, 17, 18). 

IL-l Assay. Mouse IL-1 was measured as a co-mitogen with phytohcmagglutinin in a mouse 
thymocyte [^Hjthymidinc incorporation assay (19). Human IL-1 was measured as a direct 
mitogen for mouse thymocytes in a [^H]thymidine incorporation assay (18). 

Results 

BCGF Dependence of Anti-Ig-induced B Cell Proliferation. As has been shown previously 
(3, 6), a careful analysis of the relationship between B cell density and anti-IgM- 
induced proliferation revealed a sharp decline in response at cell numbers below 10^ 
per microtiter well (Fig. 1 A). This result strongly suggested that in addition to anti- 
IgM and the B lymphocytes, other elements in the culture were limiting. We have 

*Farrar, J., M. Howard, J. FuUcr-Farrar, and W. E. Paul. Biochemical characterization of a murine B 
cell growth factor distinct from interteukin 2. Manuscript submitted for publication. 
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already shown (9) that induced supematants from the mouse thymoma £L4 produce 
two striking effects in cuhures of anti-IgM stimulated B cells: (fl) a shift in the 
antibody concentration/response relation such that 5 fig/m\ anti-IgM plus EL4 
supernatant induces levels of proliferation greater than those induced by 50 /ig/ml 
anti-IgM alone, and (b) a shift in the cell density/response relation such that a very 
substantial response to 5-50 /ig/ml anti-IgM can be obtained with as few as 5 X lO'* 
B cells/well. Here we see that £L4 supernatant enhanced the B cell proliferative 
response at cell densities ranging from 5 X 10^/well to 5 X lOVwell, and that 
essentially identical results were obtained with concentrations of EL4 supernatant 
ranging between 5 and 30% indicating saturation conditions (Fig. 1 A). The B cell co- 
stimulating factor in ElA supematants has been examined in detail elsewhere (9, 12).^ 
These studies show that the factor, designated BCGF, has an apparent molecular 
weight by gel filtration of 18,000 and is clearly distinct from interleukin 2 (IL-2) by 
gel filtration, phcnylsepharose chromatography, sodium dodecyl sulfate-PAGE, and 
lEF, as well as by cellular absorption studies. BCGF partially purified by lEF caused 
the same maximal enhancement of response to anti-IgM as did unfractionated EL4 
supernatant (Fig. IB). This strongly suggests that the enhancing activity of EL4 is 
attributable to its BCGF content. 

A Monokine Enhances BCGF-dependent Anti-IgM-induced B Cell Proliferation. To explore 
the p>ossibility of a macrophage-derived factor also being involved in anti-IgM- 
induced B cell proliferation, we selected B cell co-stimulator assay conditions that 
yielded low proliferative responses on a per cell basis to anti-IgM even in the presence 
of saturating amounts of EL4 supernatant, i.e., 1-2 X 10^ B cclls/microtiter well plus 
10% £L4 supernatant (see Fig. 1 A), and examined the effects of supplementing these 
cultures with induced supematants fxxsm the cloned murine macrophage cell line 
P388Di. Others have shown that such supematants are excellent sources of monokines 
such as IL-1 (13, 14, 20). Unfractionated PMA-induced P388Di supematants were 
found to profoundly inhibit both BCGF-dependent anti-IgM-induced and unstimu- 
lated background levels of B cell proliferation (Table I). As unfractionated supema- 
tants may contain both inhibitors and enhancing factors, we repeated these experi- 
ments using fractionated components of induced P388Di supematants. Thus, P388Di 
suf>ematants were fractionated by gel filtration using a calibrated AcA54 column 
exactly as descrit>ed elsewhere (10). Each column fraction was dialyzed against culture 
medium, and added at five serial dilutions to cultures containing lO'* highly purified 
B cells, anti-IgM antibodies, emd a saturating amount of the EL4 suf>ernatant. The 
results obtained at a 1:16 dilution of column fractions revealed that proliferation of 

Table I 

Effect of Unfractionated PMA-induced Macrophage Supematants on B Cell 



Proliferation 


Macrophage factors 


T cell factors 


cpm/5 X lO'* B cells 


No anti-IgM 


Plus AntMgM 


PSaSDi supernatant 
P388Di supernatant 


£L4 supernatant 
EL4 supernatant 


1,713 ± 267 

10 ± 41 
3,436 ± 1,021 
358 ± 27 


2,133 ± 301 
189 ± 34 
16,277 ± 222 
482 ± 103 



25% P388Di supernatant; 10% EL4 supernatant; 5 fig/ml anti-IgM. 
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such low numbers of B cells was markedly enhanced by the addition of macrophage- 
dcrived product (s) in the molecular weight range of 10-20,000 (Fig. 2). This material 
was not mitogenic for resting B cells, nor did it stimulate resting B cells in the presence 
of £L4 supernatant only. For convenience, the material was initially termed M 1 5 to 
signify its origin (i.e., macrophage) and approximate molecular weight (i.e., 15,000). 
A pool was made of those fractions constituting the major peak of Ml 5 activity (see 
hatched area of Fig. 2), and this pooled material was titrated under identical assay 
conditions. The results showed Ml 5 to be (a) inhibitory at high concentrations and 
{b) active over a wide concentration range below the inhibitory doses (Fig. 3 A). These 
findings have been reproduced using several different batches of M15 prepared from 
separate PMA-induced P388Di supernatants. Similar B cell co-stimulator activity 
was found in supematants obtained from UV-irradiated P388Di cells (Fig. 3B), 
thereby allaying concern regarding the potential involvement of PMA in the co- 
stimulatory activity of P388Di supematants. 

The effect of M15 on the cell density /response relationship of anti-IgM-induced B 
cell proliferation was tested using a 1: 16 dilution of Ml 5, i.e., the optimal concentra- 
tion of this material (see Fig. 3 A). B cell proliferation was enhanced at all cell densities 
tested (Fig. IG). When the same experiment was performed in the presence of a 
saturating amount of the T cell-derived lymphokine BCGF, a synergistic effect of the 
two factors was observed, particularly at low densities (Fig, 1 G). Identical results were 
obtained when a PMA-free source of M15 (i.e., supernatant of UV-irradiated P388Di 
cells) was used (Fig. 4 A). The results shown in Fig. 1 have been corrected for 
proliferation obtained in the absence of anti-IgM antibodies, and thus represent anti- 
IgM-dependent activation. We emphasize that both M15 (see Figs. 2 and 3) and 
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Fig. 2. Gel filtration analysis ofB cell co-stimulator activity in PMA-induced P388D] supernatant. 
Fractions collected from an AcA54 column were dialysed against culture medium, then added at a 
final concentration of 1:16 to cultures containing tO^ purified B cells per well and 10% £L4 
supernatant with or without anti-IgM antibodies (5 f^/ml). Prolireration was assessed at day 2.5 by 
['H]thymidine uptake. The calibrating molecular weight markers were ovalbumin (43,500), myo- 
globin (18,800), and cytochrome r (12,384). All results represent duplicate cultures. 
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Fio. 3. B cell co-stimulacor activity in (A) a 10,000-20,000-nioJ wl pool made from PMA-induccd 
P38dDi supernatant (equivalent to hatched area of Fig. 2), or (B) supernatant collected from UV- 
irradiaccd P388Di cells (refer to Materials and Methods). Supemacants were added at final dilutions 
of from 1/2 to 1/512 to cultures containing 10* purified B cells and 10% EL4 supernatant, with 
(•) or without (O) anti-IgM antibodies (5 /ig/mJ). Proliferation was assessed at day 2.5 by pH]- 
thymidine uptake. Results repesent duplicate cultures. 




B CELLS/WEU B CaLS/WOJ. 

Fic. 4. Proliferative response of purified B cells cultured with anti-IgM (5 fig/ml) together with 
various co-factors either in culture medium containing 2 X 10~* M 2ME or in 2ME-frcc medium. 
Co-factor supplements were: (a) phenyl Sepharose-purified BCGF, prepared as in reference 12 (also 
footnote 2) and added at a 1:50 final dilution found to be saturating for co-stimulator activity at the 
B cell density of 5 X lOVwell; (6) supernatant collected from UV-irradiatcd P388Di cells and added 
at a final dilution of 1:4 (designated MI5}; {c) medium only (NO GF); (d) a mixture of (a) and (b). 
Other assay conditicMns were as in Fig. 1. 

BCX^F (9) produce little proliferation in the absence of anti-IgM antibodies. In 
summary, these data strongly imply roles for both BCGF and M15 in anti-IgM- 
Induced B ceil proliferation. 

Absolute Need for Ml 5 in 2ME-free Cultures. M15-mediatcd enhancement of anti-Ig 
induced BCGF-depcndcnt B cell proliferation was highly reproducible from experi- 
ment to experiment. However, the magnitude of its effect was often relatively small 
(two- to fourfold), making it difQcult to clearly determine whether BCGF and M15 
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acted synergist ically or additively. This problem was resolved by taking advantage of 
the observation that B cell proliferation exf>eriments performed in culture medium 
lacking 2ME show an exaggerated need for exogenously added M15. Fig. 4 shows B 
cell proliferative responses to anti-IgM plus cofactors in medium containing or lacking 
2M£. While the maximum proliferation levels obtained in the presence of anti-IgM, 
BCGF, and M15 were the same in both media, a significant response to anti-IgM 
alone, i.e., in the absence of exogenously added BCGF and Ml 5, was only obtained 
when B cells were cultured in 2ME-containing medium; in the absence of 2ME a 
response to anti-IgM and BCGF was only observed at 5 X 10^ cells/well, the highest 
cell density tested (Fig. 4 B). In cultures lacking 2ME, M15 has a marked costimulatory 
effect at all densities. These data prompt speculation regarding the possibility of 
endogenous 2M£-mediated Ml 5 production by residual macrophages and/or B cells 
in our standard BCGF co-stimulator assay. The striking Ml 5-dependent enhancement 
of B cell proliferation observed in 2ME-free cultures clearly establishes the synergy 
between BCGF and M15, suggesting that both co-factors operate on the same 
population of B cells. 

Identification of MI 5 as IL- /. In terms of approximate molecular weight and cellular 
origin, M13 resembles the previously described monokine IL-1. To investigate the 
possible identity of these two factors, we tested murine and human IL-1, purified 
according to previously established procedures, for M15 activity, i.e., ability to 
enhance the proliferation of low numbers of purified B cells cultured with anti-IgM 
and saturating doses of £L4 supernatant. 

Murine IL-1 can be obtained in large quantities by a superinduction procedure 
involving pre-culture of P388Di cells with PMA in the presence of protein and RNA 
inhibitors (14). Mizel and Mizel (14) have purified the IL-1 in this crude supernatant 
to apparent homogeneity by a modification of a previously published fractionation 
scheme involving gel filtration, lEF, sulpho-propyl Sephadex cationic exchange, and 
tris glycinate PAGE. As expected, the 15,000-mol wt material obtained by gel 
filtration of supernatant from superinduced P388Di cells showed excellent M15 
activity when added to cultures of highly purified B cells, anti-IgM, and saturating 
amounts of EL4 supernatant (data not shown). The 15,000-mol wt material was then 
ftirther purified by lEF, sulpho-propyl Sephadex cationic exchange, and tris glycinate 
PAGE. Fractions from the tris glycinate gel were dialyzed, then assayed for M15 
activity using the B cell co-stimulating assay described above and for IL-1 activity 
using the conventional mouse thymocyte proliferation assay. The two activity profiles 
were indistinguishable (Fig. 5), indicating either that M15 is IL-1 or that there is a 
great biochemical similarity between them. 

Human and murine IL-1 are functionally interchangeable in a variety of T cell 
assays (S. B. Mizel, unpublished observations). Thus, to further assess the relationship 
between MIS and IL-1, we tested purified human IL-1 for M 15 activity. Human IL- 
1 can be obtained in large quantities by incubating freshly collected acute monocytic 
leukemia cells with endotoxin (15). Lachman et al. (17) have purified the IL-1 in this 
crude supernatant to high specific activity by sequential fractionation involving 
hollow fiber filtration, lEF, and tris glycinate PAGE (17, 18). This combination of 
procedures generally yields a preparation that is free of endotoxin, as assessed by the 
Limulus assay (L. Lachman, unpublished data). The 10-50,000-mol wt material 
obtained by hollow fiber filtration was subjected to lEF, and dialyzed fractions were 
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Fio. 5. B cell cx>stimuLatmg activity of highly purified murine IL-1. Murine IL-l from superin- 
duced P388Di supernatants was purified to apparent homogeneity as described in text. Dialyzcd 
fractions obtained from the final tris glycinate gel electrophoresis were added at several dilutions 
ranging from 1/8 to 1/2048 to cultures of 2 X 10^ B celts, 10% EL4 supernatant, and anti-IgM (5 
^g/ml) (O), or to cultures of 1.5 X 10^ mouse -thymocytes and phytohennagglutmin (•). In both 
cases, proliferation was assessed at day 2.5 by [^H]thymidine uptake. For each fraction, relative 
units of activity r^resent the inverse of the dilution that produced 50% of the maximum proliferation 
obtained in that assay. All results represent means of duplicate cultures. 
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Fig. 6. B cell c^stimulating activity of partially purified human IL-l. Supernatants from endo- 
toxin -stimulated human monocytic leukemia cells were fractionated by hollow fiber filtration, and 
the 10~50,00O-mol wt component further purified by lEF. Dialyscd fractions were then added ai 
several dilutions ranging from 1/50 to 1/3,200 to cultures of 2 X 10* B cells, 10% EL4 supcmatani, 
and anti-IgM (5 fig/ml) (O), or at 1:100 (final dilution) to cultures of 1.5 X 10® mouse thymocytes 
(•). In both cases, proliferation was assessed at day 2.5 by [^H)thymidine uptake. Units of B cell co- 
stimulator activity were calculated as outlined in Fig. 5. All results represent duplicate cultures. 



assayed for M15 and IL-l activity. The two activities had a concordant distribution 
v^ith a peak in activity in the pi 7.0 fractions (Fig. 6). The pi 6.8-7.2 fractions from 
lEF that contained both M15 and IL-l activity were pooled and electrophoresed on 
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Fig. 7. B cell co-stimulating activity of purified human IL*1. Two separate preparations (#>■) of 
human IL-l from endotoxin-stimulated mcmocytic leukemia celk were purified to high specific 
activity as described in text. Each preparation was added at the various dilutions indicated to 
cultures of 2 X 1<>* B cells, 10% EL4 supernatant with or without anti-IgM at 5 iig/ai\ (B cell co- 
stimulator assay), or mouse thymocytes fT cell co-stimulator assay). Proliferation was assessed at 
day 2.5 by [^H] thymidine uptake. Results represent means of duplicate cultures. 



a tris glycinate polyacrylamide gel. In two separate experiments, the fraction that 
contained IL-l activity was shown to be also rich in M15 activity (Fig. 7). 

The biochemical results correlating B cell and thymocyte co-stimulating activities 
from two separate sources of IL-l provide strong evidence that M15 is the monokine 
IL-l and thus that IL-l, together with BCGF, plays a key role in the stimulation of 
anti-IgM treated B cells to enter the S phase of the cell cycle. 



Discussion 

We report here conditions for polyclonal activation of small numbers of highly 
purified mouse B lymphocytes. Three stimuli are required for induction of DNA 
synthesis by the particular subset of small B lymphocytes investigated: one delivered 
by antibodies specific for the IgM receptor expressed on the B cell membrane; one 
delivered by the T cell-derived factor BCGF; and one delivered by the 15,000-mol wt 
macrophage-derived factor IL-l. BCGF is a newly described lymphokine discussed in 
detail elsewhere (9, 12).^ Identification of the monokine involved in this process as IL- 
l is based on correlation of B cell and T cell co-stimulating activities following an 
extensive series of biochemical purification procedures. Indeed, the purest preparations 
of murine and human IL-l currently available show excellent B cell co-stimulating 
activity in our assay systems. We do not believe that the B cell co-stimulatory activity 
contained in IL-l -containing su}>ernatants can be attributed to the PMA generally 
used for IL-l induction for several reasons: (a) PMA-free IL-l -containing supematants 
obtained from UV-irradiated P38dDi cells contain the B cell-costimulant; (b) The 
IL-l -rich supernatant obtained by the superinduction protocol is produced by expos- 
ing P388Di cells to PMA for 5 h, extensively washing the cells, culturing them for 24 
additional h and collecting that supernatant. No PMA can be detected in the purified 
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preparation, as assessed by the inclusion of isotopicaily labeled PMA in test batches 
of supernatant (Mizel, unpublished data); (c) The human IL-1 -containing superna- 
tant is induced by endotoxin rather than by PMA; and (d) The B cell co-stimulant 
resided in IL- 1 purified to apparent homogeneity. We wish to emphasize that BCGF 
and IL-1 do not appear to be mitogenic for resting B cells. Furthermore, BCGF 
derived from an line (EL4) functions on BALB/c B cells and lL-1 derived 

from an line (P388Di) functions on CBA (H-2'') B cells (unpublished data). 

Thus, these factors act on anti-IgM-activated B cells in a non-antigen-specific, non- 
H-2-restricted manner. The fact that BCGF and IL-1 co-stimulatory effects are 
synergistic rather than additive suggests the two factors operate on the same B cell 
subset. We show elsewhere that BCGF does not lead to the appearance of antibody- 
forming cells, but is nevertheless an essential component in both an antigen-induced 
antibody-forming ceil response (9) and in the induction of cells with cytoplasmic Ig 
in resf>onse to anti-IgM.^ Preliminary experiments suggest a similar role for IL-1 
(unpublished observations). The involvement of accessory cells in anti-IgM-induced 
B cell proliferation has previously been a matter of controversy; however, involvement 
of either macrophages or T cells has been reported by some laboratories (28-30). We 
suggest that failure to obseve the need for one or both cell types reflects the use of 
assay conditions in which small numbers of one or both accessory cells are present 
with consequent endogenous factor production. Our own experiments show a need 
for exogenous BCGF and IL-1 only at low cell densities, despite the rigorous B cell 
' purification procedure we have used. Thus it would appear that very small numbers 
of accessory cells can provide B cell stimulatory co-factors. 

Three major models to explain antigen-specific B cell activation have previously 
been proposed: (a) activation occurs as a result of the binding of antigen to membrane 
Ig (8» 31); (b) activation occurs via an unrelated nonspecific receptor, with membrane 
Ig serving to specifically focus molecules capable of binding to this receptor (32); and 
{c) activation requires both a signal resulting from the binding of the Ig receptor and 
a signal delivered by an interaction at another antigen-nonspecific receptor on the B 
cell membrane (33). In this study, we arrive at the novel conclusion that IgM-specific 
induction of B cell proliferation requires three stimuli: one specific for the antigen 
receptor, and two antigen-nonspecific stimuli. Preliminary kinetic analyses investigat- 
ing the relative roles of BCGF and IL-1 in the B cell cycle suggest the two factors 
function independently, with BCGF operating on Go or early Gi cells and lL-1 acting 
at some later point of Gi (34; M. Howard and E. Rabin, unpublished data). It seems 
likely that the time of action of the co-stimulant correlates with that time that 
receptors for them are expressed on the B cell surface, just as T cell sensitivity to IL- 
2 appears to correlate with the expression of receptors for IL-2 on T cells that have 
been stimulated with lectins or antigens (35-37). However, no direct evidence for 
receptors for BCGF or IL-1 on activated mouse B cells has yet been obtained. 

This report establishes a role for IL-1 in stimulation of B cells. While others have 
previously proposed roles for IL-1 in B cell responses (38-44), such experiments were 
conducted using less pure B cell populations cultured at high cell density and thus 
failed to distinguish a direct action on B cells vs. secondary actions via other ceil 

* Nakanishi, K., M. Howard, A. Muraguchi, J. Farrar. K. Takatsu, T. Hamaoka, and W. E. Paul. 1983. 
Soluble factors involved in B celt diflerentiation: ideniification of two distinct T cell replacing factors 
(TRF).y. fmmunol. In press. 
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types. A particular problem in this respect is that IL-1 appears responsible for the 
generation of at least some T cell-derived lymphokines, e.g., IL-2 (22-25) and colony- 
stimulating factor (26). With the recent identification of several distinct B cell-specific 
lymphokines^ (9, 16, 27, 30, 45-48) , one could certainly envisage IL-1 -mediated factor 
cascades in many biological assays for B cell function. While such objections may be 
leveled at our experiments, we offer the following considerations: (a) We have used 
very highly purified B cells cultured at low cell densities, thus greatly reducing the 
possibility of accessory cell contamination. (6) Wc show elsewhere (34) that IL-1 may 
be added quite late in the B cell cycle, thus providing little opportunity for a factor 
cascade to develop, {c) Other T cell-derived B cell^pecific lymphokines, e.g., BCGF, 
and the T cell-replacing factors from the EL4 (EL-TRF) and the B 15K12 (B15-TRF) 
T cell lines^ fail to replace the need for IL-1 (unpublished data). Nevertheless, ultimate 
proof of IL-1 acting directly on B cells will await reagents capable of demonstrating 
its specific binding to cellular receptors. The observation that IL-1 dependency of B 
cell proliferation is heightened by omitting 2M£ from the culture medium is reminis- 
cent of the earlier studies of Mongini et al. (28), who demonstrated an accessory cell 
requirement for anti-IgM induced B cell proliferation under 2ME-free conditions. 
2ME has been used extensively as an additive to culture rhedium in a variety of tissue 
culture systems to either augment the magnitude or facilitate observation of the 
particular phenomenon under study. While 2ME has been directly implicated as a 
lymphocyte mitogen and polyclonal activator (49, 50), our experiments suggest an 
alternative mode of action, namely as a stimulant that induces IL-1 production by 
residual macrophages and/or B cells. 

We wish to emphasize that the conclusions reached from these studies reflect the 
activation requirements of the subset of B cells that can proliferate in response to 
anti-IgM antibodies. This subset has recently been quantitated as comprising ^50% 
of normal splenic B cells (21). Such cells are absent from xid mice and appear to be 
members of the Lyb-5^ population of normal B cells. Thus, our data do not exclude 
the existence of a separate subset of B cells which interact with T cells in an H-2- 
restricted manner, as demonstrated by others (51-53). Finally, it is difficult to relate 
the findings of this investigation to those recently obtained in other laboratories using 
elegant single-cell assays for B cell growth (54-56). As such studies have not used anti- 
IgM antibodies as the primary activant, different B cell subsets may be under study. 
Furthermore, the increased sensitivity of single-cell assays over those described in this 
report may allow detection of trace amounts of growth factors in cell supernatants or 
indeed in the fetal calf serum contained in culture medium itself, producing results in 
apparent conflict with ours. Attempts to resolve some of these issues arc in progress. 

Summary 

In this report we describe conditions for polyclonal activation of small numbers of 
highly purified mouse B lymphocytes. Three signals are required for induction of 
DNA synthesis by the particular subset of small B lymphocytes investigated: a signal 
delivered by antibodies specific for the IgM receptor expressed on the B cell mem- 
brane; a signal delivered by a T cell*derived factor (B cell growth factor [BCGF]); 
and a signal delivered by the macrophage-derivcd factor interleukin 1 (IL-1). The 
conclusion that IL-1 has B cell co-stimulator activity is based on the findings that 
highly purified preparations of mouse and human IL-1 have the capacity to cause 
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proliferation in B cells treated with anti-IgM and BCGF, Such cultures show an 
absolute dependence on exogenously added IL-1 when 2-mercaptoethanol is omitted 
from the medium. BCGF and IL-1 each act in a non-antigen-specific, non-H-2- 
restricted, synergistic manner. Their requirement is not observed when B cells are 
cultured at high density, presumably reflecting accessory cell contamination and 
endogenous factor production under these conditions. The B cell activation induced 
- by these three signals is restricted to proliferation without the production of antibody- 
forming cells. 

We thank K. Nakanishi and R. Asofsky for providing afTlnity-purified goat anti-mouse IgM 
antibodies, M. Hilfiker for assistance with gei filtration analyses, and J. Farrar for lEF-purified 
BCGF. 
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BINDING AND INTERNALIZATION OF INTERLEUKIN 1 BY 

T CELLS 

Direct Evidence for High- and Low-Affinity Classes of Interleukin 1 

Receptor 

By JOHN W. LOWENTHAL and H. ROBSON MacDONALD 

From the Ludwig Institute for Cancer Research, Lausanne Branch, 1066 Epalinges, 

Switzerland 

IL-1, a macrophage-derived 17,500 Mr glycoprotein, has been shown to 
mediate a wide spectrum of biological activities (1, 2). Insofar as T cell-mediated 
immunity is concerned, IL-1 has been shown in several studies (S-9) to induce 
Th cell secretion of the growth hormone lL-2, while in other systems it has been 
shown to provide an obligatory signal for IL-2-R expression (10, 1 1). The study 
of the regulatory role of IL-1 in T cell proliferation has been greatly facilitated 
by the recent development of a direct radiolabeled IL-1 binding assay by Dower 
et al. (1 2). These authors showed that IL-1 bound to specific cell surface receptors 
that have a molecular weight of around 80,000. The general level of IL-l-R 
expression by a variety of cell types was found to be much lower than that of 
receptors for other hormones, such as epidermal growth factor, insulin, and IL- 
2. Various T cell populations, for example, were shown to express only 27-550 
IL-l-R per cell (12). 

We have recently characterized a mutant subline of the £L4 thymoma (£L4- 
6.1), which is induced to simultaneously secrete IL-2 and express IL-2-R in the 
presence of IL-I (1 3). We show in the present report, using a direct IL-1 binding 
assay that these cells express extremely high numbers of IL-l-R (~20,000 per 
cell). The IL-l-R expressed by EL4-6.1 cells can be resolved into two classes: the 
vast majority of receptors bind lL-1 with a of '-200-500 pM, whereas a 
second class of receptor, making up 1-2% of the total, binds IL-1 with a 100- 
fold higher affinity {K^i of 3-8 pM). This latter value is equivalent to the 
concentration of IL-1 that gives 50% biological activity. In addition, the two 
classes of FL-l-R can be distinguished on the basis of their ability to internalize 
IL-I; it appears that only high-affinity IL-l-R can do so. Other cell tyf>es also 
express both classes of IL-l-R, but the absolute number of receptors per cell is 
considerably less than on EL4-6. 1 cells. Because of their high degree of respon- 
siveness to IL-1 and the expression of unusually high numbers of IL-l-R, EL4- 
6.1 cells offer a valuable system with which regulation of IL-l-R expression and 
the mechanism of IL-1 action can be studied in detail. 

1060 J. ExP. M^D. © The Rockefeller University Press 0022-1007/86/10/1060/15 $1.00 

Volume 164 October 1986 1060-1074 



LOWENTHAL AND MacDONALD 



1061 



Materials and Methods 



Cell Cultures, The murine T cell lines used in this study were: £L4 thymoma sublines 
EL4-6.1 (14)» EL4-10 (subclone of EL4-6.1), EL4-3 (an independent non-IL-2-secreting 
subline), EL4-RN. and EL4-RP (kindly provided by O. Kanagawa, Lilly Research Labo- 
ratories, La Jolla. CA); the I L-1 -responsive T lymphoma LBRM-33-I A5B6 (LBRM, 
reference 15); the lL-2-dependent cytolytic T cell line CTLL (16); and the T lymphomas 
ST-4.2 (17), BW5147 (18), and Yac (19). Nylon wool-purified peripheral T cells from 
lymph nodes (LNT)* were prepared as described (20) and were >95% Thy-l* as judged 
by FACS analysis. Cells were cultured in enriched DM£ (21) and maintained at 37 °C in 
a humidified atmosphere of 5% COs in air. PMA was purchased from Sigma Chemical 
Company, St. Louis, MO. lonomycin was obtained from Calbiochem-Behring Corp., La 
Jolla, CA. Human rIL-2 was provided by Biogen SA, Geneva, Switzerland, and was 
radioiodinated according to the method of Robb et al. (22). 

Interleukin I. Human recombinant I L-1 (o and forms, reference 23) was kindly 
provided by Dr. C. Henney» Immunex Corp., Seattle, WA. A detailed description of their 
preparation and purity is reported elsewhere (24). The specific activity of rIL-l-a and 
was 1 0^ U/mg in both the murine thymocyte assay (1.3) and in the EL4-6.1 IL-2 induction 
assay (13). 

Biological Assay for IL-l. The biological activity of lL-1 was measured by its ability to 
stimulate IL-2 production by EL4-6.1 cells in the presence of lonomycin (13). EL4-6.1 
cells (10* cells/microwell) were cultured in the presence of various concentrations of IL- 
1, and 0.1 fig/m] of lonomycin. After 24 h, the cell-free supernatant was measured for 
IL-2 activity using the CTLL line (16), according to the method of Landegren (25). In 
this assay. 1 U/ml of lL-2 supports 50% maximal proliferation during a 48-h culture 
f>eriod. In some experiments, I L-1 biological activity was measured by its ability to induce 
IL-2-R expression by EL4-6.1 cells in the presence of suboptimal concentrations of PMA. 
Briefly, EL4-6.1 cells (5 X 10* cells/ml) were cultured in the presence of 0.3 ng/ml of 
PMA plus various concentrations of IL-1. After 48 h, expression of IL-2-R was measured 
by using a direct IL-2 binding assay as described previously (26, 27). 

IL-1 Absorption Assay. Various cell lines were extensively washed and then incubated 
(10^ cells/ml) at 4 in the presence of a known concentration of rIL-1. After 5 h, the 
cell-free supernatant was measured for residual IL-1 activity in both the lL-2 production 
and IL-2-R expression assays using EL4-6.1 cells (13). 

Radioiodmation of rIL-L rIL-l-a was radioiodinated using the chloramine T method 
(28). The method used was similar to that described for IL-2 by Robb et al. (22), and was 
according to the procedure of Dr. S. K. Dower, with some modifications. Small quantities 
of rIL-l-a (500 ng in 50 ii\ of PBS pH 7.2) were added to a mixture of 1 5 m1 of cMoramine 
T (30 Mg/ml in PBS) and 1 mCi (10 mO of Na'^1 (Amersham IntemaUonal, Amersham, 
United Kingdom). The reaction was carried out in an Eppendorf tube at 4**C for 10 min. 
The labeled rIL-1 ('"I-IL-1) was separated from free Na'"I on a Sephadex G-25 column 
(PD- 10; Pharmacia Fine Chemicals, Uppsala, Sweden). ~90% of the radioactivity present 
in the pooled fractions 5-11 (Fig. 1), was TCA-precipitable. Biological activity of ^^I-IL- 
1 was measured as described above. ^^60-70% of the initial protein was recovered as 
measured from experiments using mock-labeled IL-1. The specific activity of the *^*1-IL- 
1 was 10^ cpm/ng and produced a single band of 17,000 Mr when analyzed by SDS- 



/L-7 Binding Assay. Binding of '^^I-IL>1 was measured according to the method 
of Dower et al. (12) and our previously published procedures for radiolabeled IL-2 (27). 
Briefly, aliquots of 10^ cells were incubated in the presence of various concentrations of 
***I-1L-1 at 4*'C for various periods of time (equilibrium binding occurs within 2-4 h). 
Free radioactivity was separated from bound by centrifugation through an oil gradient 
(26). Nonsj>ecific binding was measured in the presence of a 50- to 100-fold excess of 
unlabeled IL-1. Scatchard plot analysis of equilibrium binding data was performed as 
previously described (27). 

Inttma lization of IL-L Cells were washed thoroughly and incubated (5 X 10^ cells/ml) 

' Abbreviation used in this paper: LNT, nylon wool-purined lymph node T cells. 
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Figure 1. Separation of '"I-IL-1 from free Na*"I. The '"I-IL-1 -reaction mixture was 
placed on lop of a 1-ml Sephadex G-25 column that had been pretreated with BSA <10 mg/ml) 
and then extensively washed with PBS. ''^I-IL-i was eluted with PBS (pH 7.2) and 50 ^1 
fractions were collected and measured for radioactivity. Elution fractions 5-11 {arrows) were 
pooled and diluted in medium containing 10% FCS as carrier protein. 

in the presence of '^^I-IL-1 at 4"C for 3-4 h. The cells were then either washed extensively 
in medium or not washed and then transferred to 37 The amount of internalized IL- 
1 and total cell-associated IL-I was measured at various times by centrifugation of ceils 
through an oil gradient. Internalized IL-1 was distinguished from surface-bound IL-I by 
its resistance to treatment with tissue culture medium adjusted to pH 3 by adding IN 
HCI. At pH 3, >95% of surface-bound IL-1 dissociated within 1 min, whereas the level 
of intracellular IL-1 was not affected. The rate of increase in the pH 3-resistant IL-1 
binding with time at 37 was therefore uken as a measure of the rate of IL-1 
internalization. The level of radioactivity remaining after pH 3 treatment at the end of 
the 4**C incubation period was taken as background. No IL-1 internalization occurred at 
4**C. Internalization of radiolabeled rIL-2 was measured according to the same method. 
In other experiments, the exclusively internal localization of pH 3— resistant radioactive 
ligand was directly confirmed by quantitative autoradiography on electron micrc^scopy 
sections (J. Lowenthal and B. lacopetta, unpublished observations). 

Results 

Identification of IL-l-R'*' Cells by Absorption Experiments. In preliminary studies 
we used IL-1 absorption experiments to distinguish between IL-l-R'*' and IL-1- 
R* cell types. In these experiments, cells were incubated at 4® C in the presence 
of a known concentration of rIL-1 for 4-6 h. The cell-free supematants were 
then measured for residual IL-1 activity, either by their ability to induce IL-2 
secretion or IL-2-R expression by EL4-6.1 cells. Fig. 2 shows that cells from IL- 
|. responsive lines could absorb IL-1 activity, whereas nonresponsive cell lines 
such as CTLL and ST4 could not. This was not due to secretion of inhibitory 
products by these cells, as revealed by mixing experiments. EL4-6.I and cells of 
its subclone EL4-10 were very efficient at absorbing IL-1 activity. Results from 
exf>enments in which increasing numbers of EL4-6.1 cells were incubated in the 
presence of a constant concentration of IL-1 revealed that these cells bound 
about 10,000-20,000 IL-1 molecules per cell (data not shown). 
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Figure 2. Absorption of rlL-I by various cell lines. 10^ cells/ml were incubated hn the 
presence of rlL-l-o (5 ng/ml) for 4 h at 4**C. The IL-1 content of the cell-free supernatant 
was then measured at the indicated dilutions by its ability to stimulate IL-2 production by 
EL4-6. 1 cells in the presence of 0. 1 Mg/ml lonomycin. Similar results were obtained using rIL- 
1-^ or a mixture of a and ^. 



Radioiodinated rIL-1 Retains Biological Activity, IL-1 -a retained full biological 
activity after radioiodination, as measured by its ability to induce IL-2 secretion 
by EL4-6,1 cells in the presence of lonomycin (Fig. 3 A). We saw half-maximum 
biological activity at an IL-1 concentration of 6 pM. Furthermore, Fig. shows 
that both a. and P forms of unlabeled IL-1 competed with '^^I-IL-I-a for binding, 
although I L- 1-/9 had about threefold lower affinity. Ck>mpetition for '^^I-lL-l-a 
binding by IL-1 was specific since <5% inhibition of binding was observed when 
unlabeled rIL-2 was added at concentrations of up to 100 ng/ml (i.e., a 1,000- 
fold molar excess; Fig. 3^). 

Association Kinetics of ^^^I-JL-I with EL4-6,I Cells, Fig. 4 A shows the time 
course of association of ***I-IL-I with EL4-6. 1 cells at 4 **C over a 1 00-fold range 
in IL-I concentrations. Binding reached equilibrium within 2-4 h over the range 
of concentrations tested. Logarithmic conversion of these data revealed a first 
order association rate constant, which was similar at all three concentrations, 
with a ti/2 value of 30-40 min. Fig, 4B shows that binding of ^^^I-IL-1 to 
EL4-6.1 cells occurred more rapidly at 37**C than at 4°C, The difference in 
association rate constant was about fourfold. Dissociation of IL-1 occurred slowly 
at 4**C (^1/2 >4 h), which is in agreement with the results of Dower et al. (12). 

Equilibrium Binding Analysis Reveals Two Classes of IL-l-R. EL4-6. 1 cells were 
incubated at 4^C in the presence of various concentrations of *^*1-IL-1. After 
equilibrium binding was established (4 h), the cell-bound radioactivity was sepa- 
rated from the free radioactivity by centrifugation through an oil gradient, and 
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Figure 3. "'I-IL-1 retains biological activity. {A) Comparison of the capacity of labeled and 
unlabeled rlL-l-a to induce IL-2 secretion by EL4-6.I cells in the presence of lonomycin. (B) 
Ability of '"l-IL-l-a to compete for binding with unlabeled IL-l-« or IL-1-/3. EL4-6.1 cells 
(10* cells/well) were incubated in the presence of "'I-IH-a (0. 1 ng/ml) and various concen- 
trations of unlabeled IL-l-a or IL-1-^ for 4 h at 4'C. Bound radioactivity was measured as 
described in Materials and Methods. Maximum radioactivity was 3,660 cpm. Unlabeled rIL>2 
was also added as a negative control. 
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FiCiWE 4. Association kinetics of '*M-IL-1 with EL4-6.L (A) Cells were incubated at 4''C 
in the presence of the indicated concentrations of '"l-IL-1 for various periods of time. {B) 
Cells were incubated at 4'*C or 37°C in the presence of '"l-IL-1 (1 nM) and sodium azide 
(0.02%) for various periods of time. Nonspeciflc binding has been subtracted in A and B, 
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Figure 5.- Equilibrium binding analysis of '*^I-IL-1 to EL4-6. 1. Cells were incubated at 4°C 
in the presence of various concentrations of *^I-IL^1 (0.2—2,000 pM) for 4 h. Freshly thawed 
EL4-6.1 celk (O) were compared to EL4-6.1 cells that had been cultured for 3 months (•) in 
their ability to bind L-1 . (A) Specific equilibrium binding of ***I-IL-1 {solid lines) after 
subtraction of nonspecific binding {dashed lines). {B) Scatchard plot analysis of the binding 
data shown in A. Inset shows the binding of '^'I-IL-l to the high-afHnity class of IL-UR in 
more detail. Similar results were obtained in six independent experiments. 



Table I 

Expression of High- and Low-Affinity IL-l-R by D^erent Types ofT Cells 



Cells 


Number of 
experiments 


IL-l 'R per cell 
(mean ± SEM) 


Dissociation constant 
(mean ± SEM; pM) 




High affinity 


Low affinity 


High affinity 


Low affinity 


EL4-6.1 


5 


340 ± 120 


17.700 ±4,890 


4.6 ± 2.5 


380 ± 140 


EL4-16 


2 


280 ± 80 


11.100 ± 3.330 


3.9 ± 1.8 


440 ± 170 


EL4-RP 


2 


195 ± 105 


5,700 ± 1.060 


4.7 ± 1.2 


335 ± 1 10 


EL4-S 


5 


<2 


<2 






LBRM 


4 


46 ± 18 


2,550 ± 660 


5.3 ± 2.7 


360 ± 160 


Yac 


2 


35 ± 16 


1.650 ±300 


3.3 ± 1.2 


290 ± 60 


CTLL 


5 


<2 


<2 






BW5147 


3 


<2 


<2 






ST4 


3 


<2 


<2 






LNT* 


2 


12 ± 4 


180 ± 40 


5.0 ± 1.5 


405 ± 145 


LNT day 2* 


2 


23 ±6 


580 ± 180 


5.5 ± 1.8 


340 ± 90 



• Freshly isolated LNT. 

* LNT cultured 2 d in the presence of PMA (1 ng/ml) and lonomycin (0.25 ii^m\). 



then quantitated. Fig. 5^4 shows the specific equilibrium binding after subtraction 
of the nonspecific binding. Scatchard plot analysis of these data (Fig. bB) revealed 
that EL4-6.1 cells express two classes of IL-l-R. '-'98-99% of the toul IL-l-R 
(-^IS^OOO/cell) bound IL-1 with a of 380 ± 140 pM (Table I). A second class 
of IL-l-R that is expressed at a low level (340 ± 120/cell) bound IL-1 with a 
much higher affinity {Kd = 4.6 ± 2.5 pM; Table I and Fig. bB, inset). IL-l-R 
expression by EL4-6.1 cells was stable over a 3-mo culture period (Fig. 5). 

The level of high-affinity IL-l-R expressed by EL4-6.1 cells was compared 
with that of other T cell lines, as well as with normal peripheral T celb (Fig. 6 
and Table I). Other, independently derived EL4 cell lines varied in their 
expression of IL-l-R, which correlated well with their ability to respond to IL-1 
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Figure 6. Quantitation of high-aflinity IL-l-R expressed by different cell types. EL4-6.] 
(O), LBRM (#) and LNT cultured for 2 d in the presence of PMA and lonophore, (A) were 
incubated in the presence of various concentrations of '**I-1L-1 {0.2-10 pM) for 4 h at 4*C. 
After subtraction of nonspecific binding, the number of binding sites per cell and the 
dissociation constants were quantitated by-Scatchard plot analysis. 



and absorb IL-I activity. Cells of the EL4-3 line did not express IL-l-R (level of 
detectability is two receptors per cell), whereas EL4-RP.1 (subclone of EL4-RP) 
and EL4-16 (parental line of EL4-6.1) expressed intermediate levels of IL-l-R. 
Two other thymomas, LBRM and Yac, expressed lower numbers of IL-l-R (30- 
50 high-aflinity IL-l-R per cell). Cells of the IL-2-dependent cytolytic T cell line 
CTLL, and two other thymomas, ST4 and BW5147, were negative for IL-l-R 
expression as was already suggested by their inability to absorb lL-1 activity (Fig. 
2). In addition, activated normal T cells expressed a total of ^600 IL-l-R per 
cell, of which about 20 were of the high-affinity type. Interestingly, normal 
resting T cells also expressed detectable numbers of IL-l-R ('^10 high-affinity 
and 1 80 low-affinity receptors per cell). 

Internalization of ^^^I-IL-I. We used the ability to internalize IL-1 as an 
independent criterion for the expression of functional IL-l-R. EL4-6.1 cells were 
incubated at 4*'C in the presence of '*'I-IL-1 for 4 h to achieve equilibrium 
binding of IL-1 to both high- and low-affinity IL-l-R, Treatment with pH 3- 
buffered medium resulted in the rapid dissociation (95-98% within 1 min) of 
surface-bound ILl. The remaining radioactivity was taken as background. When 
the cells were transferred to ST^C there was a time- and temi>erature-dependent 
increase in the propK>rtion of cell-bound radioactivity that was resistant to acid 
treatment. This represents IL-1 that had been internalized. Internalization is a 
sjjecific process, which requires the expression of IL-l-R. Fig. 7 shows that EL4- 
6.1 cells (IL2-R") had the capacity to internalize ^^^I-IL-1, but failed to inter- 
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Figure 7. Capacity to internalize IL-l correlates with the expression of specific IL-l-R. 
EL4-6.1 (#) and CTLL (O) cells were incubated at 4'C in the presence of 30 pM of '**I-IL-1 
{dashed lines) or '**I-IL-2 {solid lines). After 4 h the cells were transferred to 37 and 
internalized radioactivity was measured at different times, as described in Materials and 
Methods. Internalized IL-l and IL-2 is expressed as a percentage of the total cell-associated 
ligand. The numbers in brackets refer to the total cell-bound cpm at time 0. 



nalize In contrast, CTLL cells (IL-2-RVIL-1-R") rapidly internalized 

*2n-IL-2, but could not internalize ^^^I-IL-L 

The data shown in Fig. 8 support the hypothesis that IL-l internalization 
occurs via high-afTtnity but not low-affinity IL-I-R. EL4-6.1 cells were incubated 
at 4** C in the presence of 100 pM *^*I-IL-1 for 4 h. According to the Scatchard 
data shown in Fig. 5, this allowed occuj>ancy of virtually all high-affmity receptors 
('^250 per cell) and ~2,000 low-affinity receptors. The cells were then washed 
to remove all of the unbound '^^I-IL-1. There was no change in the level of cell- 
bound '**I-IL-1 when the cells were subsequently transferred to 37**C and 
incubated for an additional 4 h (the level of cell-bound '^*I-IL-I stayed constant 
because there was no rebinding of IL-l, nor was there any appreciable dissocia- 
tion from either class of IL-l-R). Under these conditions, there was a rapid 
internalization of ^^^I-iL-I, which reached a maximum level within 20 min (Fig. 
8). This represented the internalization of —200 IL-l molecules per cell, close 
to the value of initially occupied high-affinity IL-l-R. There was no additional 
internalization of lL-1 after this time, despite the fact that '^2,000 IL-l-R per 
cell were still occupied. If, on the other hand, the cells were not washed before 
their transfer to 37**C, the level of internalized IL-l continued to increase with 
time (as does the total cell-associated IL-l). This may reflect the rebinding of 
free ***I-IL-1 to recycled or newly expressed high-affinity IL-l-R. After 60 min 
at 37 °C, the level of internal IL-l in the unwashed cells was three times that of 
the washed cells. Assuming that only the 200—300 high-afTlnity IL-l-R expressed 
by these cells can be internalized, and that these receptors return to the cell 
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Figure 8. Evidence that internalization of '"l-IL-1 occurs via high-affinity receptors. EL4- 
6.1 cells were incubated at 4''C in the presence of 100 pM '"l-IL-1. After 4 h, half of the 
cells were washed three times to remove unbound '**I-IL-1 {solid lines)^ whereas the remainder 
were not washed (dashed lines). The cells were then incubated at 37 °C. Toul cell-associated 
(O) and internalized "'l-IL-1 (•) was measured at different times using duplicate aliquots of 
1 0^ cells, as described in Materiak and Methods. 

surface, the data would be consistent with a receptor recycling time of 1 0-20 
min. 

Finally, Fig. 9 shows that the capacity of cells to internalize IL-1 correlates 
with the number of high affinity IL-l-R expressed per cell. IL-l-R^ cells such as 
EL4.3, CTLL, and BW5147 could not rapidly internalize ***I-IL-1 . LBRM cells, 
which express —50 high-afEnity IL-l-R per cell internalized fewer molecules 
than did EL4-6.1 cells, which express —200-400 such receptors per cell. 

Discussion 

We have previously described the EL4-6.1 thymoma line as a high IL-1- 
responder and have shown that these cells are induced in the presence of rIL-1 
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Figure 9. Internalization of IL-1 by various T cell lines. Cells were incubated at 37 ®C in 
the presence of 20 pM '**I-IL-1 and internalized '"l-IL-1 was measured at various times. Sec 
Table I for the number of hi^-affinity IL-l-R per cell. 

(of or P form) to both secrete IL-2 and express IL-2-R, provided that a second 
signal is given in the form of lonomycin or suboptimal concentration of PMA 
(13). The recent availability of human rIL-1 (23) and a procedure for its 
radiolabeling to high sfjecific activity has allowed us to directly quantitate the 
number and afHnity of IL-l-R expressed by various T cells. We show here that 
the IL-1 -responsiveness of EL4-6.1 cells correlates with the expression of an 
unusually high number of lL-1 binding sites (--20,000 per cell), which could be 
resolved into high- and low-affinity classes. Despite the large variation in the 
overall level of IL-l-R expression between different T cell types, the ratio of the 
number of high- to low-affinity IL-l-R was consistent. The results presented 
here further suggest that the biological activity of IL-1 is mediated exclusively 
via interaction with the minor class of high-aflinity IL-l-R. 

Radioiodination of rIL-l-a had no detectable effect on biological activity, as 
measured by its ability to induce IL-2 secretion and IL-2-R expression by EL4- 
6.1 cells. Dower et al. (12) radiolabeled biochemically purified human IL-1-/3 
using the Bolten-Hunter reagent. The IL-l-i3, incubated for 1 h with the reagent, 
was labeled to a high specific activity (2-5 x 10^* cpm/mmol) but retained only 
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5% of its biological activity. We radioiodinated human rIL-l-a by incubation in 
the presence of Na'^^I and chloramine T for 10 min, thereby achieving a high 
level of specific activity (1-2 X 10^^ cpm/mmol) while retaining full biological 
activity. *^*I-IL-l-a completed in an equimolar fashion with unlabeled IL-l-a for 
binding to EL4-6. 1 cells. Taken together with the functional studies, these data 
argue that '^^1-lL-l can be used as a valid estimate of the behavior of the natural 
form of the molecule. Furthermore, in agreement with the recent findings of 
Dower et al. (24), unlabeled rlL-l-a and rIL-1-/? both competed (albeit with 
slightly different afTinity) for the binding of labeled rIL-l-a suggesting that the 
two forms of rIL-1 bind to the same receptor. 

Scatchard plot analysis of ^^*I-IL-1 equilibrium binding revealed the presence 
of two classes of IL-l-R on a number of different T cell populations. The major 
class, making up 95-99% of the total IL-l-R, has a ATj of 300-500 pM. The 
minor class of receptors has an 100-fold higher affinity for IL-1 {Kd of 2-8 
pM). The level of expression of IL-l-R among different T cell lines was variable. 
Certain cell lines such as ST4, BW1547, CTLL, and EL4-3 were negative, 
whereas a variety of indep>endently derived EL4 lines were highly positive. Other 
thymoma cell lines such as Yac and LBRM expressed low, but detectable numbers 
of IL-l-R, which is consistent with an earlier report (1 2) in which the number of 
IL-l-R expressed by a variety of T cell types ranged from 27 to 550 per cell. In 
the latter study, the binding affinity and receptor number they reported for the 
LBRM cell line were similar to what we find here for the low-affinity class of IL- 
l-R. The reason why these authors failed to detect the high-affinity class of IL- 
l-R on LBRM cells is probably technical, i.e., they may not have been able to 
detect binding at sufficiently low IL-1 concentration (<I0 pM) because of the 
lower specific activity and/or biological activity of their '^^I-IL-1-/S preparations 
as compared with the '^"^I-IL-lor preparations used in the present study. 

All IL-l-R"*" T cell populations tested expressed both high- and low-affinity 
classes of IL-l-R, and the affinity of IL-I binding to either class was comparable. 
The existence of high- and low-affinity receptors has been reported in a number 
of peptide hormone systems, including nerve growth factor (29), epidermal 
growth factor (30), insulin (31), platelet-derived growth factor (32), and IL-2 
(27, 33). In all cases, the proportion of high-affinity receptors is low (5-15% of 
the total) and the difference in affinity l>etween the two classes for ligand binding 
is '-'50-100-fold. The biological significance of this receptor dichotomy is not 
known. 

Interestingly, normal resting T cells expressed very low numbers of high- 
affinity IL-l-R (-^10 j>er cell). This value is only an estimate of the average 
number of receptors p>er cell based on the population as a whole, and it is 
conceivable thai only a subpK>pulation of peripheral T cells expresses IL-l-R. In 
this context, recent results from experiments using highly purified f>opulations 
of lymph node T cells indicate that the L3T4''" subset is positive for IL-l-R 
expression, whereas the Lyt-2'*" subset is not.^ Expression of IL-l-R by resting T 
cells is probably constitutive, since activation by PMA and lonomycin (which 
induces growth of both LST4'*^ and Lyt-2'*" subsets [34]) did not significantly 



' Lowenthal, J. W., and H- R. MacDonald. Expression of tnterleukin-l receptors is restricted to 
the L3T4^ subset of mature T lymphocytes. Manuscript submitted for publication. 
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enhance IL-l-R expression after 2 d of culture, suggesting that there was no 
positive or negative selection for IL-l-R-expressing cells. 

In a number of other hormone systems, the ligand-receptor complex has been 
shown to be internalized upon ligand binding (35, 36), and this internalization is 
thought to be a necessary signal for the generation of biological responsiveness 
(37, reviewed in 38). For IL-2, internalization has been shown to occur via the 
high-aflinity class of IL-2'R, whereas low-afTmity IL-2-R are nonfunctional in 
this respect (39; 40; M. Nabholz, personal communication). We now provide 
evidence that a similar functional dissociation exists between high- and low- 
affinity lL-1 -R. Internalization of IL- 1 is a specific process, requiring the presence 
of cell surface IL-l-R. The proportion of total cell-bound IL-1 that can be 
endocytosed by EL4-6. 1 cells corresponds to the proportion of IL-1 that is bound 
to the high-affinity class of IL-l-R. Furthermore, when EL4-6.1 cells were 
incubated at 4^C in the presence of either 10 pM '^^I-IL-1, a concentration that 
permits occupancy of high- but not low-affinity IL-l-R, or 1 nM, a concentration 
that allows occupancy of all IL-l-R, and then washed and transferred to 37 °C, 
the rate and extent of IL-1 internalization was the same (data not shown). Finally, 
the capacity of different cell lines to internalize IL-1 correlated closely with the 
level of high-aflinity IL-l-R expression. As in other hormone systems, the 
functional significance of ligand internalization and degradation is not known. 
One possibility is that this process allows dissociation of the ligand from its 
receptor and permits the receptor to recycle back to the cell surface and to bind 
more ligand. So far, no function has been attributed to low-affinity hormone 
receptors, but it is conceivable that they represent an external pool of nonfunc- 
tional receptors that can be rapidly recruited into the functional high-affinity 
pool. The transition between the two classes of receptors may involve a confor- 
mational change, for example by association with a second accessory molecule. 
Irrespective of its functional significance, it would appear that the ability to 
internalize IL-1 can be used as a sensitive, independent measure for the expres- 
sion of high-aflinity IL-l-R. 

In contrast to EL4-6.1 cells, other T cell populations (including normal 
activated T cells) express low levels of IL-l-R. Even though they express only 
10-60 high-affinity IL-l-R per cell they can still internalize and respond to IL- 
1. Similar findings of low numbers (15-300 per cell) of high affinity hormone 
receptors have been reported for granulocyte and macrophage growth factors 
(41 and F. Walker and A. W. Burgess, p>ersona] communication). It is significant 
that the concentration of IL-1 corresponding to half-maximal occupancy of the 
high-affinity class of IL-l-R (i.e., the Kd value) is identical to the concentration 
of IL-1 that gives half-maximal biological activity (2-8 pM). This is similar to the 
situation found for IL-2, where the /^d value of the high-affinity class of IL-2-R 
corresponds to the IL-2 concentration giving half-maximal biological activity 
(10-30 pM) (26, 27, 33, 37). Taken together, the data presented here suggest 
that the biological activity of IL-1 can be attributed exclusively to its interaction 
with high-affinity IL-l-R. 

Because of the extremely high number of IL-l-R expressed by EL4-6.1, they 
constitute a valuable model system with which the structure, function, and 
regulation of IL-l-R can be studied. Furthermore, this cell line should facilitate 



1072 BINDING AND INTERNALIZATION OF INTERLEUKIN 1 BY T CELLS 

the isolation of the IL-l-R in purified form and cloning of the gene(s) that 
encode it. 

Summary 

In this report we describe, on the basis of direct IL-I binding assays and IL*1 
internalization studies, the existence of two classes of IL-l-R on a variety of T 
cell types. Cells of the EL4-6. 1 thymoma express large numbers (—20,000 per 
cell) of IL-l-R that have a Ka of '^300 pM for IL-1. Even though these receptors 
make up 98-99% of the total IL-l-R per cell, they appear to be nonfunctional, 
based on their inability to endocytose IL-I. A minor class of IL-l-R (200- 
400/cell) has an --lOO-fold higher afTmity for IL-1 (K^, -5 pM) and can rapidly 
internalize the ligand upon binding. All of the biological activity of IL-1 can be 
shown to occur via binding to high-affinity IL-l-R since the IL-1 concentration 
giving half-maximum biological activity in EL4-6.1 cells corresponds precisely to 
the Kd of this class of receptor. Other cell types, including normal T cells, also 
express both high- and low-affinity IL-l-R, but the absolute number of receptors 
per cell is considerably less. 

We thank Drs. C. S. Henney, S. Habu, and O. Kanagawa for providing cell lines. We also 
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ABSTRACT Several laboratories have recently demonstrated 
diat the requirement for macro^iages in mitogen^induced pro- 
duction of murine T-cell interleiudn 2 (IL-2; formerlv referred to 
as "T-ceD growth factor**) could be circumvented oy using the 
macrophage-derived peptide interleuldn 1 (ILrl; formerly re- 
ferred to as "lymphocyte-activating factor**). Using two doned T- 
oeO lymphomaj, we investigated the mechanism dirougfa which IL- 
1 eierted its effect on DL-S production. One of the edi lines used 
. (LBRM-33 5A4) produces large concentrations of iL-l upon mi- 
togen stimulation, whereas the second (LBRM-33 1A5) is incapable 
of producing IL-2 in response to mitogen. It was observed that 
addition of purified IL-1 to nonproducer 1A5 cells converted them 
to a state in which subsequent mitogen stimulation triggered pro- 
duction 1^ IL-2. Tlie concentration of ILr2 produced by IL-1 
treated IA5 ceDs was equivalent in magnitude to that generated 
by mitogen-stimulated 5A4 cells (500-1000 um'ts/ml, or approxi- 
mately 1000 times the concentration of IL-2 contained in conven- 
tional preparations <^ murine mitogen-conditioned medium). The 
observations that (t) brief exposure to IL-1 was sufficient for 1A5 
ceD conversion to EL-2 production and (m) IL-l could actively be 
absorbed from culture medium by live or fixed 1A5 ceDs led us 
to pnqiose the existence of IL-I receptors on responsive 1A5 cells. 
On die basis of these experiments, we have postulated that IL-I 
mediates its effect on immune reactiviH" (enhancement of thy- 
mocyte autogenesis and induction of antibody and cytotoxic T cell 
responses) by maturation of a subset of immature T cells to the 
point where diey are craable of IL-2 production. Subsequent re- 
lease of IL-2 after ligand activation allows for donal expansion of 
activated T ceDs whidi mediate particular effector functions. 



Recent experiments conducted in several laboratories have con- 
firmed the pivotal roles that interleukin 1 (IL-1; formerly re- 
ferred to as "lymphocyte-activating factor" or LAF) and inter- 
leuldn 2 (IL-2; formerly referred to as *T-cell growth &ctor" or 
TCCF) play in the generation of T and B cell immune reactivi- 
ties (1-8). Both proteins have been shovi^ (t) to enhance thy- 
mocyte mitogenesis, (tt) to support the induction of alloantigen- 
primed cytotoxic T-cell reactivity, and (ttt) to aid in the genera- 
tion of helper T cells for antibody responses after stimulation 
with heterologous erythrocytes (1, 2, 4-7). In marked contrast, 
IL-2 is the sole interleuldn capable of sustaining the in vitro ex- 
ponential proliferation of effector T-cell lines (1. 2, 8, 9). Simi- 
larly, only IL-2 has been shown to be capable of allowing for in 
vitro and in vivo generation of cytotoxic T cells from nude mouse 
spleens (1. 2. 10,11). 

Due' to the well-documented macrophage requirement for 
mitogen-stimulated IL-2 production (12), and the observation 
that IL-1 is a macrophage product (13), it has been hypothesized 
that IL-1 may be an essential signal required by IL-2 producer 
T cells. Such a hypothesis was first suggested by the observa- 
tions of Smith et al, (12) who found that IL-1 producer tumor cell 
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line supemates could restore adherent cell-depleted, mitogen- 
stimulated, murine spleen cell cultures to normal levels of IL-2 
production. IL-1 involvement in IL-2 production was also sug- 
gested by the studies of Farrar et al, (14) who showed that the 
tumor promotor phorbol myristate acetate [PMA; previously 
shown to act as a replacement for IL-1 in several immune re- 
sponse assays (15, 16)] could substitute for the macrophage re- 
quirement for 11^2 production by murine T cells. Finally, ad- 
dition of IL-1 to mitQgen-activated, adherent, cell-depleted, 
spleen cell cultures has been shown to reconstitute IL-2 pro- 
duction to normal levels (17, 18). Although the involvement of 
IL-1 in the production of IL-2 by ligand-stimulated T cells can- 
not be questioned, a precise mechanism by ^A^iic^ IL-1 func- 
tions in this capacity has not been presented. For example, it is 
not known if IL-1 and mitogen affect the same cell or different 
ceils that then must interact to initiate IL-2 production. 

Experiments in our laboratories have recendy documented 
the existence of an extremely potent IL-2 producer lymphoma 
cell, LBRM-33 (19). Mitogen-stimulated LBRM-33 cells pro- 
duce 1000-10,000 times the amount of IL-2 generated by con- 
ventional cultures of mitogen-stimulated mouse spleen celk. 
Umiting-dilution cloning LBRM-33 resulted in the isolation 
of both extremely high titer IL-2 producer clones and one non- 
producer lymphoma ceU line variant (19). On the basis of these 
observations,' we initiated studies to evaluate the utility of IL-2 
producer and nonproducer LBRM-33 clones in the dissection of 
the IL-1 requirement for IL-2 production. 

In this communication we report that IL-1 has the capacity to 
convert IL-2 nonproducer tumor clones to high-titer IL-2 pro- 
duction. Conversion to IL-2 production occurs after brief expo- 
sure to IL-1 at concentrations that do not produce detectable 
thymocyte mitogenesis (13), thereby leading to the develop- 
ment of a more sensitive bioassay for IL-1 activity. Further- 
more, the observation that LBRM-33 cells could absorb IL-1 
from cultures suggests that these cells possess sur&oe receptors 
for IL-1 and identifies a valuable cell population for further 
study of the molecular aspects of IL-l-induced immune 
reactivity. 

MATERLU.S AND METHODS 

CeU Lines. LBRM-33, a radiation-induced splenic lym- 
phoma from the BIO. BR mouse (originally isolated by G. Cud- 
cowicz, Rosweil Park Memorial Institute, Buffalo, NY), has 
been shown (19) to produce high-titer murine IL-2 upon 24-hr 
stimulation with T-cell mitogens. Limiting-dilution cloning of 
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LBRM-33 resulted in identification of both extremely high titer 
IL-2 producer (clone 5A4, 1000 units of IL-2 per ml) and non- 
producer cell line (LBRM-33 1A5) clones. LBRM cells were 
further found to be positive for cytosol terminal deoxynucleo- 
tidyl transferase and expressed TL, Thy 1, Lyt 1, Lyt 2, and Lyt 
3 markers at the cell surface. 5A4 and 1A5 cells used in these 
studies were maintained in vitro in RPMI-1640 medium sup- 
plemented with 5% heat-inactivated (56^C, 30 min) fetal calf 
serum. 50 /iM 2-mercaptoethanoI, penicillin (50 units/ml), 
steptomycin (50 fig/m\), and fresh L-glutamine (300 /ig/ml). 
IL-2 production cultures were conducted in this medium in 
either 200 ^ (no. 3596, flat-bottom microplate, Costar, Cam- 
bridge, MA) or 5 ml (no. 3013 tissue culture flasks, Falcon Plas- 
tics, Oxnard, C A). IL-2 production was initiated by addition of 
phytohemagglutinin M (PHA; 0.1-1% by volume, GIBCO) to 
LBRM cell hne cultures (10^ cells per ml). In several experi- 
ments, 1A5 and 5A4 cells were also stimulated with either PM A 
(10 ng/ml, Sigma) or IL-1 (for concentrations see Results) pre- 
pared as detailed below. Supematants harvested from 24-hr 
cultures were tested for IL-2 activity as detailed below. Main- 
tenance of LBRM-33 cell lines and IL-2 production experi- 
ments were at ZT^C in a humidified atmosphere of 5% CO2 in 
air. 

IL-2 Assay. The 24-hr culture supematants from stimulated 
LBRM-33 5A4 and 1A5 cells (either in the presence or in the 
absence of IL-1 at various concentrations) were tested for IL-2 
activity by using a standard microassay (20) based on the IL-2- 
dependent exponential proliferation of a murine cytotoxic T-cell 
line (CTLL) (8). Briefly, 3000 CTLL cells were cultured in rep- 
licate 20Q-/a1 volumes in flat-bottomed microplate wells in the 
presence of a log2 dilution series of putative IL-2-containing 
samples. Af^er 24 hr, the cells were exposed to 0.5 $iCi (1 Ci = 
3.7 X 10^^ becquerels) [^H]dThd (20 mCi/mmol; New England 
Nuclear) for an additional 4 hr af^er which the cultures were har- 
vested onto glass fiber filter strips with the aid of a multiple au- 
tomated sample harvester (MASH 11, Microbiological Associ- 
ates, Bethesda, MD). [^HjdThd incorporation was then 
determined by hquid scintillation counting. Only CTLL cells 
cultured in the presence of IL-2 incorporated [%]dThd in a 
dose-dependent manner. Consistent with the observation that 
CTLL cells cultured in the absence of IL-2 are >95% trypan 
blue-positive, cultures lacking IL-2 incorporated <100 cpm of 
pH]dnid. IL-2 activity was quantified by probit analysis of 
[^H]dThd incorporation data as described (20). A standard prep- 
aration of IL-2-conditioned medium [1 unit/ml, 48-hr super- 
nate of concanavolin A-stimulated (5 /xg/ ml) rat spleen cells (10^ 
cells per ml)l routinely generated 10,()00-15,000 cpm of 
[^H]dThd incorporation at a dilution of 1:2. Similarly, a super- 
natant containing 1000 units of IL-2 activity per ml stimulated 
identical levels of [^H]dThd incorporation at a dilution of 
1:2000. CTLL cells were routinely maintained in vUro in expo- 
nential proliferation in RPMI- 1640 medium supplemented 
with 50% rat spleen cell-conditioned medium (produced as de- 
tailed above) and 2% fetal calf serum (3T*C, 5% COj in air). 

Preparation of IL-1 and Assay for Activity. The IL-1 used 
for induction of LBRM-33 IL-2 production was prepared from 
the culture supernatant of PMA-stimulated P388D| macro- 
phage tumor cells as detailed (13). IL-1 was partially purified 
by a sequence of differential ammonium sulfate precipitations, 
DEAE-cellulose ion exchange chromatography, and Sephacryl 
S200 gel exclusion chromatography (21). ^er this, all of the 
biologically active IL-1 was associated with a protein of M, 
«w 15,000 (21). The IL-1 used in absorption experiments was fur- 
ther purified by phenyl-Sepharose hydrophobic affinity chro- 
matography (22). After Sephacryl S200 gel filtration, the spe- 
cific activity of the partially purified IL-1 was approximately 



10,000 units/mg of protein. Phenyl-Sepharose chromatography 
increased the specific activity ^SO fold. 

In some experiments, IL-1 was inactivated by the arginine- 
modifying agent phenylglyoxal (22). IL-1 was dialyzed into 200 
mM imidazole buffer (pH 8) and incubated in 1% phenylglyoxal 
for 6 hr at 25*C. After this treatment, IL-1 was passed over a 
column of Sephadex G-25 (equilibrated in phosphate-buffered 
saline, pH 7.2) to remove die phenylglyoxal. Control IL-1 was 
mixed with 1% phenylglyoxal immediately prior to gel filtration 
chromatography. Phenylglyoxal treatment had no effect on 
mobility of IL-1 on Sephadex G-25 chromatogr^hy columns. 

IL-1 activity was determined by its capacity to enhance thy- 
mocyte proliferation in response to in vitro stimulation -with 
PHA at 1 f^ml as detailed (13). Briefly, thymocytes from C3H/ 
HeJ mice (4-6 weeks old, Jackson Laboratory, Bar Harbor, ME) 
were cultured in 200 fx\ in flat-bottom microplate welb (1.5 
X iifi cells per well) in the presence of a logs dilution series of 
putative IL-l-containing samples. Under these conditions, a 
standard preparation of IL-1 (100 units/ml) induced approxi- 
mately 10,000-15,000 cpm of thymocyte [^H]dThd incorpo- 
ration at a dilution of 1:4. Half- maximal thymocyte proliferation 
was routinely observed af^er culture stimidation with 5-10 units 
of IL-1 per ml. Units of IL-1 activity were quantified by probit 
analysis as detailed elsewhere (20). 

IL-I Absorption. To test the capacity of LBRM-33 cell line 
derivatives to absorb IL-1, both 5A4 and 1A5 cells (10^ cells) 
were washed and resuspended in RPMI-1640/2% fetal calf 
serum containing either 100 or 0.5 unit of IL-l per ml. Aher a 4- 
hr incubation at 47 or 3T'C, the cells were pelleted by a lO-min 
centrifogation at 300 X g and the supernatant was tested for re- 
sidual IL-1 activity by its capacity to enhance thymocyte mi- 
togenesis or to induce LBRM-33 1A5 IL-2 production. In some 
experiments, LBRM-33 5A4 and 1A5 cells were fixed with 2% 
glutaraldehyde (15 min, 4**C) and washed five times with 50 ml 
€£ culture medium prior to use in absorption tests. 

RESULTS 

Tumor Cell Line ILi-2 Production. The results presented in 
Table 1 review the relative capacities of LBRM-33 5A4 and 1A5 
clones to produce IL-2 af^er incubation with 1% PHA. As pre- 
viously observed (19), 5A4 cells cultured for 24 hr (10" cells per 
ml) in the presence of 1% PHA produced approximately 1000 
units of IL-2 activity per ml whereas LBRM-33 1A5 cells pro- 
duced no detectable IL-2. It should be stressed that the amount 
of IL-2 generated by mitogen-stimulated 5A4 cultures was be- 

Table 1. IL-2 production by LBRM-33 5A4 and 1A5 cellUnes 



Stimulated with* 



11^2 present in 









PMA 


supernate, 


1%PHA 
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* +, Present in culture; — , absent from culture. 

^ 10" cells per ml in RPMI-1640/2% fetal calf serum. 
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. tween 1000 and 10,000 times the amount of ILr2 produced by 
identical concentrations of normal murine spleen cells. We rou- 
tinely found 1% PHA to be an optimal mitogen dose for eliciting 
production of IL-2 by the LBRM-33 5A4 cell line. Reduction of 
this mitogen concentration by a fector of 10 (0. 1% PHA) signifi- 
cantly diminished the amount of IL-2 produced by clone 5A4; 
stimulation of nonproducer 1A5 cells with 0.1% PHA did not 
lead to IL-2 production. 

It was interesting that incubation of 5A4 cells with 0. 1% PHA 
and PMA at 10 ng/ml resulted in maximal IL-2 production. Of 
potential importance was the observation that PMA in the pres- 
ence of PHA stimulated comparatively weak but significant, IL- 
2 production by 1A5 cells. Therefore, it was conceivable that 
1A5 cells possessed the capacity to secrete IL-2 but apparently 
required additional signals to activate the IL-2 producrtion pro- 
cess. Several laboratories have demonstrated an adherent cell 
requirement for murine IL-2 production (12, 14, 17, 18). In the 
absence of macrophages, concanavalin A-stimulated, purified 
T-cell populations could be restored to normal levels of IL-2 
production by addition of either macrophage tumor cell line su- 
pematants (12) or the monokine IL-1 (17, 18). Farrar et al, (14) 
found that addition of PMA to mitogen-stimulated, macro- 
phage-depleted, spleen cell cultures also reconstituted normal 
IL-2 production. These results coupled with those detailed in 
Table 1 led us to examine the effect of IL-1 on both the PHA- 
responsive LBRM-33 5A4 (IL-2 producer) and the PHA-unres- 
ponsive LBRM-33 1A5 (IL>2 nonproducer) cloned lymphoma 
cell lines. 

Effect of IL-1 on Production of ILr2 by Tumor CeO Line. 
Addition of P388Di macrophage-derived IL-1 to cultures of 
PHA-stimuIated LBRM-33 5A4 and 1A5 cells had profound ef- 
fects on IL-2 production. As with PMA, addition of IL-1 at 10 
units/ ml to suboptimally mitogen-stimulated 5A4 cells resulted 
in restoration of peak levels of IL-2 production (Table 2). Fur- 
thermore, IL-1 at 10 units/ml also induced optimal IL-2 pro- 
duction by PHA-stimulated 1A5 cultures. IL-1 induced high-ti- 
ter IL-2 production by 1A5 cells when the cells were stimulated 
with optima] (1%) or suboptimal (0. 1%) concentrations erf PHA. 
Addition of IL-1 to cultures of non-mitogen-stimulated 1A5 cells 
was not a sufficient stimulus for conversion of the 1A5 cell hne 
to IL-2 production. 

Perh^s the most striking aspect of the capacity of IL-1 to fos- 
ter 1A5 cell IL-2 production was the extremely low concentra- 
tions of IL-1 at which such a conversion was observed. 
Dose-response curves for both IL-l-induced murine thymo- 
cyte prohferation and IL-l-dependent conversion of 1% PHA- 
stimulated 1A5 cells to IL-2 production are shown in Fig. 1. 

Table2. Effect of IL-1 on IL-2 production by LBRM-33 5A4 

and 1A5 cell lines 





IL-2 in 24-hr 




supemate. 


CuttUTB* 


unitB/ml 


fiA4+ 1%PHA 
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5A4 + 0.1% PHA 


35 


5A4 + BLrl 


0 


fiA4 + IL-1 -fLOft PHA 
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5A4 + IL-1 -1- 0.1% PHA 


525 


1A5 + 1%PHA 


0 


1A5 + 0.1%PHA 


0 


1A6 + IL-1 


0 


U5 + IL-l -1- 1% PHA 


476 


1A6 + IL-1 + 0.1% PHA 


513 




iOO 50 25 . 12 6 3 1.5 0.75 0.4 0.2 0.1 
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PlO. 1. Dose-Hresponse of IL-1 activity as meaaured by induction 
of thymocyte mitogeneBia (•) and IL-2 production by LBRM-33 1A5 
cells U). 

Fifty percent of maximal IL-l-induced thymocyte proliferation 
was observed at an IL-1 concentration of ^proximately 5 units/ 
ml. However, IL-1 at 5 units/ml stimulated 1A5 cells to peak 
levels <^ IL-2 production. Hie capacity of IL-1 to convert 1A5 
cells to maximum production of IL-2 did not begin to decline 
until IL-1 concentrations as low as 0.5 units/ml were tested. 
In foct, PHA-stimulated 1A5 cells cultured in the presence of 
IL-1 at 0.2 unit/ml produced a significant amount of IL-2 (165 
units/ml). The observation that normal T-cell mitogen-stimu- 
lated murine spleen cells produce, under optimal conditions 
(10^ cells per ml; concanavalin A at 2. 5 /tg/ml) only 0.&-1.5units 
of IL-2 per ml further substantiates the capacity of low concen- 
trations of IL-1 to convert nonproducer 1A5 cells to hi^-titer 
IL-2 production. 

We are confident that the effects of IL-1 shown in Fig. 1 were 
due to the monokine itself as opposed to some unidentified con- 
taminant molecule present in the IL-1 preparation. The IL-1 
used in these studies was generated in a four-step purification 
procedure and was of relatively high specific activity. Addition- 
ally, treatment of purified IL-1 with 1% phenylglyoxal totally 
eliminated not only its capacity to augment thymocyte mitoge- 
nesis but also its ability to convert 1A5 ceUs to IL-2 production 
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*Ciiltare8 contained 10^ oella per ml in BPMI-1640/2% fetal calf 
eerum. When present, IL-1 was at 10 unit^/ml. 



Fio. 2. i%enylglyoxal-mediated inhibition of IL-l activity. IL-1 
(300 unitfl/ml) was exposed to 1% phenylglyoxal finr 6 hr (25't/) and 
then was separated from the phenylglyoxal by Sephadez G-25 gel ex- 
clusion chromatography. (A) Ffaenylg^yoxal (PG) treatment abolished 
all IL-1 activity as measured its capacity to augment th y m o c yt e 
mitogenesis. (B) Capacity of a log^ dilution series of both control (▲) 
and phenylglyoxal-treated (•) IL-1 to convert PHA-stimulated 1A5 
cells to IL-2 production. 
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(Fig. 2). Previous studies (22) have shown that phenylglyoxal 
modification of arginine residues in IL-l results in destruction 
of its biologic activity in a number of different biological systems 
(without altering its mobility on gel exclusion chromatography 
columns). As a site-specific modifier, it seems unlikely that 
phenylglyoxal treatment (and destruction of oonventionaiiy 
tested IL-1 activity) would have a similar effect on some other 
contaminant protein present in the IL-1 population which was 
responsible for converting 1A5 cells to IL-2 production. 

Absorption of IL-l Activity by LBRM-33 5A4 and 1A5 Cell 
Lines. It was clear that IL-1 itself was not a sufficient stimulus 
for IL-2 production in that 1A5 ceils exposed to IL-1 in the ab- 
sence of mitogen did not produce IL-2. In an attempt to disso- 
ciate requirements for IL-1 and mitogen sensitization, we tested 
whether conversion of 1A5 cells to a state in which they were 
capable of producing IL-2 required the continued presence of 
IL-1. Multiple cultures were prepared in vs^iich 1A5 cells were 
either treated continuously with both PHA and IL-1 (10 units/ 
ml) or sequentially with IL-1 and then PHA. 1A5 cells were first 
exposed to IL-1 fbr 4 hr at 3TC; after exhaustive washing, the 
cells were cultured in either the presence or absence of addi- 
tional IL-1 and PHA. Brief exposure of 1A5 cells to IL-1 did not 
affect their capacity to produce maximal levels of IL-2 upon sub- 
sequent exposure to PHA (Table 3). 

These results suggested that the interaction of IL-1 with 1A5 
cells was relatively rapid in nature. Based on previous studies 
(3) that demonstrated the capacity of activated T cells to absorb 
IL-2 from cultures, we questioned whether 1A5 or 5A4 cells 
possessed a similar cell sur&ce responsiveness (presumably me- 
diated by receptors) for IL-1. To test this hypothesis, laige num- 
bers of 1A5 and 5A4 cells were harvested from cultures, washed 
extensively, and resuspended in the presence of a known 
amount of IL-1. After a 4-hr incubation at 4^C, the cells were 
pelleted and the supematants were tested for residual IL-1 ac- 
tivity (assayed by enhancement of thymocyte mitogenesis as 
well as by conversion of IA5 cells to IL-2 production). Absorp- 
tions were conducted with two different IL-1 concentrations: 
200 units/ml in experiments in which activity was to be tested 
by conventional IL-1 assay (thymocyte proliferation); and 0.5 
unit/ml in experiments in which absorbed supematants were 
assayed by their effect on 1A5 cells. Different concentrations of 
IL-1 were used in the two assay systems because of the signifi- 
cant difference in IL-1 dose-responses noted in the two bioas- 
say systems ( Fig. 1). 

Both 1A5 and 5A4 cell lines were capable of absorbing IL-1 
activity (Fig. 3). As measured by the ability of the residual su- 
pernatant IL-1 to enhance thymocyte mitogenesis, 1A5 cells 
possessed a greater capacity on a per cell basis to absorb IL-L 
Perhaps this was predictable from the observation that IL-1 con- 
verted the PHA-unresponsive 1A5 cell line (an IL-2 nonprodu- 
cer) to IL-2 production whereas its effect on a similar cell line 
already capable of IL-2 production (5A4) was simply to enhance 
the effect of PHA. We are confident that the absorption shown 

Table 3. Converdon of 1A5 cells to IL-2 production does not 
require continual presence dTIL-l 
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no. 3. AbeorptionofIL-lbylA5and5A4oeU8.(A)arl(242iinitfl/ 
ml) was incubated for 4 hr at 4t^ either alone (control) or in the prea- 
ence of 10* 1A6 or 6A4 cells. Then the cells were removed and the su- 
pematants were tested for residual IL-1 activity by their ability to 
enhance thymocyte mitogenesis. (B) Capacity of these samples to in- 
duce IL-2 production by 1A5. Control IL-1 (0.5 unit/ml); AtA, IL-1 
(0.6 unit/nd) after abeorption with live or 2% glutaralddkyde-fized 
1A6 cells; ma IL-1 (0.6 units/ml) after absorption with live or 2% glo- 
taraldehyde-fixed 6A4 cells. 

in Fig. 3 was due to expression of a cell-surfece responsiveness 
to IL-1 rather than to an LBRM-33 cell-mediated degradation of 
ILrl activity because: (i) absorptions were conducted at 4X, a 
condition not supportive of hi^ levels of cellular metabolism, 
and (<l) identical absorption characteristics were found when 
trials were conducted with 2% glutaraldehyde-flxed 1A5 and 
5A4 cells (Fig. 3). 

DISCUSSION 

The experiments presented in this report took advantage of the 
existence of two cloned T-cell lymphomas of , identical lineage 
(19). One cell hne (LfiRM-33 5A4) produces large quantities of 
IL-2 (the interleuldn responsible for the sustained proliferation 
of activated T lymphocytes) whereas the second (LBRM-33 1A5) 
is incapable of IL-2 production after PHA stimulation. Because 
several previous investigations have suggested a requirement 
for the monokine IL-1 in the production of IL-2 by normal mu- 
rine splenic T cells (12, 14, 17, 18), we used these two lymphoma 
ceU lines in an attempt to dissect the mechanism(s) by which IL- 
1 participates in the initiation of IL-2 production. 

We found that IL-1 has the capacity to modify 1A5 cells in 
some manner to render them capable of producing IL-2 upon 
subsequent exposure to a T-cell mitogen (PHA). Such activation 
occurs after brief exposure to IL-1 at a concentration approxi- 
mately 1/lOth that shown to be effective in the conventional IL- 
1 assay based on enhancement of thymocyte mitogenesis. This 
observation is not totally surprising if one considers (on the basis 
of the experiments detailed in this report) what is truly being 
measured in a conventional IL-1 assay. It has been well docu- 
mented that IL-1 per se is incapable of triggering the sustained 
proliferation <^ activated T lymphocytes (6). This is a property 
unique to IL-2. Therefore, the enhanced mitogenesis observed 
when an entire thymocyte preparation is stimulated with PHA 
and IL-1 must be dependent upon production of the mitogenic 
&ctor IL-2. Because only the relatively small, cortisol-resistant 
population of the thymus (i.e.. the mature T-cell compartment) 
has been shown to produce IL-2 (12), stimulation of an entire 
thymocyte preparation with mitogen (in the absence of exoge- 
nous IL-1) results in poor IL-2 production and concomitant T- 
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ceU proliferation. However in the presence of IL- 1, presumably 
more T cells (perhaps immature thymocytes) are converted to a 
state in which they can produce IL-2 and, in so doing, markedly 
augment thymocyte mitogenesis. 

Based on such a mechanism of action for IL-1, it is not at all 
surprising that IL-1 converts 1A5 cells to ILr>2 production at a 
far lower concentration than has been shown to be necessary for 
thymocyte proliferation. Conversion to ILr-2 production as mea- 
sured on 1A5 cells focuses the effects of IL-1 on a homogeneous 
cloned cell population whereas the percentage of ILrl-respon- 
sive T cells present in an entire thymus preparation is more than 
likely diminished by the presence of several if not hundreds of 
distinct T<cell subpopulations at varying levels of IL-1 respon- 
siveness. It is unknown at present which cells in the thymus re- 
spond to IL-1 and if those cells (as is the case with 1 A5) respond 
by maturing to a state of potential IL-2 production. However* 
by using sensitive assay systems which can now be developed 
due to IL-1 effects on 1A5 cells, it may be possible to isolate and 
to identify this naturally occurring, IL-l-responsive, thymocyte 
subset. 

To this end, the observation that 1A5 cells and, to a lesser ex- 
tent, 5A4 cells possess cell surfece receptors to IL-1 is extremely 
important. 1A5 cells were shown to be converted to a state of 
lectin-inducible IL-2 production af^er only brief exposure to IL- 
1. These same 1A5 cells were also shown to be c^>able of ab- 
sorbing IL-1. In fact, both live and fixed 1A5 cells exhibited 
identical absorption. of IL-1 activity in 4 hr incubations at 4^C. 
These results argue in favor of the existence of a 1 A5 cell sur&ce 
responsiveness to IL-1 (presumably via the cell surface recep- 
tors). Further evidence in support of the existence of 1A5 cell 
sur&ce IL-1 receptors comes firom recent experiments con- 
ducted in our laboratories. Preliminary evidence indicates that 
biochemically identifiable IL-1 can be eluted from fixed 1A5 
cells by heating the cells to 56^C or by washing the 1 A5 cell pel- 
let with 0. 1% acetic acid. When the material extracted by heat 
or add was lyophilized and electrophoresed on two-dimensional 
Tris-glycinate polyacrylamide/gels, silver-stainable protein 
bands were detected that were identical in charge and molecu- 
lar weight to purified IL-1 (unpublished observations). 

It should be stressed that previous attempts to use whole thy- 
mocyte populations to absorb IL-1 activity consistently have 
been unsuccessful. Such results might be due to the relatively 
small percentage of IL-1 responsive cells in the thymus that are 
converted to a state of potential IL-2 production. It is hoped 
that, as radiolabeled, purified IL-1 becomes available, 1A5 cells 
will prove to be a valuable reagent in standardizing assays for IL- 
1 sur£u:e binding and perhaps quantifying numbers of IL-1 cell 
sur&ce receptors. With such assay systems at hand it would be 
possible to probe various T cell populations (selected on the ba- 
sis of either cell sur^u^ or functional phenotype) systematically 
for responsiveness to IL-1 and conversion to IL-2 production. 

In any event, it is clear that 1 A5 cells will be extremely useful 
in probing the molecular effects of IL-1 responses. The availa- 
bility of a cloned cell line that responds after brief exposure to 
purified monokine allows study of the events that program IL-2 
production at the level of gene activation, RNA transcription. 
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and protein synthesis. Additionally, 1A5 cells will be extremely 
useful in determining the consequences ILrl responses in 
terms of alteration of cell sur&ce phenotype and function. In 
this regard, it is interesting to note that 1A5 cells contain high 
levels of cytosol terminal deoxynucleotidyltransferase (unpub- 
lished observations). In that this enzyme is a biochemical 
marker typical of immature T cells, such a 1A5 cell phenotype is 
(t) suggestive that naturally occurring IL-l-responsive ceUs 
might be found in the immature compartment of the thymus and 
(tf) predictive that ILrl should have little effect on mature T cells 
lacking this transferase (a contention that is well supported by 
present studies detailing IL-1 action). Finally, by converting 
1A5 cells to a state of potential IL-2 production, it appears that 
the monokine achieves its effect on immune responses via a dif- 
ferentiation/maturation-inducing pathway as opposed to trig- 
gering cellular proliferation (an activity previously shown to be 
associated with IL-2). 

S.G. is a Special Fellow of the Leukemia Society of America and is 
supported in part by National Cancer Institute Grant CA28419 and 
Grant 1-724 horn the National Foundation-March of Dimes. 
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PURIFICATION TO HOMOGENEITY OF B CELL 

STIMULATING FACTOR 
A Molecule That Stimulates Proliferation of Multiple 
Lymphokine-dependent Cell Lines 

By KENNETH GRABSTEIN. JUNE EISENMAN. DIANE MOCHIZUKI, 
KURT SHANEBECK. PAUL CONLON. THOMAS HOPP, 
CARL MARCH, and STEVEN GILLIS 

From the Immunex Corporation, Seattle, Washington 98 10 1 

The differentiation of resting B lymphocytes to Ig secretion involves several 
sequential steps regulated by antigen, T lymphocytes, and macrophages. The 
requirement for T cells may in some cases be replaced by lymphoktnes secreted 
ufK>n T cell activation (1, 2). A number of distinct T cell-derived lymphokines 
have been described (3-9) that directly regulate the growth and maturation of 
B cells. These include lL-2 (3, 4), IFN-7 (5, 6), and at least two additional, yet 
less well-characterized molecules, B cell stimulating factor, (also known as B cell 
growth factor, BCGF or BSF-1)' (7, 8), and B cell differentiation factor (BCGF- 
n or BCDF) (9). BSF-1 was originally (7) described as a factor that stimulated 
proliferation of B cells in conjunction with a submitogenic concentration of anti- 
Ig. Recent studies (10, II) have shown that semipurified BSF-1 acts on resting 
B cells, facilitating their entry into S phase upon subsequent interaction with 
anti-Ig. These studies (11) indicate that BSF-1 might actually be a differentiation 
factor, and they further suggest that BSF-1 may not have growth factor activity. 

We have purified BSF-1 to homogeneity from culture supematants of mitogen- 
activated EL4 thymoma cells. We used the proliferation of highly purified splenic 
B cells in the presence of anti-IgM as an assay in our purification procedure. The 
purification was also monitored for additional lymphokines using the factor- 
dependent cell lines CTLL-2 and FDC-P2. Interestingly, fractions containing 
BSF-1 always contained stimulatory activity for both IL-2- and IL-3-dependent 
cell lines, even though these fractions were devoid of IL-2 and IL-3 protein. AT- 
terminal amino acid sequencing of homogeneous BSF-1 revealed a unique 
protein sequence completely different from that of murine IL-2 or IL-3. Based 
on these results, we conclude that BSF-I is both a growth and differentiation 
factor that may have biologic effects beyond the B lymphocyte compartment. 

Materials and Methods 

Mice, Female C57BL/6J mice were obtained from The Jackson Laboratory, Bar 
Harbor, ME, and were used at 8-12 wk of age. 

* Abbreviations used m this paper: BSF-1 , B celi-stimulating factor; DIEA-Ac, A^,A^-dii50propyJeih- 
ylaniine; TKA, trifluoroacetic acid; TMS«silica, trimethylsilyl-silica. 

J. Exp. Med. © The Rockefeller University Press * 0022-1007/86/6/1405/10 SLOO 1405 
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Preparation of B Lymphocytes. Mice were pretreated with intrapieritonea] injections of 
T24 rat anti-mouse Thy-1 mAb (12). 3d later, the spleens were removed and treated in 
vitro with a cocktail of T24, GK1.5 rat anti-mouse L3T4 mAb, rabbit anti-mouse 
thymocyte serum (liver- and bone marrow-absorbed) and rabbit complement (Pel-Freeze 
Biologicals, Rogers, AR). The T cell-depleted spleen cells were then passed over Sephadex 
GIO to remove adherent cells, and the B lymphocytes were positively selected by panning 
on petri dishes coated with goat anti-mouse IgM (Cooper Biomedical, Inc., Malvern, PA). 
Purified B cells contained undetectable levels of T cells and >98% B cells as judged by 
flow cytometer analysis (Epics-C; Coulter Electronics, Inc., Hialeah, FL) using T24 and 
anti-L3T4 rat mAb, followed by FlTC-conjugated rabbit anti-rat Ig (Becion Dickinson 
Immunocytometry Systems, Mountain View, CA) to detect T cells, and FITC-conjugated 
rabbit anti-mouse IgM to detect B cells. In addition, such a B cell |x>puIation was found 
to be completely unresponsive to the T cell mitogen, Con A, but retained full responsive- 
ness to the B cell mitogen LPS. 

Cell Lines. EL4 thymoma cells were maintained in RPMI-1 640 supplemented with 5% 
FCS. 50 U/ml penicillin, 50 Mg/nil streptomycin and 2 mM glutamine. For the production 
of lymphokine-containing supernatants, EL4 cells were stimulated in serum-free medium 
with 1% PHA (PHA-M; Difco Laboratories, Detroit Ml) and 10 ng/ml PMA (Sigma 
Chemical Co., St. Louis, MO). Cell-free supernatant was collected by centrifugation after 
24 h. 

CTLL-2 an IL-2-dependent murine T cell line (13) was maintained in Click's medium 
(Ailick Associates, River Falls, WI) containing 10% FCS, 50 U/ml penicillin, 50 Mg/ml 
streptomycin, and 100 U/ml of IL-2 from a supernatant of rat spleen cells cultured 24 h 
with Con A (Pharmacia Fine Chemicals. Piscataway, NJ). 

FDC-P2, a factor-dependent murine cell line derived from long-term bone marrow 
cultures (14) was maintained in RPMl-1640 containing 20% horse serum, 50 U/ml 
penicillin, 50 Mg/ml streptomycin, 50 /iM 2-ME, and 10% WEHI-3 cell line-conditioned 
medium. 

32D, an IL«3-dependent murine hemopoietic cell line (15) was kindly provided by Dr. 
James Watson (Auckland Medical School. Auckland, New Zealand) and was maintained 
in Click's medium containing 10% WEHI-3 cell line-conditioned medium, 10% FCS, and 
antibiotics. 

Cellular Assays, BSF-1 was assayed by its ability to stimulate the proliferation of 
purified B cells in the presence of a submitogenic concentration of goat anti-mouse IgM 
(7). B cells were cultured (10^ cells/culture) in 200 /xl containing 3-5 Mg/n^l of affinity- 
purified goat anti-mouse IgM (Cooper Biomedical, Inc.) and serial dilutions of test sample. 
After 72 h, cultures received 2.0 /iCi of [^H]thymidine (75 Ci/mmol, New England 
Nuclear, Boston. MA) for 6 h, were harvested onto glass fiber filters, and incorporation 
of radioactivity was measured. 

All proliferation assays using factor-dependent cell lines were performed using 2x10' 
cells/culture in 100 /ul of Click's medium containing 10% FCS, antibiotics, and test 
samples. After 24 h, cultures received 2.0 fjiCx of [*H]thymidine (75 Ci/mmol) for 6 h, 
were harvested onto glass fiber filters, and incorporation of radioactivity was measured. 
For all proliferation assays, 1 U of activity was defmed as the amount of lymphokine that 
induced 50% of maximal proliferation in 100 ^1 cultures. For example, if a sample induced 
50% of maximal proliferation at a dilution of 1:20, then 1 U was said to be contained in 
one-twentieth, of 100 /xl, or 5 a*I» and the sample said to contain 200 U/ml. 

Lymphokines. A mouse lL-2 cDNA was cloned from a library prepared from LBRM- 
33 lymphoma cell mRNA by Immunex Corf>oration and was expressed in yeast using the 
yeast a factor promoter and leader sequences to direct synthesis and secretion (16), rlL- 
2 was purified to homogeneity as previously described (17), and had a specific activity of 
10^ U/mg. 

A mouse lL-3 cDNA was also cloned from the above-detailed library by Immunex 
Corporation, expressed in yeast, and resultant rIL-3 was purified to homogeneity (18). 
Purified murine rIL-3 had sp act of 1.1 x 10'^ U/mg. 

BSF-I Purification. BSF-1 was purified from 71.5 liters of supernatant from PHA and 
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PM A-stimulated EL4 thymoma cells. BSF-1 activity was extracted from crude supemaunts 
by adsorption to trimethylsilyl-silica (C-l, Sepralyte; Analytichem International, Harbor 
City, CA) as previously described (19), with modifications. Crude supemaunt was acidified 
with 0.1% trifluoroacetic acid (TFA), and trimcthylsilyl (TMS)-silica added (10 g/liter). 
After stirring 1.5 h. the supernatant vras decanted. The TMS-silica was poured into a 
column, washed with 20% acetonitrile with 0.1% TFA, and the BSF-1 was eluted with 
75% acetonitrile and 0.1% TFA. Acetonitrile was removed by rotary evaf>oration, and 
the aqueous phase was adjusted to 5 mM sodium citrate, 50 mM NaCi, pH 5.5, 

The TMS-silica-purified BSF-1 was fractionated on carboxymethyl cellulose (CM52; 
Whatman, Inc., Clifton, NJ). BSF-1 bound and was eluted with a linear NaCl gradient. 
The fractions containing BSF-1 were }x>oled, dialyzed against 20 mM Tris, pH 9.0, and 
fractionated on quarternary aminoethyl cellulose (QA52, Whatman, Inc.) to which BSF- 
1 bound, and was eluted with a linear NaCl gradient. 

Reversed-phase HPLC fractionation of partially purified BSF-1 preparations was f)er- 
formed on a 4.6 X 250 mm Vydac 2 1 8TP (Cih) column (The Separations Group, Hesperia, 
CA) using a Beckman Model 344 solvent delivery system. For purification step HPLC I 
(Table 1), the column was equilibrated with 0.1% TFA in water at a flow rate of 0.8 
ml/min. Fractions containing BSF-1 activity from the QA52 column were pooled, adjusted 
to pH 2 with TFA, and injected onto the column. The column was washed for 10 
additional minutes with 0.1% TFA and then brought to 10% acetonitrile (containing 

0. 1% TFA) over 2 min. After an additional 8 min of equilibration at 10% acetonitrile, a 
linear gradient from 10 to 70% acetonitrile in 0.1% TFA was run over 60 min (1% per 
minute), and 1-min fractions were collected. 

For HPLC 11, active fractions from HPLC 1 were pooled and concentrated in vacuo to 
250 /il, and injected onto the same column, equilibrated with 50 mM acetic acid adjusted 
to pH 4.50 with iVJV-diisopropylethylamine (DIEA-Ac), at a flow rate of 0.7 ml/min. 
After a 5 min wash with DI£A-Ac. the column was brought to 5% n-propanol over 2 min. 
After an additional 8 min of washing, a linear gradient from 5 to 40% n-propanol in 
DIEA-Ac over 70 min (0.5%/min) was run, and 1 min fractions were collected. 

N-terminal Protein Sequencing, Amino-terminal amino acid sequencing was performed 
on an Applied Biosystems Model 470A protein sequencer (Applied Biosystems Inc., Foster 
City, CA). Homogeneous BSF-I from HPLC II fractions 61 and 62 (Fig. 1) were 
concentrated in vacuo to a final volume of 30 ii\ and then spotted onto a conditioned 
sequencer filter. Sequencing and PTH amino acid analysis was performed as described 
previously (20). 

SDS'PAGE. Fractions from the purification steps were monitored by SDS-PAGE and 
subsequent silver staining, as described previously (21). 

Results 

Initial Purification, BSF-1 was purified from serum-free supernatants of PH A- 
and PMA-stimulated EL4 cells as described in Materials and Methods and Table 

1. BSF-1 was extracted from crude supernatants by adsorption to TMS-silica. 
BSF-1 was eluted in TFA/acetonitrile, and further fractionated by a combination 
of cation- (CM52) and anion- (QA52) exchange chromatography (Table I). 

HPLC. Biologically active fractions eluted from the QA52 column were 
pooled and fractionated by HPLC on a CIS column as described in Materials 
and Methods. After one fractionation using a TFA/acetonitrile buffer system 
(HPLC I), an intense band at 18.4 kD was detected by silver staining on SDS- 
PAGE gels (Fig. 1, lane i, 5Af), which correlated with BSF-1 biological activity. 
BSF-1 biological activity was eluted from the HPLC column in 42% acetonitrile. 

The biologically active fractions from HPLC I were further fractionated using 
the same HPLC column equilibrated in a different buffer system (see Materials 
and Methods). As shown in Fig. 1 , this fractionation yielded homogeneous BSF- 
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Table I 








Purification ofBSF-l 




Step 


Total 


Specific 


Yield 


activity 


activity 










% 


Crude 


21.4 


3 


100 


TMS-silica 


19.7 


100 


92 


CM52 


13.7 


2.500 


64 


QA52 


5.6 


26.000 


26 


HPLC I 


5.1 


127.100 


24 


HPLCII 


1.6 


328.000 


8 



BSF-1 was purified from 71 .5 liters of crude EL4 supernatant as described 
in Materials and Methods. Units of BSF-I activity were determined using 
purified B cells. Protein was measured using the Bradford protein assay 
(BioRad Laboratories. Richmond. CA). 



I protein as detected by silver-stained SDS-PAGE, and as confirmed by protein 
sequencing. The peak of BSF-1 biological activity eluted at 52% n-propanol. 
Although acceptable yields of protein were recovered at this step, some loss of 
biological activity was observed, which possibly was due to denaturation of the 
BSF-1 molecule in the propanol solvent. The observed sp act of this BSF-1 was 
3.28 X 10" U/mg. 

Protein Sequencing. Fractions 61 and 62 (Fig. 1) from the HPLC Il-purified 
preparation were subjected to amino-terminal protein sequencing. Only one 
sequence was obtained, consistent with the preparation of BSF-1 having been 
purified to homogeneity. The yields for each cycle of amino-terminal sequencing 
are shown in Table II. The initial yield, 56%, is based upon isoleucine in cycle 
two, due to typically poor yields for histidine residues (cycle one). The absolute 
yield of 67.4 pmol isoleucine in cycle two is consistent with the expected yield 
(50-70%) from the 120 pmol (2.2 Mg) of BSF-1 applied to the sequencer filter. 
The first 20 residues, His-Ile-His-Gly-Cys-Asj>Lys-Asn-His-Leu-Arg-Glu-Ile-lle- 
Gly-Ile-Leu-Asn-Glu-Val, were (ound to be a unique sequence when compared 
to previously published reports (22). The assignment of Cys to f)Osition 5 is 
tentative, since no signal was observed for cycle 5 and the protein was not 
modified before sequencing. 

Biological Activity. All fractions throughout the purification were simultane- 
ously monitored for BSF-1, IL-2 (CTLL proliferation), and IL-3 (FDC-P2 
proliferation) activities. The majority of IL-2, as measured by CTLL prolifera- 
tion, separated from BSF-1 on the CM52 column. However, subsequent to that 
step, all CTLL activity eluted precisely with the BSF-1 activity. EL4 supernatants 
contained no lL-3 protein, and all FDC-P2 activity eluted with BSF-1 . The CTLL 
and FDC-P2 cell line response assays, using HPLC-purified BSF*1 fractions, are 
shown in Fig. 2. It is clear that purified BSF-1 has the capacity to score positively 
in both GonveniionaJ IL-2 and IL-3 assays. 

As seen in Fig. 3, several factor-dependent cell lines were compared for their 
responses to IL-2, IL-3, and BSF-1. CTLL, FDC-P2, and 32D all respond to 
BSF-I, but to a lesser extent (a lower maximum incorporation and a shallower 
slop>e) than to either lL-2 or IL-3. 
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fraction number 



Figure I. Purification of BSF-1 by HPLC. HPLC fractions from the final purification step 
(HPLC 11) shown in Table 1 were assayed for BSF-1 activity {top) and analyzed by SDS-PAGE 
{botiom) as described in Materials and Methods. SM (starting material) in lane 7 is the pool of 
active fractions from HPLC I. 



Discussion 

We have purified murine BSF-1 to homogeneity from supematants of stimu- 
lated EL4 thymoma cells» and have determined its amino-terminal amino acid 
sequence. Homogeneous BSF-1 was found to have a molecular mass of 18.4 kD, 
with a sp act of at least 3.28 X 10*^ U/mg. In addition, this lymphokine was found 
to stimulate the proliferation of both IL-2- and IL-3-dependent cell lines. 

BSF-1 was originally described as B cell growth factor because it stimulated 
partially purified splenic B lymphocytes to proliferate in the presence of submi- 
togenic concentrations of goat anti-mouse IgM (7). However, recent evidence 
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Table II 

Protein Sequence from Purified Murine BSF-l 



Yield (pmol) from cycle number: 



acid 
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Sequence assigned to fim 20 amino acids of BSF-1 is: Hts-Ile-His^ly-Cys-AapLys-Asn-His-Leu-Arg-Clu-lle-llc-Gly-Ile-Lcu-Asn-Clu-Val. 

* Vakie* <]0 pmol are omitted for clarity. 

* Assigned as Cys due to lack of other agnab; sullhydryl groups were not modified before sequencing. 




d8 59 60 61 62 63 64 65 66 67 



fraction number 

Figure 2. Assay of HPLC II fractions on CTLL and FDC-P2 cells. HPLC fraclions from 
the final purification step (HPLC II) shown in Table 1 and Fig. 1 were assayed in the CTLL 
and FDC>P2 proliferation assays as described in Materials and Methods. 



(10, 11) suggests that semipurified BSF-1 is not a direct growth factor for small 
resting B cells, but rather primes them for subsequent entry into S phase on 
encounter with anti-lgM, indicating that BSF-1 is a differentiation factor. Con- 
sistent with this role of BSF-1 are the findings that partially purified BSF-1 
induces the cell surface expression of la on B cells (23, 24), and the secretion of 
IgGl by LPS-activated B cells (25-27). Roehm et al. (24) have also observed that 
partially purified BSF-l augments the antigen-presenting function of B cells. 
Our observation that BSF-1 directly stimulates proliferation of IL-2- and IL- 
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Figure 3. Response of B cells and factor-dependent cell lines to homogeneous lymphokines. 
Homogeneous BSF-1 (□), rIL-2 (A), and rIL-S (O) were tested for the stimulation of purified 
B cells {A), FDC-P2 (5). 32 D (C)» and CTLL (D) as described in Materials and Methods. 
Threefold dilutions of each lymphokine were tested starting at 500 U/ml and results reported 
in cpm X 10~*. 

3-dependent cell lines indicates that BSF-1 is a grrowth factor as well as a 
differentiation factor. This should not be surprising* as similar results have been 
obtained in studies of other lymphokines. For example, IL-2 stimulates prolif- 
eration of T cells, however, it has recently become evident that IL-2 also elicits 
a variety of differentiated T cell functions (28-30). Moreover, granulocyte- 
macrophage-CSF (GM-CSF) has also been shown to activate difTerentiated mac- 
rophage (3 1) and granulocyte (32) effector functions, in addition to its traditional 
growth-stimulating activity on myeloid stem cells in the marrow. 

Lymphokines that act on multiple lineages of cells are not without precedent. 
lL-2, once thought to act only on T lymphocytes, has recently (3, 4) been shown 
to drive proliferation of B cells as well. Additionally, several IL-3-dep>endent 
nonlymphoid cell lines have also been shown capable of responding to IL-2 (33, 
34). Sanderson et al. (35) have reported that another B cell stimulating factor, 
BCGF-II, also acts as an eosinophil differentiation factor. One important caveat 
remains: although we have shown that BSF-1 stimulates non-B cell, factor- 
dependent cell lines to proliferate, it has yet to be determined whether this 
reflects a role of BSF-1 on normal cells, or whether those responses represent 
abnormal capabilities on the part of cells maintained in vitro for extended periods 
of time. 

The ability of BSF-1 to act on IL-2- and IL-3-dependent cell lines suggests 
that BSF-1 has target cells beyond the B cell lineage, and may have significant 
functions other than the regulation of B cell growth and maturation. 



Murine B cell stimulating factor 1 (BSF-1) was purified to homogeneity from 
supernatants of a stimulated thymoma cell line. A protein of 18.4 kD with a 
unique iV-terminal amino acid sequence was identified. BSF-1 had a sp act of at 
least 3.28 X 10^ U/mg. In addition to its B cell-stimulatory activity* BSF-1 also 



Summary 
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Stimulated the proliferation of several IL-2- and IL-3-dependent ceil lines. We 
conclude that BSF-1 is both a growth factor and a dilTerentiation factor. Finally, 
these results also suggest additional biologic properties of BSF-1 on lineages 
besides B lymphocytes. 

We gratefully acknowledge the technical contributions of Carol Ramthun, Elizabeth 
Ck>nrad» and FJt/abeth McGrath, and we thank Judy Byce for typing the manuscript. 
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The possible role of homodimerization events in intra- 
cellular signal transduction triggered by the bipartite 
human interleukin-4 receptor was addressed. We gener- 
ated cell lines functionally expressing derivatives of the 
two receptor subunits a and y, which allow for a specific 
and background-free experimental induction of intra- 
cellular homo- and heterodimers. A heterodimer of a 
and y released an intracellular signal, whereas a y^y 
homodimer did not. Unexpectedly, we found the intra- 
cellular domain of interleukin-4 receptor a chain to 
evoke cell proliferation and activation of tyrosine ki- 
nase Jakl as well as of transcription factor Stat6 upon 
homodimerization. Both recruitment of the common y 
chain and activation of kinase JakS were shown to be 
dispensible for these processes. 



Interleukin-4 (IL-4)^ is a pleiotropic immune regulator with 
a pivotal role in certain allergic processes (1). The bipartite IL-4 
receptor comprises the interleukin-4 receptor a chain (IL-4Ra) 
(2) and the common y receptor chain (7c) (3, 4). Both receptor 
subunits belong to the cjrtokine receptor superfamily (5) and 
are shared by other cytokines; yc is also psirt of the receptors for 
IL-2, IL-7, IL-9, and IL-15 (6), and IL-4Ra contributes to the 
IL-13 receptor (7, 8). 

Ligand^induced juxtaposition of the cytoplasmic domains of 
IL-4Ra and yc is believed to be a mandatory step in intracel- 
lular signaling which involves recruitment and activation of 
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kinases Jakl and JakS (9, 10), transcription factor StatB (11), 
and the adaptor molecule IRS-2 (12). However, the architecture 
of the IL-4R complex as well as the molecular mechanisms 
underlying the specificity of IL-4-induced signal transduction 
are to date poorly understood. 

Making use of the strictly species-specific interaction of in- 
terleukin-4 with IL-4Ra chain, factor-dependent murine cells 
were rendered responsive to hIL-4 by expressing human IL- 
4Rac (2, 13—16). An implication of these results is the ability of 
human IL-4 to activate IL-4 receptor complexes containing 
either human or murine common y chain, thus complicating ah 
analysis of the composition of the signaling competent receptor 
subunit assembly. 

In order to study the role of receptor chain dimerization 
events in signal release by the hIL-4R complex, we generated 
an expression system for receptor subunits that allowed us to 
experimentally induce specific and background-fi*ee intracellu- 
lar hetero- and homodimerization. 

Our results show that the juxtaposition of two intracellular 
domains of IL-4Ra can act as the trigger of specific signaling, 
including the activation of Jakl and StatB and the induction of 
cell proliferation. Surprisingly, a hitherto assumed participa- 
tion of the cytoplasmic portion of common y chain and of yc- 
associated kinase JakS is not required. 

MATERIALS AND METHODS 

DNA Manipulations, Stable Tranafection of Mouse Cells, and Detec- 
tion and Quantification of Receptor Expression — Recombinant DNA 
work was performed according to standard procedures (17). The murine 
pre-B cell line Ba/F3 (18) has been described. BAF-4a-py, a Ba/F3 
derivative expressing both subunits of the human IL-4R, is identical to 
BAF-4Rh7 (16). 

Hybrid receptor genes were generated by polymerase chain reaction 
amplification of gene fragments from pKCR-py (16) encoding the 
epitope-tagged extracellular domain and transmembrane/intracellular 
domain of human yc and exchanging them for the corresponding frag- 
ments iBamUVXhol or XhoVHindUl) in pKCR-4a. The resulting ex- 
pression plasmids pKCR-4a/y and pKCR-pVa were cotransfected into 
Ba/F3 cells as described (16). 

Surface expression of receptor constructs was assayed by reacting 
intact cells with antibodies X 14738 (16, 19) or P5D4 (20) specific for the 
extracellular portions of recombinant hIL-4Ra or epitope-tagged hu- 
man yc, respectively, and subsequent detection of bound antibodies by 
peroxidase-coupled secondary antibodies as detailed elsewhere (21). 
Briefly, 10* cells in a microtiter well were incubated on ice for 30 min 
with 5 /Ag of antibody in a volume of 50 fi.1 of phosphate-buifered 
saline/3% bovine serum albumin. After washing twice, cells were resus- 
pended in 100 ^1 of a 100 /xg/ml solution of peroxidase-conjugated goat 
anti-mouse IgG (Dianova) and kept on ice for 30 min. Cell-bound sec- 
ondary antibody was detected by transferring the cells to 50 ftl of a 
solution containing 0.1 m Tris/HCl, pH 8.5, 2.5 mM 3-aminophthalhy- 
drazide (Fluka), 400 ftM p-coumaric acid (Sigma), 5.4 mM Yi^O^ and 
measuring elicited chemiluminescence using a MicroLumat LB 96P. 
Quantitation of surface-bound antibody molecules was achieved by 
relating the determined intensity of luminescence to a calibration series 
of samples containing known concentrations of peroxidase. 

Cell Culture, Cytokines, and Proliferation Assay — Cell maintainance 
and preparation of hIL-4 and mutant Y124D has been described previ- 
ously (16). Recombinant murine IL-4 was purchased from Sigma. Cy- 
tokine-induced proliferation of cell lines was measured by [^H] th3mii- 
dine incorporation into cLe novo synthesized DNA as described (16). 

Immunoprecipitation, Immunoblotting, and Chemical Cross-link- 
ing — Samples of 3 x lO*' cells were incubated at 37 *C for 10 min in 1 ml 
of RPMI containing no cytokine, 7 nM of IL-4, or 50 nM of antibody P5D4 
and subsequently lysed as described (16). Cleared lysates were incu- 
bated with 1-5 ^tg of specific antibody. Antibodies used for immunopre- 
cipitations were 4G10 (anti-phosphotyrosine, Upstate Biotechnology), 
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E34-1 (22) and anti-Jakl rabbit serum (23). Immunocomplexes were 
precipitated from lysates with 50 /xl of anti-mouse IgG-agarose or pro- 
tein A-Sepharose (Sigma) and assayed as described (16) using peroxi- 
dase-conjugated antibody RC20 (Transduction Laboratories) at a final 
concentration of 0.1 /xg/ml. lodination of hIL-4, cross-linking of radioli- 
gand to cell-surface receptors, and analysis of immunoprecipitated com- 
plexes by electrophoresis was carried out as described (19). 

Analysis of Stat Activation by Electrophoretic Mobility Shift Assay — 
Whole cell extracts were prepared from cells stimulated with TLA or 
antibody as described above by suspension of cell pellets in a buffer 
containing 20 mM Hepes, pH 7.9, 400 mM NaCl, 1 mM EDTA, 20% 
glycerol, 1 mM dithiothreitol, 0.2 mM phenylmethylsulfonyl fluoride, 5 
^g/ml leupeptin, 5 ^ig/ml, and 100 ^tM sodium orfAo-vanadate followed 
by three freeze-thaw cycles and centrifugation at 4 °C and 14,000 rpm 
for 15 min. Supematants equivalent to 10* cells were used for bandshift 
assays performed as described (24). As a probe, the Stat6-binding se- 
quence 5'-GTCAACTTCCCAAGAACAGAA-3' derived from the human 
le-promoter (25) end-labeled with polynucleotide kinase to a specific 
activity of 8.000 cpm/finol was applied. Supershifting of Stat6 contain- 
ing complexes was achieved by adding to the binding reactions before 
electrophoretic mobility shift assay 1 /ig of a chicken antibody directed 
to amino acids 637—847 of murine StatB.' 

RESULTS AND DISCUSSION 

We intended to reconstitute in murine cells a functional 
interleukin-4 receptor complex activable exclusively by human 
IL-4, which would not evoke any background signaling due to 
interference with the endogenous murine IL-4 receptor. To this 
end, we generated a pair of expression constructs encoding 
hybrid receptor chains derived from hIL-4Ra and hyc with 
mutually exchanged intracellular domains (Fig. lA) and intro- 
duced it into the murine pre-B cell line Ba/F3. 

One clone expressing both 4oi/y and pyfAa chimeras was 
termed BAF-4a/')^py4a. The number of surface-expressed re- 
ceptor molecules per cell was determined in comparison with 
cell line BAF-4a-pY bearing both subunits of the authentic 
human IL-4R (Fig. IB). As measured by the binding of specific 
antibodies recognizing the extracellular receptor domains, in 
both cell lines surface expression of the receptor chain compris- 
ing the intracellular domain of 7c was considerably higher than 
that of the subunit bearing the intracellular part of hILr4Ra. 
Irrespective of the "authentic** or "cross-over" composition of 
the heterologous subunits, similar hIL-4 binding receptor com- 
plexes could be formed in both cell lines as revealed by immu- 
noprecipitation of receptor chains cross-linked to radiolabeled 
hIL-4 (Fig. IC). 

To test if the bipartite human IL-4R with exchanged cyto- 
plasmic domains was capable of transmitting specific signals to 
the cell interior, we measured IL-4-induced cell proliferation. 
When stimulated with hIL-4, BAF-4Q!/7-p7/4a cells expressing 
the combination of hybrid receptors, like BAF-4a-py cells, 
showed a proliferative response (Fig. 2A). 

In BAF-4a-pY cells, hIL-4 mutant Y124D evoked 60% of the 
DNA S3nnthesis induced by wild type IL-4. We have previously 
shown that this degree of reactivity is due to preferential in- 
teraction of Y124D with murine yc (16). When assaying BAF- 
4o£/y-py4a cells, we found, as earlier observed with human ILi-4 
reactive cells (19), only 30% of wild type activity for hIL-4 
variant Y124D. This result indicates that hIL-4 cross-over re- 
ceptor, as anticipated and unlike its authentic counterpart, 
precludes the formation of productive receptor complexes in- 
volving endogenous murine common y chain. 

Stimulation with hILr-4 resulted in equivalent patterns of 
t3Tosine-phosphorylated proteins in the two cell lines (Fig. 2B). 
The intracellular domain of hIL-4Ra is a major substrate of 
ligand-induced phosphorylation as revealed by specific immu- 
noprecipitation (data not shown). Moreover, the modified hIL-4 
receptor was found to recapitulate hIL-4-specific activation of 

^ M. Heim and R. Moriggl, unpublished data. 
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Fig. 1. Generation and characterization of cell lines express- 
ing hILi-4 receptor constructs. A, schematic representation of bipar- 
tite authentic hIL-4 receptor expressed in BAF-4a-p7 cell (top pair) in 
hILr4 cross-over receptor expressed in BAF-4cx/Y-p7/4a cells {bottom 
pair). 8, signal peptide; ex, extracellular domain; transmembrane 
domain; in, intracellular domain; p, epitope tag recognized by antibody 
P5D4. B, analysis of receptor chain surface expression in BAF-4a-p7 
and BAF-4ci£/y-py4a cells. Samples of 10® cells were reacted with anti- 
bodies directed to the extracellular domains of hIL-4Ra or epitope- 
tagged yc, respectively, washed, and stained with peroxidase-coixju- 
gated secondary antibody as described under ^^aterials and Methods.** 
Numbers of bound enzyme molecules per cell equivalent to receptor 
chain copies were determined by quantification of elicited chemilumi- 
nescence and correlation of the signal intensity with a calibration series 
obtained by measuring luminescence produced by different known 
amounts of peroxidase under assay conditions. C, analysis of ligand- 
receptor complexes formed on BAF-4a-py and BAF-4a/y-p7/4a cells. 
After chemical cross-linking of *^'^I-hIL-4 to the two cell lines, receptor 
complexes were immunoprecipitated using the indicated antibodies and 
subsequently resolved and visualized by SDS-polyacrylamide gel elec- 
trophoresis and autoradiography. Radioligand cross-linked receptor 
chains and complexes are marked with arrows. 



janus kinases Jakl and Jak3. 

We next employed the model receptor system to address the 
individual roles of the IL-4 receptor subunits in signaling. The 
ligand and antibody binding properties of the functionally ex- 
pressed receptor constructs enabled us to specifically induce all 
three possible intracellular receptor dimers (Fig. 3A). In BAF- 
4oc/y-p7/4a cells, not only hIL-4-induced heterodimerization of 
the two intracellular receptor domains but surprisingly also 
antibody-mediated cytoplasmic homodimerization of hIIr4Ra 
via the extracellular P5D4 epitope tag lead to cell proliferation 
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Fig. 2, Cytokine-induced responses of BAF-4a-py and BAF«4a/ 
Y>py/4a cells. A, cytokine-induced proliferation of Ba/F3 cells and 
transfected derivatives. Cells were incubated with saturating concen- 
trations of the indicated cytokines, and cell proliferation was deter^ 
mined by [^Hlthjrmidine uptake after 24 h. Radioactivity incorporated 
in response to murine IL-3 was set 100%. B, hIL-4-induced tyrosine 
phosphorylation in cell lines BAF-4a-p'y and BAF-4a/7-p7/4a. Lysates 
from hIL-4-8timulated or -unstimulated cells were subjected to immu- 
noprecipitation with the indicated antibodies and Western blot analysis 
using anti-phosphotyrosine antibody for detection as described under 
"Materials and Methods." Positions of prominent phosphorylated pro- 
teins are marked by arrows. 



(Fig. 3B). Antibody-induced homodimerization of 7c intracellu- 
lar domains in BAF-4a-py cells did not result in elevated DNA 
synthesis. The concentrations of hIL-4 and antibody P5D4, 
respectively, eliciting a proliferative response are in concord- 
ance with reported dissociation constants for the binding of 
hIL-4 to the high affinity hIL-4R of (100 pM) (26) and for the 
interaction between antibody P5D4 and its cognate epitope 
(100 nM) (21). The bell-shaped dose-response curve for the 
antibody-activity on BAF-4a/'y-py4a cells indicates a blocking 
of receptor cross-linking by monovalent antibody binding at 
excess concentration and thus underscores our notion of P5D4- 
induced receptor homodimerization causing proliferation. 

Comparing the activation of j anus kinases known to be in- 
volved in IL-4 receptor complex function by hetero- or ho- 
modimerization, respectively (Fig. 3C), we found that antibody- 
induced intracellular homodimerization of hIL-4Ra results in 
tyrosine phosphoiylation of Jakl but not of JakS. Homodimer- 
ization of intracellular 7c does not lead to a detectable phos- 
phorylation of Jakl or JakS, whereas the heterodimer of a and 
y evokes the activation of both kinases. Activation of Stat6, as 
assayed by its property to bind to a cognate DNA sequence 
derived &om the l€-promoter, is induced not only by an intra- 
cellular heterodimer of a and y but also by an a-a homodimer 
(Fig. 3Z)).'From these results we conclude that both the cyto- 
plasmic domain of yc and activated JakS are not mandatory for 
Stat activation and for the onset of a signaling cascade leading 
to cell proliferation. The essential trigger for the release of 
these events is rather the juictaposition of two intracellular 
domains of IL-4 receptor a chain and the concomitant activa- 
tion of Jakl by t3nrosine phosphorylation. 

It is to date poorly understood how the common y chain 
contributes to signaling mediated by dififerent cytokine recep- 
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Fig. 3. Directed cytoplasmic hetero- and homodimerization of 
hIL-4R subunits and resulting signal transduction. A, schematic 
representation of stimulus-induced dimerization events. /, hIL-4-in- 
duced selective intracellular heterodimerization of hIL-4Ra and human 
-yc in BAF-4a/y-p7/4a cells. //, intracellular homodimerization of hlL- 
4Ra via extracellular antibody and epitope tag in BAF-4a/y-p'>/4a cells. 
///, antibody-induced homodimerization of yc in BAF-4a-py cells. B, cell 
proliferation evoked by the stimuli depicted under A. The respective cell 
lines were incubated with the indicated concentrations of hILf-4 (/) or 
antibody P5D4 (// and III) for 24 h before pHJthyTnidine uptake was 
measured. C and Z>, activation of Jak kinases (C) and activation of StatG 
(D) by the stimuli depicted under A. The respective cell lines were 
stimulated for 10 min with 10 nM hIL-4 (/) or 100 nM P5D4 (// and III). 
Cells were then lysed and subjected to immunprecipitations with anti- 
Jak antibodies and probing with anti-phosphotyrosine antibody (C) or 
to a band shift assay using a labeled probe derived from the Ic-promoter 
(Z» as described under ''Material and Methods." 

tor complexes and how specificity of these receptors is achieved 
despite their sharing of a subunit. The only defined biochemical 
function of yc is the recruitment of Jak3 to the receptor complex 
(27), This very process, however, has been shown not to be 
essential for the specific activity of the IIi-2 receptor; it can 
rather be replaced by artificially introducing Jak2 into the 
assembly (28). In this report we show that in the human IL>-4 
receptor system neither yc and JakS nor any substitute is 
required for the release of an intracellular signal if the intra- 
cellular domain of hIL-4Ra is experimentally homodimerized. 
Our results raise new questions about the role of yc in the 
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function of the IL-4 receptor. The interaction of ligand with the 
extracellular domains of both IL-4Rcc and yc is necessary for 
IL-4-induced signal transduction, because mutant forms of 
hIL-4 defective in contacting yc fail to stimulate cell prolifera- 
tion (26, 29). Functional properties of the intracellular domain 
of yc, in the activation of this particular receptor system have 
not yet been addressed. Our data indicate that it is not involved 
in the release of intracellular signals specific for IL-4 and 
support the notion of a more general role for yc in the formation 
of the signaling competent IL-4R and probably also other C3^- 
kine receptor complexes. In ligand-induced IL-4R activation, 
one function of yc and Jak3 could be the promotion of a tran- 
sient assembly of two or more copies of hIL-4Ra, a situation 
which in turn would lead to specific intracellular signal trans- 
duction. Alternatively, in the natural receptor complex, yc- 
mediated recruitment of Jak3 might result in an activation of 
Jakl, an event that in our model experiment is mimicked by 
the juxtaposition of two Jakl molecules and serves as the 
master trigger for the various activities of hIL-4Ra. A more 
general version of such an interpretation of exchangeable Jaks 
in the hIL-4R complex would be the view that ligand-induced 
intracellular apposition of several combinations of two Jak 
molecules would suffice to evoke cell proliferation and the other 
reactions observed. In this scenario, the m^or function of the 
specific receptor chain (here: hIL-4Ra) would be to provide 
recognition sites for Stats and other downstream components 
that upon Jak-driven activation mediate the particular effects 
of IL-4. Directed homodimerization of yc does not result in 
similiar activities because of its lack of recognition sites for 
downstream signaling molecules. Also in line with such an 
explanation would be the notion of cytokine receptor signal 
transduction being relatively unselective and flexible in terms 
of interactions between receptor chains and intracellular bind- 
ing partners. This would imply that the main event regulating 
specificity in cytokine signaling is the recognition between re- 
ceptor and ligand and the thereby cross-linked combination of 
receptor subunits. 

To discriminate between the two principal explanations com- 
patible with our results (involvement of receptor multimers in 
"natural" hIL-4 receptor activation or low specificity of Jak 
activity combined with recruitment of signaling molecules by 
hIL-4Ra via specific recognition sites), careful investigation of 
the stoichiometric subunit composition of the active hIL-4 re- 
ceptor complex and a mutational analysis of the cytoplasmic 
portion of yc in this context are necessary. Also, the molecular 



details of Jak recognition, activation, and specificity in the 
hIL-4R assembly have to be addressed. 
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INTERLEUKIN 5 ENHANCES INTERLEUKIN 2-MEDIATED 
LYMPHOKINE-ACTIVATED KILLER ACTIVITY 

By TOMOKAZU AOKI, HARUHIKO KIKUCHI, SHIN-IGHI MIYATAKE, 
YOSHIFUMI ODA, KOICHI IWASAKI, TOSHIKI YAMASAKI, 
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After 3-6 d of in vitro incubation with IL-2, human PBL and mouse splenic lym- 
phocytes become cytotoxic to a variety of tumor cells, including not only NK-sensitive, 
but also NK-resistant tumor cells (1-3). These IL-2-dependent lymphocytes consist 
of a heterogeneous population of effectors (4-6). They also show non-MHC -restricted 
cytotoxicity, which has been referred to as the lymphokine-activated killer (LAK) 
phenomenon (2). 

On the other hand, murine IL-5 (mIL-5) was previously described as a T 
cell-replacing factor (TRF) (7, 8) and as a B cell growth factor II (9) based on its 
induction of proliferation and Ig secretion by activated B cells. This lymphokine 
has been shown to exert several other effects, including the induction of growth and 
differentiation of eosinophils from bone marrow stem cells (10), the induction of 
differentiation of peanut agglutinin-positive (PNA*) thymocytes into CTL (11), and 
the induction of mIL-2-R expression on both splenic B cells and PNA*^ thymocytes 
(12, 13). 

In this study, we examined the effect of mIL-5 on the in vitro induction of LAK 
activity as a basic study for modified adoptive immunotherapy. 

Materials and Methods 

Mice. Male C57BL/6 mice, 6-8 wk old, were supplied by the Shizuoka Animal Center 
(Shizouka, Japan). 

Tumors. 203 glioma is a malignant glioma cell line originally induced by 20-methylchol' 
antherene from the C57BLy6 strain. The tumor cells were maintained in a monolayer form 
in tissue culture with Dxilbecco's modified MEM (Nissui, Japan) supplemented with 10% 
heat -inactivated FXI^S (Gibco Laboratories, Grand Island, NY) and antibiotics (penicillin G, 
200 U/ml; streptomycin sulfate, 50 ^ig/ml). Methylcholanthcrene-induced mastocytoma (P815)y 
originating from DBA/2 mice, and Moloney leukemia virus-induced T cell lymphoma (Y\C-l)f 
originating from A/Sn mice, were maintained in RPMI 1640 (Nissui, Japan) with 10% heat- 
inactivated FCS and antibiotics. 

Lymphokines. Human rIL-2 (TGP-3; 4.2 x 10^ U/mg protein) was kindly supplied by 
Takeda Chemical Industry, Ltd., Osaka, Japan. 

rmIL-5 was the culture supernatant of X63>Ag8-653 myeloma cells transfected with mIL-5 
complementary DNA (14, 15). Culture supernatant (SN) (1,200 U/ml) was filtered and ali- 
quots were stored at -70**C. Activity was determined by polyclonal IgM plaque-forming 
cell (PFC) assay using TRF-responding BCL] cells (16). This culture SN contained no IL-1, 
IL-2, IL-3, IL-4, IL-6. IL-7, or CSF (data not shown). Control culture SN was collected 
from the original myeloma cell (X63-Ag8-653). 

J. Exp. Mexx <D The Rockefdlcr University Press • 0022-1007/89/08/0583/06 $2.00 583 
Volume 170 August 1989 583*588 



584 AOKI ET AL. BRIEF DEFINITIVE REPORT 

GenenUion of LAK Celis. Spleens were removed aseptically and crushed with the hub of 
a syringe in RPMI 1640. Spleen cells were incubated with mIL-5 at a concentration of 2 
X 10^ ceils/ml for 3 d in the presence of IL-2 in RPMI 1640 with 10% heat-inactivated FCS 
and antibiotics. The culture was performed in 5-ml wells of a 6-well |^tc (Corning Glass 
Works, Coming, NY). The cells were harvested, washed twice, and used as effectors. 

Cytotoxicity Assay. A 4-h '*Cr release assay in round-bottomed microtiter plates (Coming 
Glass Works) was used to measure cytotoxicity. Target cells were labeled with 0.1 mCi ^*Cr 
(Amersham Corp., Arlington Heights, XL) for 60 min at Sy^^C. Effectors were coincubated 
with the targets at various E/T ratios for 4 h at 37**C in humidified air with 5% CO2. The 
radioactive SNs were harvested and counted with a gamma counter The cytotoxic activity 
was calculated from the average of triplicate cultures as: percent of cytolysis « 100 x (ex- 
perimental release -spontaneous release)/(maximum release - spontaneous release). Sponta- 
neous release was determined by adding culture medium only to the target cells and max- 
imum release was counted in detergent. All determinadons in each experiment were performed 
in triplicate. 



Results 

The Effect qfrlL-5 on the Induction of LAK Ceils. Freshly isolated spleen cells were 
incubated with IL-2, rIL-5, or a combination of IL-2 and rIL-5. As shown in Fig. 1, 
rIL-5 enhanced IL-2-induced LAK activity, although rIL-5 alone induced no cyto- 
toxic activity. Control SN obtained from X63-Ag8*653 myeloma cells did not show 
such enhancement (data not shown). Freshly isolated spleen cells showed slight cy- 
totoxicity against YAC-1» but little cytotoxicity against P815 and 203-glioma (data 
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rIL-5 enhances r I L'2 •mediated LAK activity C57BL/6 mouse splenocytes were cut- 
X 10^ cells/ml in 5 ml of complete medium containing rIL-2 (1 U/ml) and/or rIL-5 



Figure 1. 
tared at 2 

(50 U/ml). After 3 d, effectors were tested for cytotoxicity against '^Cr-labeled tumor cells in 
a 4*h assay. Control values represent the killer activity of effSectors cultured in complete medium 
without any lymphokines. Results are presented as the means of at least three experiments. 
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not shown). Spleen cells cultured without lymphokines for 3 d were ineffective in 
lysing YAC-1, P815, and 203-^ioma. 

The Effect ofrIL'5 on the Induction of LAK Activity Depends on the Concentration of IL-2. 
IL'2 mediated LAK cell activity is also dependent on IL-5 concentration. To deter- 
mine the optimal concentration of IL-2 or rIL-5 to obtain the maximum cytotoxic 
activity* double titration experiments were carried out. In Fig. 2, these killer activi- 
ties show dose dependency at IL-2 concentrations from 0.01 to 10 U/ml and at rIL-5 
concentrations from 0.01 to 100 U/ml. 

The Effects of Various Time Schedules of Administration of IL-2 and rlL-S on the Induction 
of LAK Activity, As shown in Fig. 3, rIL-5 alone did not show killer activity when 
IL-2 was not present from the onset of culture (Fig. 3, j and k). If IL-2 was present 
from the onset of culture, however, rIL-5 enhanced LAK activity, even when added 
24-48 h after culture initiation (Fig. 3, h and i). Finally, the presence of IL-2 for 
the first 24 or 48 h and rIL-5 during the last 48 or 24 h, respectively, did not enhance 
LAK activity (Fig. 3, d and e). These results suggest that rIL-5 acts on the late stage 
of the LAK induction in the presence of IL-2. 

Discussion 

The present study demonstrates that mIL-5 enhances the LAK activity in the 
presence of IL-2, though mIL-5 alone can not induce killer activity (Fig. 1). We 
also found that in vitro the cell numbers are greater from cultures containing IL-2 
plus mIL-5 than those containing IL-2 alone (data not shown). As mIL-5 induces 
the expression of IL-2-R on both splenic B cells and PNA* thymocytes (12, 13), it 
may also induce the expression of IL-2-R on LAK cells. Although mIL-5 alone can 




JL-5 conoantraUon (U) L-2 concentration (U) 

Effector/ T«r9»t C«0 Rate « 20 : 1 

FiouRE 2. Generation of LAK activity. Dose-response cytotoxic activity by rILrS and rIL-2. 
C57BL/6 mouse splenocytes were cultured at various concentrations of rIL-5 and rIL-2 for 3 d. 
Cytotoxic activities against Y\C-1, P815, and 203-Glioroa were measured at an E/T ratio of 201 1 . 
All determinations in each experiment were performed in triplicate. 



t 



586 



AOKI ET AL. BRIEF DEHNITIVE REPORT 





100 




90 




80 




70 




60 






i 


50 




40 




30 




20 




10 





1 



day 

3 

2 
1 



J 



X 

b 



1 f i = r 

. A ^ ^ A A X 



Figure 3. Effect of various time sched- 
ules of administration of rIL-5 and rIL-2 
on LAK induction. C57BIV6 mouse spleno- 
cytes were cultured. rIL-5 (25 U/nU) or rIL- 
2 (1 U/ml) was present in the culture for 
limited periods as indicated. After 3 d, cy^ 
totoxic activities against Y\C-1 and P825 
were measured at an E/T ratio of 201 1 . All 
determinations in each experiment were 

performed in triplicate. ( ) IL-2; 

( ) iL-5. 
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not induce killer activity, some mechanisms involving IL-5*R probably may aug- 
ment IL-2 signal transduction {>athways. 

mIL-5 may act on the late stage of LAK induction in the presence of IL-2. As 
shown in Fig. 3, r, h, and i, if IL-2 is present from the onset of culture, mIL-5 en- 
hances LAK activity, even when mIL'5 is added at the late stage of culture. If mIL-5 
is present from the onset of culture, however, killer activity is not so enhanced, even 
when IL-2 is added at the late stage (Fig. 3,j and k), Hence^ it might be possible 
that IL-2 activates precursor cells in the first stage, and mIL-5 augments the acti- 
vated precursor cells. Moreover, this effect cannot take place in the absence of IL-2 
(Fig. 3, /and g). 

Mule et al. (17) found that mIL-4 alone induces killer activity as well as IL-2- 
mediated kiUer activity. mIL*5, on the other hand, enhances IL-2-mediated killer 
activity, but mIL-5 alone does not induce the killer activity. These results suggest 
that the mechanism by which LAK activity is augmented by mIL-5 is different from 
that of mIL-4. 

Unlike IL-2, mIL-4 has little capacity to generate LAK activity from mouse PBL, 
whereas both lymphokines induce this activity from splenocytes (17). Whether or 
not mIL-5 and IL-2 similarly generate LAK activity from mouse PBL remains to 
be determined. 

Hergen et al. (18) found that human IL-4 blocks IL-2-generated LAK activity. 
The reason for the discrepancy between human and mouse IL-4 is unclear. Apart 
from the possibility that the ability of IL-4 to block the induction of LAK activity 
is species restricted, it might be possible that mouse spleen cells contain already acti- 
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vated precursor cells. Hergen et al. (18) found IL-4 inhibits IL-2 -induced LAK ac- 
tivity in spleen ceUs. 

It would be interesting, for the purpose of adoptive immunotherapy; to see whether 
human IL-5 also enhances IL-2*induced LAK cell activity in PBL or not. 

Summary 

IL-5 expresses various biologic effects on several types of lymphocytes, including 
B cells, eosinophils, and T ceUs. We demonstrated that the incubation of resting 
splenocytes from C57BL/6 mice in murine rIL-5 enhances IL-2-mediated lymphokine- 
activated killer (LAK) activity against various tumor ceUs. IL-5 alone, however, does 
not induce killer activity. IL-2-mediated LAK activity increases in proportion to 
the dose of IL-5. During the late phase of the culture period, . IL-5 seems to have 
some effect on the induction of IL-2-mediated LAK activity. We expect that IL-5 
will prove useful for adoptive immunotherapy. 

Received for pubiuaiwn 24 March J 989 and in revised form 4 May 1969, 
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Interleukin-12 Is Chemotactic for Natural Killer Cells and Stimulates Their 
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W« invMtlgatacI the chemotactic activity of inteiioukin (IL)- 
12 on human natural killer (NK) celle and other leukocyte 
eubaete. It wee found that IL-12 Induced directional migra- 
tion of highly enriched preparations of NK cells 080% COIG" 
and CD56^) and CD3-activated T cells (both of 004 and CDS 
subset), but not resting T ceils and monocytes. On the con- 
trary, purified polymorphonuclear cells (PMN) showed sig- 
niflcBiit and reproducible chemotactic response to IL-12. The 
effects of IL-12 on leukocyte migration were observed in a 
nan-ow concentration range with a peak at approximately 
7.5 ng/mL, and were abrogated by monoclonal antibody 
(MoAb) anti-IL-12 or after cytokine boiling. We also investl- 

NATURAL KILLER (NK) cells comprise 5% to 10% of 
blood mononuclear cells and are characterized by the 
typical morphology of large granular lymphocytes (LGLs) 
and the CD3"CDi6^CD56* phenotype. NK cells are cyto- 
toxic without prior sensitization for a wide range of target 
cells* including some microorganisms and virus-infected and 
transfonned cells. In addition to cytotoxicity, NK ceils se- 
Crete cytokines and have a central role in the regulation of 
the immune response and hematopoiests.''^ Mf>st NK cells 
are found mainly in blood and spleen, but they are present 
also in the lung and intestinal mucosa, and accumulate in 
liver parenchyma on cytokine administration in mice.^ In 
certain pathologic conditions, NK cells accumulate at extra- 
hematic sites following viral and bacterial infections,^*^ and 
are found in the allograft infiltrate during the early phase of 
rejection.* 

The mechanisms underlying NK-cell recruitment from 
blood vessels to tissues have not been fully elucidated. We 
and others have reported that NK cells are rapid mobile 
cells during in vitro chemotaxis assays,^ '" express adhesion 
molecules, which can recognize ligands on resting and in- 
flammed endothelium' '-'^ or extracellular matrix proteins,'^*'^ 
and are able to actively transmigrate across endothelial 
monolayers.'* 

These functions are modulated by cytokines. Interleukin- 
2 (IL-2), interferon (IFN)-7 and tumor necrosis factor 
(TNF)-a stimulate their in vitro locomotor ability, and re- . 
- cenUy it was reported that IL-8 is chemokinetic for 11^2- 
activated adherent NK cells. ^^^^ The adhesive interaction 
of NK cells to in vitro-cultured endothelial cells (EC) is 
upregulated by IL-2,*^''^ while IL-4 partiy inhibited their 
binding."* 

In the present work, we studied IL-12, a cytokine produced 
by B lymphocytes and monocytes in response to lipopolysac- 
charide (LPS) and bacterial products, recentiy cloned and 
characterized.^''^ IL-12 has important functions in the regu- 
lation and amplification of the immune response and in- 
flanmiation.^ IL-12 regulates the cytotoxic activity, prolifer- 
ation, and cytokine production of NK and T cells,^'""*^* but 
no information is available on modulation of cell adherence 
or chemotaxis. 

Our results show that IL-12 is chemotactic for NK cells 
and increases their in vitro binding to and killing of EC. 



gated the Interaction of NK cells with vascular endothelium 
In vitro. Overnight treatnwnt of NK cells with IL-12 aug* 
mented their binding to cultured endothelial cells (EC) ot»* 
talned from umbilical veins. IL-12-lncreased binding was 
better observed when resting rather than IL-1 -activated EC 
were used as substratum of adhesion, lt-12-augmented 
binding of NK cells to resting or IL-1 -activated EC Invohred 
the U^A-I/ICAM-I, and VLA-4/VCAM-1 pathways. Thus, by 
inducing migration and interaction with EC, IL-12 regulates 
crucial determinants of NK-cell recruitment In tissues. 
O 1994 by The American Society of Hematology, 

Further, IL-12 induced directional migration of CD3-acti- 
vated T lyn^hocytes and polymorphonuclear cells (PMN), 
but not resting T cells or monocytes. 

MATERIALS AND METHODS 

Culture media and reagents. The following reagents were used 
for separation of effector cells, cell culture, and experimental assays: 
pyrogcn-free saline for clinical use and distilled water (Bieffe, Son- 
drio, Italy); medium RPMI 1640 (lOx concentrated; Biochrom KG, 
Berlin. Germany); medium 199 (GIBCO. Paisley, UK); glutamine 
(GIBCO); peniciUin and streptomycin (GIBCO); and aseptically col- 
lected fetal calf senim (PCS; Hyclonc, Logan. UT). The routinely 
used tissue culture medium was RPMI 1640 with 2 mmol/L gluta- 
mine, 50 mg/mL gentamicin, and 10% FCS, which is hereafter re- 
fened to as complete medium. 

All reagents were checked for endotoxin coataminalion by the 
Limulus amebocyte lysate assay (Microbiologica] Associates, Walk- 
ersville. MD) with a sensitivity of 0.02 to O.OS ng/mL Escherichia 
call Westphal LPSs. Sera were tested after 1:3 dilution and heating 
at 100^. All reagents were negative for eiKtotoxin contaminadon. 
LPS {E coll 0.55 :B5) was purchased from Difco (Detroit, MI). 

Recombinant C5a (a kind gift from Or V. Showell, Pfizer Control 
Research. Groton, CT) was used as reference chemoattractant for 
the different leukocyte populations. IL-12 was a kind gift of Genetics 
Institute, Cambridge. MA. IL-2 was a kind gift of Roussel Uclaf, 
Paris, France. 
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Preparation of leukocyte populations. Leukocytes were obtained 
from buffy coats of blood donors from normal volunteers through 
the courtesy of Centro Transfusionale, Ospedale Sacco, Milan, Italy, 
by centrifugation on FicoU-Hypaque gradient. PMN were isolated 
as described.^^ Briefly, cells from the pellet of the Ficoll-Hypaque 
gradient (Biochrom KG. Berlin, Germany) were lesuspended in 
isoosmotic (285 mOsm) complete medium, layered onto 62% isoos- 
motic Percoll gradient (diluted with complete medium), and spun at 
450^ for 20 minutes. Cells were collected at the medium/PercoU 
interface. The purity of PMN preparations was more than 95%, as 
assessed morphologically on Giemsa-stained cytopreps. Monocytes 
were separated on 46% Percoll gradients as described.^* Preparations 
were usually more than 85% pure as assessed by posttivity with 
monoclonal antibody (MoAb) anti-CD 14. 

Highly enriched NK-cell populations were prepared by discontin- 
uous (47%, 49%, and 52%) Percoll gradients as previously de- 
scribed,' Low-density cells were further depleted of contaminating 
T lymphocytes by panning with MoAb anti-CD6 (50 ng/mL).'-^ Cells 
from NK-enriched fractions were added to the dishes and incubated 
at 4*C for 1 hour. The nonadherent cells were then gently poured 
off. The resulting NK populations were less than 3% CD3'^ and 
more than 80% CD16* and CD56*. 

T lymphocytes were obtained from the high-density fractions of 
discondnuous Percoll gradients and further purified by positive pan- 
ning on CD6-coated petri dishes. In some experiments, T cells were 
separated in CD4 and CDS subsets by panning. For activation with 
anti-CD3, T cells were treated with a saturating amount of anti-CD3 
MoAb and incubated on a plastic Petri dish previously coated with 
affinity-purified goat-anti mouse IgG (10 p.'^TtlA Sigma Chemical 
Co, St Louis, MO) for 1 hour at 4''C. Nonadherent cells were gently 
poured off and cell incubation was continued at 37°C for an addi- 
tional 4 hours. 

Migration assay. Cell migration was evaluated using a micro- 
chamber technique" as previously described for monocytes and 
PMN," and for NK cells and T cells/ Twenty-five ± I /iL chemoat- 
tractant diluted in RPM! with 1% PCS was seeded in the lower 
compartment of the chemotaxis chamber and 50 /iL cell suspension 
(L5 X lO^/mL monocytes in peripheral blood mononuclear cells 
(PBMC) or PMN or 2 x lO^mL LGL or T lymphocytes) were 
seeded in the upper compartment. The two compartments were sepa- 
rated by 8-^m pore size nitrocellulose or polyvinylpyrrolidone-free 
polycarbonate membranes (Nucleopore Corp, Pleasanton, CA). 
Chambers were incubated at 37°C in air with 5% CO2 for 1 .5 hours 
(monocytes), 2 hours (PMN), 4 hours (LGL), or 3 hours (T cells). 
At the end of the incubation, nitrocellulose filters were removed, 
fixed, stained, and dehydrated using standard histologic methods. 
Five fields were examined for each sample. Migration was expressed 
as the distance (in microns) migrated by the two leading cells. Ex- 
pected values of migration, assuming no chemotactic response, were 
calculated as described by Zigmond and Hirsch.^ Polycarbonate 
filters were fixed and stained with Diff-Quik (Baxter, DUdinzen, 
Germany) and five high-power oil fields were counted. Migration 
was expressed as the number of cells migrated across the filter. 

Preparation of EC. Human EC were obtained from umbilical 
vein and cultured as previously described.^' Routinely, we used con- 
fluent cells (10^/2 cm' culture well) between the first and fourth 
passage maintained in 199 medium with 20% bovine serum 
(Hyclone), supplemented with EC growth supplement (50 A^g/mL; 
Collaborative Research Inc, Lexington, MA) and heparin (100 ^g/ 
mL; Sigma). The purity of EC cultures was checked by expression 
of von Willebrand factor and found to be more than 99% positive. 

Antibodies, The following MoAbs were used: anti-CD 18 (clone 
TSl/18), anti-CDlla (clone TSl/22), and anti-CD6 (clone 3Pt12) 
purchased from ATCC (RockvtUe, MD); anti-CDl la (clone NKI- 
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LI 6) kindly obtained from Dr C Figdor (Nijmegen, The Nether- 
lands); anti-CDl lb (clone 44a) kindly obtained from Dr R. Todd 
(Ann Arbor, Ml); anti-CDl Ic (clone L29) (Becton Dickinson. 
Mountain View, CA) kind gift of Dr L. Lanier (DNAX. Palo Alto, 
CA); anti-VLA4 (clone HP2/ 1) kindly obtained from Dr F. Sanchez- 
Madrid, (University of Madrid, Madrid, Spain); anti-CD3 (clone 
OKT3) kind gift of Dr S. Ferrini (1ST, Genova. Italy); anti-CD 16 
(clone B73.1) and anti-IL-12. kind gifLs of Dr G. Trinchieri (Wistar 
Institute. Philadelphia, PA); anti-CD56 (clone N90I) kind gift of Dr 
J. Griffin (Dana Farber, Boston, MA); and anti-CDI4 (clone 
UCHMl ) kind gift of Dr P. Beverly (University College and Middle 
Sex School of Medicine, London. UK). In the adhesion assays, 
MoAbs were used at a previously determined optimal concentra- 
tion.'- usually 1/100 for ascites or diluted 1/5 to 10 for hybridoma 
supematants. NK cells were pretreated for 15 minutes at room tem- 
perature with MoAbs and then plated (without washing out the 
MoAb) in the adhesion assay. In each experiment, isotype-matched 
antibodies were used as negative control. 

Adhesion assay. Adhesion of NK cells to EC was studied as 
described previously.'' EC were grown to confluence in flat-bot- 
tomed 96-well trays and, in some experiments, were activated for 
20 hours with 10 ng/mL of IL-I^ (Domp^, L*Aquila Italy). 

NK-cell preparations were resuspended at 2 x 10^/mL in complete 
medium and incubated overnight with the stimulating cytokine. Con- 
trol NK cells were incubated in medium alone. At the end of the 
culture period, cells were washed with saline and resuspended in 
complete medium at lOVmL for labeling with 100 /^Ci ^'Cr (sodium 
chromate; Amersham, Bucks. UK) at 3r*C for 1 hour. Cells were 
then washed with medium and resuspended at 10^/mL, 0.1 mL was 
dispensed to each 96-well tray and incubated for 30 minutes at 37**C. 
At the end of the incubation, the wells were carefully washed three 
times with phosphate>buffered saline (PBS) plus 1% fetal bovine 
serum; adherent cells were solubiliaied with 0.2 mL of 0.025 mol/L 
NaOH and 0.1% sodium dodecyl sulfate (SDS), and radioactivity 
was counted in a gamma counter. Results are presented as the per- 
centage of adherent cells ± SD, with 3 to 6 replicates per group. 
Statistical analysis was performed by the Mann-Whitney lest. 

Cytotoxicity assay. The cytotoxic assay was performed as de- 
scribed.-' EC were labeled with 50 pL ^'Cr (Amersham) for I hour, 
and 5 X 10-^ EC were cocultured with a different concentration of 
NK cells previously treated with IL-2 or IL-12 for 24 hours. After 
a 4-hour incubation, supematants were harvested and counted in a 
gamma counter for determination of isotope release. Results are 
presented as percent specific lysis (mean ± SD) of three replicates 
per group. 

RESULTS 

Chemotactic activity of IL-12. In the first series of exper- 
iments, we tested the chemotactic activity of IL-12 on puri- 
fied NK cells and for comparison on other leukocyte popu- 
lations. 

Figure lA shoves a representative dose/response experi- 
ment. A chemotactic effect was observed in a narrow dose 
range, with a peak at 7.5 ng/mL of IL-12; the curve was 
typically bell-shaped, as for many chemoattractants. At the 
optimal concentration of 7.5 ng/mL, IL-12 was as active as 
an optimal concentration (50 ng/mL) of recombinant C5a. 
This result was confirmed in eight experiments with different 
blood donors. When an MoAb anti- IL-12 or boiled IL-12 
were added in the assay, no migration occurred, confirming 
the specificity of the response (Fig IB). This enhanced loco- 
motor activity of NK cells was chemouctic rather than 
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Hg 1. ChemoftBCtie Offfect of IL-12 on highly anrichod human NK- 
call praparatfons. (A> Doaa/raaponaa oi IL-12. Raeombinant C5a (50 
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migration in rasponaa to IL-12 aJona or IL-12 In the pnsane& oil an 
irralavant MoAb was considarad as 100%. 



chemokinetic, as demonstrated in checkerboard experiments, 
although a chemotactic component was also present (Table 
I). Purified populations of T lymphocytes (>98% CD3*) 
responded to IL-12 only when appropriately stimulated with 
anii-CD3 MoAb. Resting T cells (Fig 2A) or T cells incu- 
bated with an irrelevant MoAb (data not shown) did not 
respond. IL-12 was as potent as recombinant C5a in inducing 
T-cell migration. We also investigated whether the CEM and 
CDS subset differed in their chemotactic response to IL-12. 
Purified preparadons of CD4* and CDS'** T cells, preacti- 
vated with MoAb anti-CD3, showed similar chemotactic ac- 
tivity (Fig 2B). 

While monocytes did not show increased locomotor activ- 
ity in response to IL- 1 2 (data not shown), surprisingly, puri- 
fied preparations of PMN showed a significant and reproduc- 



ible chemotactic response to IL-12, with a peak effect at 5 
ng/mL. Figure 3A shows a typical experiment representative 
of seven performed. IL-12 activity on PMN was blocked by 
anti -11^ 1 2 MoAb, and the heat-treated cytokine was ineffec- 
tive (Fig 3B). A checkerboard analysis was also performed 
for PMN, and it was found that IL-12 is chemotactic, al- 
though a small chemokinetic effect was also evident (data 
not shown). 

!L-12 stimulates adhesion and cytotoxicity ofNK cells to 
vascular endothelium. The effect of IL- 1 2 was also studied 
on the ability of NK cells to adhere in vitro to monolayers 
of cultured human EC. IL-12 increased NK-cell binding to 
resting, as well as to IL- 1 -activated EC, as shown in the 
representative experiment in Fig 4, although IL-2, used in 
parallel as an NK-cell -activating cytokine, was a stronger 
stimulus compared with IL-12, This IL-12- increased bind- 
ing was better observed when resting rather than activated 
EC were used as substratum for adhesion. In a large series 
of experiments, the estimated increase in NK-ccU binding 
induced by IL-12 was 68% ± 10% and 33% ± 3% on resting 
and IL-1 -activated EC, respectively (mean ± SE of 15 ex- 
periments, P < ,05). 

As shown in Fig 5, a small increase in NK-cell binding 
to EC was already observed with 5 ng/mL IL-12, and the 
peak effect was at 10 ng/mL. These concentrations were alsc 
the most effective in parallel experiments of tumor cytotoxic- 
ity against the K562 cell line (Fig 5). The effect of IL-12 
required an incubation time of more than 6 hours, and was 
optimal after an overnight incubation (not shown). No in- 
crease in NK-cell binding was observed when EC were stim- 
ulated for 24 hours with IL-12 (data not shown). 

As IL-]2--induced stimulation of binding to EC requirec 
a relatively long exposure, we questioned whether a putative 
factor secreted by IhlK cells was, in fact, responsible for the 
increased binding. IL-12 induces the production of severa! 
cytokines by NK cells, eg, IFN7 and TNFa,^^ These cyto- 
kines have stimulating effects on NK-cell function, but d^ 
not increase their binding ability to EC.^ Moreover, MoAbs 
against TNFa and IFNr did not decrease NK-cell binding 
(data not shown). AUo, the supernatant of NK cells culturec 
overnight with IL-12 did not stimulate NK-cell binding (no- 
shown). These experiments suggested that IL-12 acts directl) 
on the adhesive capacity of NK cells. 

IL-2-activated NK cells are able to kill EC.^~^^-^' There 
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Different concentrations of tL-12 (ng/mL) were placed in the upper and/or lowe 
compartments of the chemotaxis chamber. Migration is expressed as leadini 
front distance (pm ± SO). Numbers in parentheses are the expected values o 
migration calculated according to the method of Zlgmond and HIrsch,* assuminf 
no chemotactic response. 

* P < .05 V migration to control medium (above and below the filter). 



From bloodjournal.hematologylibrary.org by guest on April 12, 201 1. For personal use only. 



2264 



A 

15 -r— 

m 




IL-12 (ng/ml) 



ALLAVENA ET AL 

evident, as demonstrated by the fact that anti— VLA-4 MoAb 
used in concert with anti-CD18 gave a greater inhibition 
compared with ami -CD 18 alone (Fig 7). 

The observation that the greatest effect of IL- 1 2 was seen 
on resting EC, where most binding is mediated by CDl la/ 
GDI 8, caused us to check if this augmented functional activ- 
ity was associated with an augmented expression of 02 inte- 
grins. 

In several experiments, we were unable to demonstrate a 
consistent increased expression of 02 integrins after incuba- 
tion with IL-12, using MoAb anti-CDlla (clone TSI/22) 
and anti-CD 1 8 (clone TSl/18). When we used the MoAb 
NKI-L 1 6, which recognizes an activation epitope of 
CDUa,'*'^ we did seel an increased expression of NKI-L 16 
epitope after IL- 12 stimulation (Fig 8). As MoAb anti-NKI- 
LI 6 stained already more than 80% of the cells, increased 
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Rg 2. Chemotactie •ffact of IL-12 on purified T lymphocyto*. (A) 
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oells. Rasulte are axprosaad aa mlgratad dlslanca C|mi) of the two 
loading calls. (B) T cells ware separated tn CD4* and CD8^ calls by 
panning and stimulated wHh anti-C03 before the assay. Chemotactic 
response to rC5a was 12 ± 2 (fun) for unaeparated (total) T cells. 12 
± 2 for CD4\ and 9 ± 2 for CD8^. 



fore, we investigated whether the increased binding of IL- 
12-activated NK cells to EC also resulted in increased cyto- 
toxicity. As shown in Fig 6, unstimulated peripheral blood 
lymphocytes (PEL) did not lyse EC; after overnight exposure 
to IL-12) a significant increase of lysis was detected, al- 
though lower than that obtained after IL-2 stimulation. 

Characterization of the adhesive molecules. In previous 
reports, we characterized the adhesive molecules involved 
in NK-cell binding to EC.'^ Most binding of resting and IL- 
l2-activated NK cells on resting EC was inhibited by anti- 
CD 11 a/CD 18 MoAb, whereas MoAb anti-CDllb and 
CDllc had no effect (Fig 7). When EC were preactivated 
with IL- 1 , a clear role for the VLA-4/VCAM- 1 pathway was 
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expression is detected by mean channel fluorescence (eg, 
from 587 to 639). This result was confirmed in two additional 
experiments in which IL>12-treated cells (24 hours) showed 
an increase in fluorescence intensity of 32 and 40 channels, 
respectively. 

DISCUSSION 

The chemotactic activity of NK cells is of particular inter- 
est for the immunobiology of these cells* as only few known 
cytokines were reported to have modulatory effects on their 
locomotor ability. IFN7 and ILr2 have chemokinetic ef- 
fects'-** and IL-8 has chemokinetic activity on long-term 
cultured IL-2 -activated adherent NK cells, but the effect of 
this cytokine on resting NK cells was not tested.'* The fact 
that IL-12 induces directional migration of freshly isolated 
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resting NK cells describes a new function for IL-12 and 
identifies a novel cheinoattractant for this cell population. 

In parallel experiments, it was foimd that CD3-activated T 
cells and PMN responded to a gradient of IL-12. The loco- 
motor ability of T lymphocytes has recently received much 
attention after the reports that some members of the newly 
described chemokine family are active on T cells."**'^' Mac- 
rophage inflammatory protein lor (MlPla) was reported to 
preferentially chemoattract CD3-aclivated CD8^ T cells, 
while MIPl/? preferentially attracted CD4^ T cells. 
RANTES was active on unstimulated CD4'^C]>45RO'^ mem- 
ory T cells and stimulated CD4* and CD8* lymphocytes. 
Finally, interferon-inducible protein 10 chemoattracted both 
stimulated CEM* and CD8* lymphocytes. In this smdy. IL- 
12 induced a chemotactic response only in CD3-activated, 
CD4^, and CDS'^ lymphocytes, indicating that the state of 
cell activation might be crucial for their recruitment in vivo. 

IL-12 was also chemotactic for PMN and chemotacdc 
activity was abrogated by and -IL-12 MoAb and by boiling 
the cytokine, thus excluding a nonspecific effect. Further- 
more, the supernatant from mononuclear cells or punfled 
PMN cultured for 2 hours with IL-12 (the length of the 
chemotaxis assay) was not active in inducing directional 
migration of freshly isolated PMN, suggesting that IL-12 is 
woiicing directly, and not by inducing the secretion of a 
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uaa ara the mean ± SO (four rapllcataa) from one of three similar 
expariments. 



second factor (data not shown). We failed to observe moduJa- 
tion of other functions of PMN by IL-12, including expression 
of mRNA levels for IL-6, IL-l/^, IL-I receptor antagonist 
(IRA), IL-1 receptor type I and type H, c-fos, c-Jun, and c- 
myc. The significaiice of this selective effect of IL- 12 on mature 
myelocytes (mcmocyties were unaf^ted) xemains to be defined. 

We and others have reported that NK cells bind to EC 
and that this interaction is modulated by cytokines. It 
was, therefore, of interest to study the action of IL-12 at this 
level. IL-12 augmented the number of NK cells that bound 
to resting EC (68%) and activated EC (33%). This effect of 
IL-12 showed several differences compared with the effect 
induced by IL-2, a prototypic NK-cell- stimulating cytokine: 
(1) IL-2 had a more marked effect compared with IL-12; (2) 



the kinetics of IL-12 were slower, as it required overnight 
treatment, while a 1-hour exposure to lL-2 was sufficient to 
give a significant increase in NK-celJ binding''; and (3) IL- 
2-augmented binding was inhibited by IL-4.^* while the 
latter cytokine does not inhibit the effect of IL-12 (data not 
shown). IL-4 was also ineffective in inhibiting the IL-I2- 
augmented NK-cell cytotoxicity.-^^ It is of interest that IL- 
4-rrealed monocytes showed a decreased secretion of IL- 
12.'*" One of the mechanism.s through which IL-12 might 
mediate effects on NK cells involves the adhesion molecules 
critical for the interaction between NK and EC. Robertson 
et al'^'' have reported that IL-12 augments some adhesion 
molecules on NK cells, including CD! la, CD54, and CDS6, 
and this could explain the increased cytotoxicity. Also, Rabi- 
novich et al"*' have recently reported augmented expression 
of P2 inlegrin on IL-12- activated NK cells.*' However, ihi5> 
increased expression was observed after a relatively long 
incubation with IL-12 (6 days* culture), while we detected 
NK-ceJl augmentation of tumor killing and EC binding as 
early as 24 hours after incubation. Moreover, the effect on 
chemotaxis was readily observed in a 2-hour assay. In this 
study, we tested the expression of 02 integrins by fluores- 
cence-aclivaled cell sorter (FACS) with the most widely 
used anti-CD I la/CDI8 MoAb, and did not find a consistent 
modification of expression, despite increased binding to EC. 
However, we did find an increased expression of an acti- 
vation epitope of LFA-1 recognized by MoAb NKI-LI6.'^ 
Rabinovich et al'*' have reported that IL-12 did not induce 
NK-ccIl binding to fibroneclin and laminin-coated plastic. 
Moreover, ot40\ integrin expression was not augmented 
after 6 days' incubation with IL-12.*' These results further 
support that the IL-12 -activated NK-cell binding to EC is 
not mediated by modulation of a40l integrin. Thus, some 
evidences concur with the conclusion that IL-12 augments 
the adhesive ability of NK cells by affecting the LFA-1 
molecule. This effect is most likely due to activation of the 
molecule, rather than augmented expression. 
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Fig 8. Expression of CDlla (NKM.16 apitopal by resting (Al and IL-12-activatad (Bl NK cells. Calls ware labeled with anti>MICH.16 mv^tm^ 
diluted 1:100 In RPMI 1640 -i- 10% FCS (complete medium) at room temperature for 30 minutes. All sut>eequent waahinge and labeling wfth 
FrrC-conjugated second antil>odv were performed In complete medium. Cells were analyzed with a FACStar (Bacton Dickinson). NKM.16 
expression by NX cells incubated with medium: BZ% positive cells^ mean channel fluorescence IMCF) 587; NK cells incubated with II.-12 (10 
ng/mL 72 hours), 81% positive cells, MCF 639. 
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In addition to acting as natural effectors, NK cells play 
an important role in the generation of T helper 1 (THl)- 
versus TH2-type responses.^ IL-12 has been shown to di- 
rectly influence the generation of THl versus TH2 cells/^ 
The results presented have suggested that monocyte-derived 
IL-12 may induce or amplify recruitment of NK cells, im- 
portant determinants of THl versus TH2 responses. 
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Interleukin>13 (IL-13) is a cytokine secreted by acti- 
vated T lymphocytes that shares many, but not all. bio- 
logical activities with IL-4. These overlapping activities 
are probably due to the existence of common receptor 
components. Two proteins have been described as con- 
stituents of the IL-4 receptor, a ~140-kDa glycoprotein 
(IL-4R) and the y chain (yc) of the IL-2 receptor, but 
neither of these proteins binds IL-13. We have cloned a 
cDNA encoding an IL-13 binding protein (IL-13R) from 
the Caki-1 human renal carcinoma cell line. The cloned 
cDNA encodes a 380-amino acid protein with two con- 
sensus patterns characteristic of the hematopoietic cy- 
tokine receptor family and a short cytoplasmic tail. The 
IL-13R shows homology with the IL-5 receptor, and to a 
lesser extent, with the prolactin receptor. COS-7 cells 
transfected with the IL-13R cDNA bind IL-13 with high 
affinity but do not bind IL-4. COS-7 cells co -transfected 
with the cloned IL-13R cDNA and IL-4R cDNA resulted 
in the reconstitution of a small number of receptors that 
recognized both IL-4 and IL-13. Reverse transcription- 
polymerase chain reaction analysis detected the recep- 
tor transcript only in cell lines known to bind IL-13. 



IL-13^ is a cytokine secreted by activated T lymphocytes that 
regulates inflammatory and immune responses (1,2). It shares 
several biological activities with IL-4, another T-cell-derived 
cytokine, in a variety of cell types such as B cells, monocytes, 
fibroblasts, and endothelial cells (for review, see Ref. 3). But 
contrary to IL-4, IL-13 does not regulate T-lymphocyte function 
(4). Recent studies have shown that IL-13 competitively inhib- 
its IL-4 binding to the receptor present on some cell lines, 
whereas on other cell lines, IL-4 binding cannot be displaced by 
IL-13 (5. 6). Similarly. IL-13 binding was either completely or 
partially displaced by IL-4, depending on the cell line used as 
source of receptors (6, 7). It has also been shown that a mutated 
IL-4 (8) or anti-IL-4R antibodies that block the biological ac- 
tivity of IL-4 also antagonize IL-13 activity (5, 9-11) and that 
the signaling events induced by IL-4 and IL-13 are similar in 
the cells that respond to both cytokines (10, 12, 13). Together, 
these results suggest that IL-4 and IL-13 receptors are not the 
same but that they share receptor components. 

Two proteins have been described as components of the high 
affinity IL-4R complex: IL-4R, a glycoprotein of 140 kDa, 
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which, when expressed in COS cells, binds IL-4 with a of 
50-100 pM (14, 15): and the y chain of the IL-2 receptor (16, 17), 
which, when associated with the IL-4R, results in a 2-3-fold 
increase in affinity for IL-4 and also participates in some of the 
IL-4-mediated signal transduction events (17). Neither of these 
two proteins binds IL-13, nor does the complex of the two 
proteins (6). Thus, other protein (s). part of the IL-4 complex, 
are necessary for the recognition of IL-13. 

Recently, it has been discovered that some human renal 
carcinoma cells express, in addition to receptors shared by IL-4 
and IL-13, a large excess of specific IL-13 receptors (7). In view 
of the importance for molecular cloning of having a cell line 
expressing a large number of IL-13 receptors, we screened a 
panel of human carcinoma cell lines for IL-13 binding. We 
found one. the Caki-1 cell line, that expressed a high number of 
binding sites for IL-13. of which only a small fraction were 
shared with IL-4. We describe here the characterization of the 
IL-13 binding protein {IL-13R) present in these cells, its clon- 
ing, and the characterization of the recombinant protein ex- 
pressed in COS-7 cells. 

MATERIALS AND METHODS 

Growth Factors, Antibodies, and Cells — ^Recombinant IL-13 was pro- 
duced and purified in our laboratory as described previously (2). Human 
IL-4 was obtained from Tebu (Le Perray en Yvelines, France). The 
anti-IL-4 receptor antibodies X2/45 and CDW124 and the lL-4 antago- 
nist hIL-4.Y124D were a generous gift of Dr. Sebald (Wurzburg. Ger- 
many). Caki-1 cells (ATCC HTB 46) were cultured in RPMI 1640 me- 
dium, 20% fetal calf serum. 2 mM glutamine, and penicillin/ 
streptomycin (100 units/ml), at 37 "C in a humidified atmosphere 
containing 5% CO^- 

Binding and Biological Activity Assays — Binding and cross-linking 
experiments were performed as described with *^^I-labeled (Phe*^} 
IL-13-Gly-Tyr-Gly-Tyr (6) or ^^H-IL-A. For the induction of IL-6 secre- 
tion, Caki-1 cells were plated in 24-well plates at a density of 5 x 10"* 
cells/well, and after 3 days of culture, confluent monolayers were 
washed three times with Dulbecco's modified Eagle's medium without 
fetal calf serum. Stimulation of Caki-1 cells was performed with 30 
ng/ml of IL-4 or IL-13 in the absence or in the presence of hIL-4.Y124D 
or an anti-gpl40 monoclonal antibody. The level of IL-6 released into 
the culture media after 24 h incubation was measured by enzyme- 
linked immunosorbent assay (Innotest. Besangon. France). 

cDNA Library Constructions, Isolation ofcDNAs. and Sequence Anal- 
ysis— Total RNA was extracted from Caki-1 cells or tissues as described 
(2). Poly (A) RNA was isolated from total RNA with oligo(dT)25 magnetic 
beads (E^al, Inc.). The Caki-1 size fractionated cDNA library contain- 
ing 2X10^ clones was constructed using the primer-adapter procedure 
(18) and the vector pSE 1 (19). The expression cloning strategy used was 
reported previously (20). The human cDNA sequence for the IL-13R has 
been submitted to the GenBank™/EMBL Data Bank (accession num- 
ber X95302). 

Ceil Transfection and Receptor Characterization — For the functional 
characterization of the recombinant receptor, COS-7 cells were trans- 
fected in 25-cm^ plates using 0.6 ptg of different plasmids. After 24 h 
COS-7 cell monolayers were trypsinized and plated at 8 x lO* cells per 
well in 12-well plates. Three days later, binding and cross-linking 
experiments were performed as described with *^^I-labeled [Phe'^^l 
IL-13-Gly-Tyr-Gly-Tyr (6) or '"I-IL-4. 
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Fig. 1. Characterization of the IL-13R present in Caki-1 cells. 

a, Scatchard analysis {insei^ of the ^^*''I-IL-13 saturation curve, which 
indicated the presence of —76.000 sites/cell with a 440 pM; b, 

binding of *^^I-IL-13 in the presence of increasing concentrations of 
competing IL-13 (•) and IL-4 (O). Bars, S.D. {n = 3); Inset, cross-linl^ing 
experiments using *^^I-IL-13 in the absence {lane a) and in the presence 
of a 100-fold excess of IL-13 {lane or IL-4 {lane c). 

Tissue Distribution — cDNAs were generated from RNA samples with 
reverse transcriptase and a mixture of labeled D-NTP of l^nown specific 
activity for quantification. Then, for each sample. 10 ng of cDNA was 
submitted to PGR using a sense primer corresponding to the sequence 
+ 52 to +71 and an antisense primer corresponding to +489 to +470 
(numbering is based on the cDNA sequence shown in Fig. 3a). PCR- 
amplified products were hybridized with a probe complementary to 
sequence +445 to +461 of the cDNA. 

RESULTS 

Characterization of IL-4 and IL-13 Binding and Activity in 
Caki-1 Cells— The Caki-1 cell line, among several human cell 
lines tested, expressed the highest number of binding sites for 
IL-13. Saturation experiments with labeled IL-13 on Caki-1 
cells showed the presence of one class of binding sites with a 
of 446 ± 50 pM and 7.2 X 10^ receptors/cell (Fig. la). In 
competition experiments, unlabeled IL-13 totally displaced the 
labeled IL-13 in a dose-dependent manner, whereas IL-4 dis- 
placed with high ciffinity —10% of the labeled IL-13. Higher 
concentrations of IL-4 (>100 tim) did not displace the remain- 
ing 90% of the IL-13 binding (Fig. 16). These results are in line 
with the existence of two sites with similar affinity for IL-13, of 
which only one (~10% of the total binding) recognizes IL-4. 
Affinity cross-linking of IL-13 showed a '^70-kDa complex con- 
sistent with the complex observed in similar cross-linking ex- 
periments with IL-13 on several cell types (6). The labeled 
IL-13 was completely displaced from the complex by IL-13 
but not by IL-4, which is in agreement with the competition 
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Fig. 2. Inhibition of the secretion of IL-6 induced by IL-13 and 
IL-4 in the presence of the monoclonal antibody anti-IL-4R and 
the IL-4 antagonist hlL-4.Y124D. Stimulation of Caki-1 cells was 
performed with 30 ng/ml of lL-4 or IL-13 in the absence or in the 
presence of hIL-4.Y124D or an antl-gpl40 monoclonal antibody. The 
level of IL-6 released into the culture media after 24 h incubation was 
determined as described under "Materials and Methods." Each value 
represents the mean ± S.E. of triplicate determinations. 

experiments (Fig. lb). 

We also analyzed the IL-4- or IL-13-induced IL-6 secretion 
from Caki-1 cells. Both cytokines induced the secretion of sim- 
ilar levels of IL-6. and this secretion was inhibited by antibod- 
ies against the IL-4Ra chain and by the IL-4 antagonist hlL- 
4.Y124D (Fig. 2), suggesting that only the shared receptors in 
Caki-1 cells are responsible for the induction of IL-6 secretion. 
Similar results were observed when the IL-4- and IL-13-in- 
duced phosphorylation of the IRS1/4PS was analyzed in the 
presence or the absence of the anti-IL-4R antibodies and the 
IL-4 antagonist (results not shown). 

Cloning of the IL-13 Binding Protein — ^We constructed a 
cDNA library containing 2 X 10^ recombinant clones from 
Caki-1 cells. The library was divided into pools of 1000 cDNAs, 
and plasmid DNA from each pool was introduced into COS-7 
cells. Binding of labeled IL-13 to transfected COS-7 cells was 
used to identify pools of clones encoding an IL-13R. Positive 
pools were partitioned and rescreened until a single clone was 
identified that directed synthesis of a cell surface protein ca- 
pable of binding IL-13. Two independent IL-13 receptor cDNAs 
of identical sequence were finally isolated. The cDNA is 1299 
bases long, excluding the poly (A) tract, and has a short 3'- 
untranslated region of 103 bases. A canonical AATAAA poly- 
adenylation signal is found at the predicted location (Fig. 3a). 
The open reading frame between nucleotides 53 and 1 192 de- 
fines a polypeptide of 380 amino acids. The sequence codes for 
a membrane protein with a putative signal peptide of 26 amino 
acids, a single membrane-spanning domain, and a short cyto- 
plasmic tail. Four sites for potential TV- linked glycosylation are 
located in the extracellular region. Importantly, two consensus 
patterns considered as signatures of the hematopoietic cyto- 
kine receptor family (for review, see Ref. 21) are also found, as 
are four conserved cysteines in the amino-terminal half of the 
extracellular domain and the WSAWS motif located in the 
COOH-terminal region of the extracellular domain (Fig. 36). 
Alignment studies reveal homologies wUh the human IL-5Ra 
chain (51% similarity and 27% identity: Fig. 3c^ and to a lesser 
extent with the prolactin receptor (not shown). 

Distribution of the mRNA for the IL-13R—A ~1.4-kilobase 
IL-13R transcript was detected when the cDNA for the IL-13R 
was labeled with ^^P and used to probe a Northern blot con- 
taining polyadenylated RNA from Caki-1 cells (not shown). 
Surprisingly, in Caki-1 cells, similar amounts of IL-13R and 
IL-4R mRNAs are detected by Northern analysis, although a 
large excess of IL-13R is expressed. This observation suggests 
a higher translatability of this mRNA versus IL-4R transcripts 
and may explain the absence of IL-13R mRNA detection in cell 
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Fig. 3. Nucleotide sequence of the 
IL-13R cDNA and comparison of the 
IL-13R and IL-5R protein sequences, 
a. nucleotide sequence of the IL-13R 
cDNA, The human IL-13R cDNA se- 
quence has been submitted to the Gen- 
Bank'^/EMBL Data Bank (accession 
number X95302). The amino acids corre- 
sponding to the predicted transmembrane 
domain are underlined. Potential AAgly- 
cosylation sites (Asn-A^Ser/Thr) are un- 
derscored. 6, schematic representation of 
the protein. S. predicted signal peptide; C. 
conserved cysteines in the hematopoietic 
cytokine receptor family; W, the WSXWS 
motif; TM, transmembrane domain; CD, 
cytoplasmic domain. The black circles in- 
dicate the potential 7V-glycosylation sites, 
c, amino acid alignment of the IL-13R and 
IL-5R sequences. Human IL-13 and IL-5 
receptor protein sequences are aligned as 
described (27), Cysteine residues and the 
WSXWS motif characteristic of this fam- 
ily of receptors are boxed. 



1 GGTGCCTGTCGOCGGG6AGAGAGGCAATA7CAAGGTTTTAAATCTCOGA6AAATGGCTTT 

MAP 

61 CGTTTGCTTGOCTATCGGATGCTTATATACCTTTCTGATAAGCACAACATTTGGCrrGTAC 

4 VCLAIGCLYTFLISTTFCCT 

121 TTCATCTTCAGACACCGAGATAAAAGTTAACCCTCCTCAGGATTTTGAGATAGTGGATCC 

24 SSSDTBIKVNPPQDFBIVDP 

181 CGGATACTT AGGTTATCTCT ATTTGCAATGGCAACCCCCACTGTCTCTGGATCATTTTAA 

44 GYLGYLYLQWQPPLSLDHFK 

241 GGAATGCACAGTGGAATATGAACTAAAATACCGAAACATTGGTAGTG AAACATGGAAGAC 

64 ECTVEYBLKYRNI6SETHKT 

301 CATCATTACTAAGAATCTACATTACAAAGATGGGTTTGATCTTAACAAGGGCATTGAAGC 

84 IITKNLHYKDGFDLNKGIEA 

361 GAAGATACACAOGCTTTTACCATGGCAATOCACAAATGGATCAGAAGTTCAAAGTTCCTG 

104 KIHTLIiPWQCTMGSBVQSSW 

421 GGCAGAAACTACTTATTGGATATCACCACAAGGAATTCCAGAAACTAAAGTTCAGGATAT 

124 ABTTYHISPQGIPETKVQDM 

461 GGATTGCGTATATTACAATTGGCAATATTTACTCTGTTCTTGGAAACCTGGCATAGGTGT 

144 DCVYYNHQYLLCSWKPGIGV 

541 ACTTCTTGATAOCAATTACAACTTGTTTTACTGGT ATGAGGGCTTGGATCATGCATTACA 

164 LLDTNYNLFYWYEGLDHALQ 

601 GTGTGTTGATTACATC AAGGCTGATGGACAAAATATAGGATGCAGATTTCCCTATTTGGA 

184 CVDYIKADGQNIGCRFPYLB 

661 GGCATCAGACTATAAAGATTTCTAT ATTTGTGTT7JVTGGATCATCAGAGAAC AAGCCTAT 

204 ASDYKDFYICVHGSSENKPI 

721 CAGATCCAGTTATTTCACTTTTCAGCTTCAAAATATAGTTAAACCTTTGCCGCCAGTCTA 

224 RSSYFTFQLQNIVKPLPPVY 

781 TCTTACTTTTACTCGGGAGAGTTCATGTGAAATTAAGCTGAAATGGAGCAT ACCTTTGGG 

244 LTFTRBSSCEIKLKWSIPLG 

841 ACCTATTCCAGCAAGGTGTTTTGATTATGAAATTGAGATCAGAGAAGAT6ATACTACCTT 

264 PIPARCFDYEIEIREDDTTL 

901 GGTGACTGCTACAGTTGAAAATGAAACATACACCTTGAAAACAACAAATGAAACCCGACA 

284 VTATVEKETYTLKTTMETRQ 

961 ATTATGCTTTGTAGTAAGAAGCAAAGTGAATATTTATTGCTCAGATGACGGAATTTGGAG 

304 LCFVVRSKVNIYCSDDGIWS 

1021 TGAGTGGAGTGAT AAACAATGCTGGGAAGGTGAAGACCTATCGAAGAAAACTTTGCTACG 

324 EHSDKQCNBGBDLSKK7IiI.R 

1081 TTTCTGGCTACCATTTGGTTTCATCTTAATATTACTTATATTTGTAACCGGTCTGCTTTT 

344 F w L P F Q P X I I V I r V T g Xi It-JU 

1141 GCGT AAGCCAAACACCTACCCAAAAATGATTCCAGAATTTTTCTGTGATACATGAA6ACT 

364 RKPNTYPKMIPEFFCDT 

1201 TTCCATATCAAGAGACATGGTATTGACTCAACAGTTTCCAGTCATGGCCAAATGTTCAAT 

1261 AT6AGTCTCAATAAACTGAATTTTTCTTGCGAATGTTG 1298 
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MAFVCLAIGCLYTFLISTTFGCTSSSDTEIKVNPPQDFEIVDPGYLGYLY 50 

II I I II I I I I 

. .MIIVAHVLLILLGATEILQADLLPDEKISLLPPVNFTIKVTG.LAQVL 47 



LQWQPPLSLDHFKECTVEYELKYRNIGSETWKTIITKNLHYKDGFDLNKG 100 

II I I III III I I II 

LQWKPNPDQBQ.RNVNLEYQVKINAPKEDDYETRITBS. . .KCVTILHK6 93 



lEAKIHTLLPWQCTNGSEVQSSWABTTYWISPQGIPBTKVQDMdCV . 146 

i II I II I I I I I I ll 

FSASVKTILQ NDHSLLASSWASAE . LHAPPGSPGTSIVNL^yTTNTT 139 



IL13R . . YYNWQ YLL Z 3WKPGIGVLLDTNYNLFYWYEGLDHAL 

I I II II I I I I 

IL5R EDNYSRLRSYQVSLH Z TWLVGTDAPEDTQYFLYYRYGSWTE . 




DYIK 189 

I I 
EYSK 187 



IL13R 
IL5R 



AD.GQNZGp 
I I I 



^P. ^YLEASDYKDFYICVNGSSENKPIRSSYFTFQLQNIV 236 

111 II I II II III 

DTLGRNIAbHFPRTFILSKGRDWLSVLVNGSSKHSAIRPFDQLFALHAID 237 



IL13R KPLPPVYLTFTRESSCEIKLKWSIPLGPIPARCFDYEIEIREDDTTLVTA 286 

II I I II I I I I II i 

IL5R QINPPLNVTAEIEGT.RLSIQWEKPVSAFPIHCFDYEVKIHNTRNGYLQI 286 



IL13R 
IL5R 



TVENETYTLKTTNETRQLCFWRSKVNIYCSDDGl WSEWS DKQCWEGEDL 



lilt 
EKLMTNAFI S 1 1 DDLSK YDVQVRAAVS SMCRE AGlJNSEW^ 



336 

I I 

.PIYVGNDE 335 



IL13R SKKTLLRFWLPFGFILILVIFVTGLLLRKPNTYPKMIP EF 376 

II I II 

IL5R HKPLREWFVIVIMATICFILLILSLICKICHLWIKLFPPIPAPKSNIKDL 385 



IL13R FCDT 380 

I I 

IL5R FVTTNYEKAGSSETEIEVICYIEKPGVETLEDSVF 420 
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Fig, 4. Expression patterns of the IL-13 receptor mRNA. RNA 
was prepared from the following cells: Caki-1 {Jane a). A431 {lane b), 
TF-1 {lane c), U937 {lane di, Jurkat {lane e), and IM9 {lane ^. RNA 
samples were copied with reverse transcriptase and submitted to PGR, 
and amplified products were hybridized with a probe complementary to 
sequence +445 to +461 of the cDNA as described under "Materials and 
Methods." Size markers are indicated at the left of the figure. The 
estimated number of receptors/cell for Caki-1, A431, TF-1, U937, Jur- 
kat. and IM9 cells are «- 60000. 200, 250, 420, <25, and 400 receptors/ 
cell, respectively (6). 
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Fig. 5. Characterization of the recombinant IL-13 receptor. 
COS-7 cells were transfected with the IL-13R cDNA and assayed for 
binding of '^^I-IL-13. Bars, S.D, (/J = 3). Inset, Scatchard analysis of the 
saturation curve, which indicated the presence of ~ 560,000 sites/cell; 

« 250 pM, 

lines expressing low number of IL-13 binding sites. Reverse 
transcription-PCR analysis (Fig. 4) showed that the transcript 
found in Caki-1 cells is also present, at lower levels, in the 
keratinocytic A431 cells, premyeloid TF-1 cells, premonocytic 
U937 cells, and the B-cell line IM9. No transcript was detected 
in the T-cell line Jurkat. These results are in agreement with 
our previously reported binding studies (6) and with the known 
biological targets for IL-13. 

Expression and Characterization of the IL-13 Binding Pro- 
tein— cells transfected with the isolated cDNA encoding 
the IL-13R showed specific binding of labeled IL-13. Scatchard 
analysis of the saturation curve showed a single component site 
with a /C^y value of 250 ± 30 pM and a maximal binding capacity 
of 5.6 X 10^ receptors/cell (Fig. 5). The affinity displayed by the 
recombinant receptor is in good agreement with the K^j value of 
446 pM for the Caki-1 IL-13 receptor and for that described in 
several other cells (6). 

In competition studies. IL-13 was effective in inhibiting the 
labeled IL-13 binding to the cloned receptor, with an inhibitory 
affinity constant {K) of 1.5 ± 0.5 hm. whereas IL-4 did not 
inhibit binding (not shown). Cross-linking experiments with 
labeled IL-13 showed a radioactive band of —70 kDa similar to 
the one obtained with the Caki-l cells, shown in Fig. \b, inset 
The radioactivity was completely displaced by a 100-fold excess 
of IL-13 but not by a 100-fold excess of IL-4 (results not shown), 
in agreement with the binding experiments. Thus, the phar- 
macology and the electrophoretic mobility of the covalently 
cross-linked labeled receptor are those expected from the char- 
acterization of the IL-13R present in Caki-1 cells. 



Table I 

'^^ML-13 binding to COS cells expressing IL-13R. IL-4R. and 
yc alone and in various combinations 

COS cells were transfected with the indicated cDNAs and analyzed 
for ^2^I-IL-13 (320 pM) binding 48 h later, as indicated under "Materials 
and Methods," in the absence or presence of either cold IL-13 (1 17 nM) 
or IL-4 (67 nM). 



cDNAs U'ansfected 



|'25i]iL-l3. 
total bound 



Bound displaced by 



IL-13 



IL-4 





dpnf 


dpm 




Mock 


344 


117 


116 


IL-13R 


22,895 


22.612 


55 


IL-4R 


427 


146 


131 


IL-13R + IL-4R 


21,119 


20.942 


1772 


IL-13R + IL-4R + -yc 


7769 


7481 


389 


IL-13R + -yc 


8402 


8130 


484 


IL-4R + 7C 


351 


107 


109 



* Each value is the average of triplicates, the S.D. values of which 
were less than 10%. The results shown are representative of three 
independent experiments. 

Co-expression ofIL-13R and IL-4R in COS Cells— V^e inves- 
tigated whether the IL-13R and IL-4R would interact in the cell 
membrane to reconstitute a receptor that allowed cross-compe- 
tition of both cytokines. The results of the co-expression exper- 
iments are shown in Tables I and IL Table I shows the binding 
of radiolabeled IL-13 to COS cells transfected with cDNAs 
coding for IL-13R. IL-4R, and 7c alone and in various combi- 
nations. Mock transfected cells displayed low IL-13 binding 
that is displaced by both IL-13 and IL-4, as described previ- 
ously (6). Cells transfected with the cDNA for IL-13R showed 
high IL-13 binding that was completely displaced by IL-13 but 
not by IL-4; and the expression of IL-4R resulted in IL-13 
binding values similar to those observed in mock transfected 
cells. Thus, IL-13R recognizes IL-13 but not IL-4, and IL-4R 
does not bind IL-13, as described previously (5, 6). COS cells 
expressing both IL-13R and IL-4R showed similar binding for 
IL-13 to that of the cells expressing IL-13R alone; however, 
about 8-10% of that binding can be displaced by IL-4, suggest- 
ing the reconstitution of a binding site shared by both IL-13 
and IL-4. The co-expression of 7c did not improve the reconsti- 
tution of shared sites, but resulted in a diminution of IL-13 
binding. Similarly, the expression of the 7c with the IL-13R 
also resulted in a diminution of IL-13 binding. Work is in 
progress to investigate an eventual interaction of 7c and IL- 
13R that may explain these observations. The co-expression of 
IL-4R and 7c did not result in binding sites recognized by IL-13, 
as described previously (6). 

Cells from the same transfection experiments used for bind- 
ing of labeled IL-13 were also analyzed for binding of labeled 
IL-4. The binding results with labeled IL-4 (Table II) showed 
that IL-13 cannot displace labeled IL-4 from the IL-4R alone or 
from the IL-4R-7C complex. But when the IL-4R and the IL-13R 
were co-expressed, the IL-13 was able to displace a fraction of 
the labeled IL-4 binding, suggesting again an interaction of 
both chains to reconstitute shared binding sites for both 
cytokines. 

DISCUSSION 

Compelling evidence supports the idea that IL-13 and IL-4 
share receptor components as well as signal transduction ele- 
ments (5. 6, 9-13). However, neither of the two proteins de- 
scribed as components of the high affinity IL-4 receptor, the 
IL-4R and the 7c, is responsible for the binding cross-competi- 
tion of IL-4 and IL-13. Thus, another protein(s), part of the IL-4 
complex, is clearly necessary for the recognition of IL-13. Here 
we describe the characterization and cloning of an IL-13R. 

We used Caki-1 cells as source of mRNA for the molecular 
cloning of the IL-13R because our results suggested that these 
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Table II 

binding to COS cells expressing IL-13R. IL'4R, and 
yc alone and in various combinations 

COS cells were transfected with the indicated cDNAs and analyzed 
for '^°ML-4 (160 pM) binding 48 h later, as indicated under "Materials 
and Methods." in the absence or presence of either cold IL-13 (117 tm) 
or IL-4 (67 nM). 



cDNAs transfected 


l'"l)iL-4. 
total bound 


Bound displaced by 
IL-13 IL-4 




dpnf 




dpm 


Mock 


813 


436 


416 


IL-13R 


1159 


710 


830 


IL-4R 


20.862 


562 


20.276 


IL-13R + IL>4R 


22.008 


1793 


21.533 


IL-13R + IL-4R + yc 


27.000 


629 


26.571 


IL-13R + -yc 


1121 


455 


445 


IL-4R + 7C 


23,641 


88 


23.112 



" Each value is the average of triplicates the S.D. values of which 
were less than 10%. The results shown are representative of three 
independent experiments. 



cells express an IL-4-IL-13 receptor complex responsible for the 
binding cross-competition and biological activities of IL-4 and 
IL-13 similar to that previously described in other cells and, in 
addition, that these cells overexpress the IL-13R component 
that binds IL-13, 

The cloned cDNA for the IL-13R codes for a protein of 380 
amino acids with characteristics of the hematopoietic cytokine 
receptor family; and the pattern of IL-13R mRNA expression is 
consistent with the receptor distribution estimated by IL-13 
binding and with the known IL-13 target cells. 

The new receptor has a short cytoplasmic sequence, which is 
in line with the observation that probably only the receptor 
complex that is shared by IL-4 and IL-13 transduces a signal to 
the cells. However, further work is needed to assess whether 
this cytoplasmic domain plays a role in associating with other 
chains and contributes to signal transduction as described for 
the IL-5Ra (22). 

The homology of the IL-13R with the IL-5Ra chain is inter- 
esting because the latter binds IL-5 but needs another protein, 
the /3 chain shared with the IL-3 and GM-CSF (granulocyte 
macrophage colony-stimulating factor) receptors, to form a 
high affinity receptor complex that is capable of signal trans- 
duction (23). However, it should be noted that the IL-13R, when 
expressed in COS-7 cells, does not need a second chain to 
reconstitute a high afdnity binding site. It therefore resembles 
the recently described IL-15 binding protein, which also dis- 
plays high affinity binding for IL-15 in the absence of the two 
other components of the IL-15 receptor complex (24). 

The recombinant receptor recognized IL-13 with high affin- 
ity, but was not able to bind IL-4, as expected from the results 
obtained in Caki-1 cells. The radiolabeled complex observed in 
the cross-linking experiments with labeled IL-13 has the same 
mobility as that observed in other cell lines (6), It most proba- 
bly corresponds to the 60-70-kDa band observed in cross-link- 
ing experiments performed with labeled IL-4 (6. 1 5) in addition 
to the 140-kDa IL-4R complex, arguing in favor of the interac- 
tion of both proteins in the functional receptor complex. 

The results of the co-expression of the IL-13R and the IL-4R 
in COS-7 cells suggest that these proteins can interact with 
each other in the cell membrane to reconstitute a receptor 
where IL-13 and IL-4 can compete with each other. However, it 
should be noted that only a fraction of the expressed IL-4R and 
IL-13R resulted in shared sites for both cytokines. Several 
reasons may explain the low yield of reconstituted shared sites. 
For example, the existence of another protein (s)» present in 
limiting amounts in COS cells, necessary to reconstitute the 
IL-4'IL-13 receptor complex. Another explanation could be an 
incorrect stoichiometry of IL-4R and IL-13R in the cell mem- 
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brane, but this is unlikely because co-transfections producing 
different relative amounts of the IL-4R and IL-13R did not 
show major differences in the number of reconstituted recep- 
tors. Finally, our results do not exclude the existence of another 
IL-13R with a better capacity to interact with the IL-4R for the 
reconstitution of shared binding sites for IL-4 and IL-13. If this 
is the case, the IL-13R described here could either play a 
regulatory role in the IL-4'IL-13 receptor complex, or require a 
protein, different from the known IL-4R, to reconstitute a func- 
tionally independent IL-13 receptor complex. 

The results obtained in the co-transfection experiments with 
the yc demonstrate, as previously suggested (5, 25), that this 
protein is not the limiting factor for the reconstitution of an 
IL-4R-IL-13R complex, a conclusion also supported by the ab- 
sence of detectable yc mRNA in Caki-1 cells (7). It should be 
noted that the expression of the 7c improved IL-4 binding as 
described previously (16), but decreased the binding of IL-13. 
suggesting complex interaction between the different receptor 
components. Work is in progress to address this problem. 

The cloning of the IL-13R described here should help to 
address the question of the regulation of the expression of the 
receptor under normal and pathological conditions, such as 
allergy, where IL-13 may play a key role. Furthermore, the 
availability of the cDNA will facilitate the cloning of other 
proteins necessary for the reconstitution of the IL-4'IL-13 re- 
ceptor complex or of an eventual independent IL-13 receptor 
complex and may help the rational design of new drugs that 
specifically antagonize IL-4 or IL-13 activities. 
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ABSTRACT P-sdectin on activated platelets and stima- 
lated endothritol cdb mediate ceD adhedon wfth monocytes 
and neutrophils. Since activated platelets induce tissue functor 
on mononuclear leokocytes, we eiamined the effect of P-selec- 
tin on the expression of ttaoe factor activity in monocytes. 
Purified P-selectin stimnlatfd tissue factor expression on mono- 
nuclear leukocytes In a dose<4epciident manner. Qdnese ham- 
ster ovary (CHO) ceDs expressing P-sclectIn stimulated tfasue 
foctor procoagulant activity In purified monocytes, whc tc as 
nntransfectfd CHO cdb and CHO cells e^ressfaig £-selectfai 
did not. Anti-P-selectfai antibodies inhibited the effects of 
purified P-sdectin and CHO cdls expressing P-selectin on 
UMHiocytes. Incubation ctf CHO edis expressing P-sdectin with 
monocytes leads to the deveio|m>ent of tissue fiKtor mRNA in 
monocytes and to the expression of tissue (actor antigen on the 
monocyte surface. Tliese results Indicate tiiat P-sdectin ufNreg- 
ulates the expression of tissue factor on monocytes as weD as 
mediates the binding of platelets and endotheUd cells with 
monocytes and neutro|riiils. The l>indlng of P-sdectin to mono- 
cytes in the area of vascular ii^ury may l>e a component <tf a 
mechanism that Initiates thrombosis. 



Blood clotting is a host defense mechanism that, in paraUel 
with the inflammatory and repair responses, preserves the 
integrity of the vascular system after tissue ii\iury (1). Plate- 
lets, leukocytes, and endothelial cells are among the cellular 
components critical for this process. The plasma blood 
clotting proteins participate in a molecular cascade in which 
tissue iiuury activates blood coagulation, leading to the 
formation of a fibrin clot (2). The response to vascular injury 
culminates in the formation of a platelet plug, the deposition 
of leukocytes in injured tissue, and the initiation of inflam- 
mation and wound healing. Blood coagulation is initiated 
through the action of tissue factor. NormaUy not exposed to 
blood, tissue factor is an integral membrane protein ex- 
pressed constitutively on the surface of nonvascular ceDs. 
Monocytes and endothelial cells can be induced to express 
tissue factor on their surface (3, 4). Monocytes can be 
activated by endotoxin (5), immune complexes (6), certain 
cytokines (7), and platelets (8-11), leading to tissue factor 
expression, whereas endothelial ceils express tissue factor 
when stimulated by certain cytokines (12). Tissue factor, with 
a molecular weight of 43, (MX) (13, 14), binds factor VII and 
factor Vila to form the tissue factor/factor Vila complex that 
activates factor IX and factor X (15). 

P-selectin is a cell adhesion molecule that mediates the 
interaction of platelets and endothelial cells with neutrophils 
and monocytes (16, 17). P-selectin, a member of the selectin 
family of adhesion molecules (18-21), is an integral mem- 
brane protein found in the a granules of platelets (22, 23) and 
the Weibel-Palade bodies of endothelial ceUs (24, 25). Upon 
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stimulation of these cells by agonists such as thrombin, 
P-selectin is phosphorylated (26, 27) and rabidly translocated 
to the plasma membrane (23). P-selectin is a lectin that binds 
to lineage-specific carbohydrates on the surface of mono- 
cytes and neutrophils (28-30). This protein binds to a mucin- 
like glycoprotein PSGL-1 that must be properiy glycosylated 
to retain fiinctional properties as the P-selectin Ugand (31). 
P-selectin on platelets mediates the accumulation of leuko- 
cytes into the growing thrombus during experimental throm- 
bosis in vivo (32). Inhibitory antibodies that block the inter- 
action of P-selectin on pilatelets with the P-selectin ligand on 
leukocytes inhibit the uptake of leukocytes into the tbrombus 
and ii^bit the magnitude of thrombus formation. These 
experiments have demonstrated that P-selectin mediates 
monocyte and neutrophil interaction with activated platelets 
in vitro and in vivo. The potential exists for stimulation of 
leukocyte effector fiinction by P-selectin bindiiig. To evalu- 
ate this potential, we have examined the ability of P-selectin 
to upre^date tissue factor expression on monocytes. In the 
current study » we demonstrate that P-selectin induces the 
expression of tissue factor on monocytes exposed to P-se- 
lectin. 

EXPERIMENTAL PROCEDURES 

Protdns and Cell Lines. P-selectin was purified firom plate- 
lets by detergent extraction of platelet membranes, heparin- 
Sepharose chromatogr£4>hy, and afifinity chromatogr^hy 
using AC1.2 (30). P-selectin appeared homogeneous by SDS 
gel electrophoresis. Production of the noninhibitory anti-P- 
selectin antibody AC1.2 and the inhibitory P-selectin anti- 
body GA6 have been dcsc aibed (16, 32). The inhibitory 
anti-tissue factor antibody HTFl (33) was the gift of Yale 
Nemerson (Mt. Sinai School of Meidicine). Chinese hamster 
ovary (CHO) cells expressing P-selectin (CHO:P-selectin) or 
E-selectin (CHO:E-selectin) adhere to leukocytes, in con- 
trast to CHO cells (30). 

CeO Isolation and Catture. Blood was obtained firom normal 
volunteers and anticoagulated with 0.1 vol of 3.8% sodium 
citrate/0.15 M NaCl. Erythrocytes and leukocytes were 
sedimented at 150 x g for 15 min at lifC. Platelet-rich plasma 
was used for platelet isolation (11). The sedimented eryth- 
rocytes and leukocytes were acfjusted to the original volume 
with 0.38% sodium citrate/0.15 M NaCl, sedimented, and 
layered onto FicoU/Hypaque (Pharmacia). After centrtfuga- 
tion at 400 x g for 30 min at 8°C, mononuclear cells were 
removed, diluted in 0.38% sodium citrate/0. 15 M NaCl, and 
sedimented at 450 x g for 7 min at 8**C. The pellet was 
resuspended in buffer and the cells were washed three times. 
Mononuclear cells were resuspended in RPMI 1640 medium 



Abbreviations: GAPDH, glyceraldehyde phosphate debydrogCDase; 
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supplemented with penicillin (100 units/ml) and streptomycin 
(100 Aig/ml) at a cell concentration of 10^ cells per ml. Cell 
viability (>94%) was assessed by trypan blue exclusion. 
Platelet contamination was 0.3-0.5 platelet per mononuclear 
cell. The monocytes in this population, as assessed by 
nonspecific esterase staining, were 25-30%. Monocytes were 
purified by using a discontinuous Percoll density gradient. 
Mononuclear cells resuspended in RPMI 1640 medium sup- 
plemented with 10% heat-inactivated fetal calf serum were 
layered onto a 46% isosmotic Percoll solution. After centrif- 
ugation at 4°C for 30 min at 550 x the cells at the RPMI 1640 
medium/PercoU interface were collected and washed twice at 
440 X g in seruni-free RPMI 1640 medium. These prepara- 
tions contained about 85% monocytes and 15% lymphocytes. 
Mononuclear cells were incubated with platelets, purified 
P-$electin» or CHO cells at 37X^ in sterile, pyrogen-fi^e 
stoppered test tubes or in tissue culture plates under 5% 
C02/95% air. 

Control of EDdotfKdn Contamination. All reagents used for 
cell isolation and culture were prepared with endotoxin-firee 
water. Solutions were prepar^ in glassware rendered en- 
dotoxin-fim by high temperature. All reagents showed a level 
of endotoxin contamination, as assessed by a chromogenic 
Limulus assay (BipWhittaker), lower than 0.1 endotoxin 
unit/ml. All experiments were performed under sterile con- 
ditions. 

Coagulant Activity Assay. Except where otherwise indi- 
cated, cells were disrupted by three £reeze-thaw cycles prior 
to measurement of procoagulant activity by a one-stage 
dotting assay (11). In some experiments, intact cells were 
used. Intact or disrupted cells (100 /d) were mixed with 100 
id of normal human plasma at 37X. After 30 sec, 100 §j1 of 
25 mM CaCli at 37^ was added to the mixture and the time 
to dot formation was recorded. The values were converted 
to arbitrary units of procoagulant activity by comparison with 
a standard curve obtained by using a human brain thrombo- 
plastin standard (U.K. 1) fix>m the reference laboratory (gift 
of L. Poller, Manchester, U.K.). This preparation was as- 
signed a value of 10(X> units for a clotting time of 20 sec. All 
experimental results shown are the averages of duplicate or 
triplicate independent cultures within one experiment. 

Quantitative PCR. Oligonucleotide primers corresponding 
to bp 178-198 (sense) and bp 495-515 (antisense) of the 
human tissue factor coding sequence and bp 64-86 (sense) 
and bp 581-603 (antisense) in the coding sequence of human 
glyceraldehyde phosphate dehydrogenase (GAPDH) were 
synthesized. Quantitative PCR was performed with 1 §Aof 
cDNA fix>m 100 /il of total cDNA resulting from reverse 
transcription of 1 /ig of RNA. The assay mixture contained 
10 mM TrisHCl (pH 8.3), 50 mM KQ, 1.5 mM MgQi. 0.4 /xg 
of each primer, 250 /iM each dKTP, 2.5 units of Taq 
polymerase. The amplification conditions were 94°C for 1 
min, 55°C for 1 min, and 72"C for 1 min. After 25 cycles, the 
PCR products fi-om GAPDH mRNA (528 bp) and tissue factor 
mRNA (317 bp) were analyzed by electrophoresis in a 2% 
agarose gel stained with ethidium bromide. For tissue factor. 
Southern blot analysis was performed on 8 ^li of the PCR 
product using the labeled 317-bp fragment as a probe. The 
relative intensity of the bands visualized by autoradiography 
was measured by laser densitometry (34). 

Fluorescence Mionoscc^y. Mononuclear cells were grown 
on glass slides in the presence of buffer, lipopolysaccharide 
(LPS) {Escherichia coU 055:B5W; Difco), or P-selectin for 6 
hr. The cells were fixed with 2% paraformaldehyde for 1 hr 
at 4X and the reaction was quenched with TBS/10 mM 
NH4CI for 10 min. After incubation with 2% bovine serum 
albumin/phosphate-buffered saline (BSA/PBS) for 30 min 
and 2% BSA/PBS/0.2% human immunoglobulin for 30 min, 
the cells were incubated with HTFl (5 /ig/ml in 2% BSA/ 
PBS/0.2% human inununoglobulin for 30 min at 23*'C. After 
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washing, fluorescein isothiocyanate-coi\jugated anti-mouse 
inununoglobulin was reacted with the cells for 30 min at 23°C. 
The cells were examined with a Zeiss fluorescence micro- 
scope. 

RESULTS 

The incubation of platelets with mononuclear leukocytes 
leads to expression of procoagulant activity. Mononuclear 
cells or platelets alone incubated for 16 hr at 37^ expressed 
minimal procoagulant activity. However, when platelets and 
mononuclear cells were coincubated under these conditions, 
we observed a 50- to 100-fold increase in procoagulant 
activity, confinning earlier results (8). This procoagulant 
activity is due to tissue factor inasmuch as stimulated mono- 
nuclear cells incubated with the inhibitory anti-tissue factor 
antibody HTFl (30 /ig/ml) expressed no procoagulant activ- 
ity. 

Purified P-selectin induced tissue factor expression by 50- 
to 100-fold under the conditions used (Fig. 1 A). Tissue factor 
expression was inhibited if GA6, an inhibitory monoclonal 
P-selectin antibody, was added to cells prior to P-selectin 
exposure. This P-selectin preparation did not have detectable 
endotoxin, as monitored in the Limulus assay. However, 
endotoxin is a well-known potent stimulator of tissue factor 
expression in monocytes (5, 35). To distinguish between 
upregulation of tissue factor due to P-selectin and that due to 
contaminating endotoxin that may be present below the 
levels of detection of the Limulus assay, P-selectin was 
heated at 100°C for 30 min to denature the protein. Heat 
denaturation of P-selectin abolished the upregulation of tis- 
sue factor activity. In contrast, LPS, in its native form and 
after heating at lOO^C for 30 min, stimulated tissue factor 
activity. These results indicate that P-selectin, and minor 
endotoxin contaminants, upregulates tissue fiEu:tor activity on 
monocytes. Intact monocytes stimulated by either P-selectin 
or endotoxin expressed »*15% of the tissue factor activity of 
lysed cells. This is similar to the results of Drake et al. (36) 
and Levy et al. (37), where the tissue factor activity of intact 
cells was 18-21% that of lysed cells. These results indicate 
that the tissue factor antigen expressed on the cell surfsice 
after P-selectin exposure (see below) is fiinctional. 

The expression of tissue factor activity as a fiinction of 
P-selectin concentration was studied. Expression of tissue 
factor activity is dependent on P-selectin concentration; the 
response was saturable (Fig. IB). This activity was inhibited 
with an inhibitory anti-tissue factor antibody. 

Unstimulated monocytes do not contain tissue factor in a 
storage pool nor is any tissue factor expressed on the cell 
surface (36). After stimulation, the kinetics of tissue factor 
expression in monocytes induced by P-selectin was com- 
pared to that induced by LPS. Tissue factor expression 
induced by P-selectin was observed at 4 hr, peaked at 6 hr, 
and then gradually decayed (Fig. IC). The peak of P-selectin- 
induced tissue factor preceded the LPS-induced peak of 
tissue factor. The low level of tissue factor activity measured 
in the absence of P-selectin or LPS represents a low level of 
spontaneous activation of monocytes in culture. Although 
the endotoxin level in the tissue culture medium is below the 
sensitivity limit of the amebocyte lysate assay, we cannot 
eliminate a small amount of endotoxin as the cause of 
monocyte activation. 

(Tells expressing P-selectin also mediate the induction of 
tissue factor expression; we compared CHO:P-selectin with 
untransfected CHO cells for their ability to induce tissue 
factor expression on monocytes. CHO cells incubated with 
mononuclear cells had no effect on the development of tissue 
factor activity (Fig. 2A). However, CHO: P-selectin cells 
incubated with mononuclear cells yielded approximately a 
10-fold increase in tissue factor activity. This activation was 
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Fig . 1 . Efifect of purified P-selectin on tissue factor expression in 
monocytes. {A) Tissue factor expression was measured in monocytes 
that were untreated or exposed to P-selectin (250 ng/ml), P-selectin 
plus GA6 (100 ;jig/ml), or LPS (100 ng/ml) for 6 hr at 37X. (B) Effect 
of increasing concentrations of purified P-selectin on tissue factor 
expressicm. Mononuclear cells were incubated with P-selectin at 
various concentrations in stoppered, endotoxin-firee tubes at 37X. 
After 6 hr, cells were assayed for tissue factor activity. (C) Kinetics 
describing the development of tissue factor expression in mononu- 
clear cells as induced by P-selectin. Mononuclear cells were incu- 
bated with buffer, P-selectin (250 ng/ml), or LPS (100 ng/ml). After 
the indicated time, the cells were assayed for tissue factor activity at 
various time intervals, o, P-selectin; #, LPS; buffer control. 

blocked in the presence of the anti-P-selectin antibody GA6. 
While both isolated P-selectin and CHO:P-selectin cells 
induce tissue factor in monocytes, activated platelets elicit 
significantly higher levels of tissue factor when incubated 
with monocytes. 

Although among mononuclear leukocytes the monocyte is 
the only cell capable of expressing tissue factor (38), we 
confirmed that the cell responsible for P-selectin-induced 
upregulation of tissue factor expression was the monocyte. 
Monocytes isolated from mononuclear cells with a PercoU 
density gradient, when incubated with CHO:P-selectin, also 
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Fig. 2. CHO:P-selectin induces expression of tissue factor in 
monocytes. (A) CHO cells, either naive (CHO) or CHO:P-sclcctin, 
were cultured to confluency. Mononuclear cells were added and 
incubated for 6 hr in the presence or absence of F(ab')2 fragments of 
GA6 (100 /xg/ml) before assay for tissue factor activity. (JB) CHO, 
CHO:P-selectin, or CHO:E-selectin cells were cultured as described 
above. PercoU gradient-purified monocytes were incubated as de- 
scribed above before assay for tissue factor activity. 

expressed tissue factor activity (Fig. 2B). We asked whether 
the monocyte stimulation induced by CHO: P-selectin was 
due to P-selectin or whether the binding of monocytes to 
CHO cells mediated by any cell adhesion molecule might 
elicit adhesion-dependent ceU activation. Monocytes were 
incubated with CHO:E-selectin cells (30). Althougji mono- 
cytes express an E-selectin ligand, which is distinct from the 
P-selectin ligand (30, 31, 39), monocytes bound to CHO:E- 
selectin cells did not express tissue factor (Fig. 2B), 

To determine the steady-state levels of tissue factor 
mRNA, we performed quantitative reverse transcriptase 
PCR (40, 41). RNA prepared from untreated control cells, 
cells treated with LPS, P-selectin, or P-selectin plus the GA6 
antibody was reverse transcribed and used for parallel assay 
of tissue factor mRNA and GAPDH PCR amplification. The 
expected PCR product for tissue factor (317 bp) was obtained 
and its identity was confirmed by restriction analysis with 
Acc I and Rsa I. To quantify the effect of P-selectin on tissue 
factor mRNA levels, we performed a Southern blot analysis 
of the PCR products diuing amplification. The PCR product 
from the control cells is barely visible, indicating that the 
expression of tissue factor mRNA is negligible (Fig. 3). The 
level of tissue factor mRNA was increased by ""SO-fold in 
cells exposed to LPS or to P-selectin. GA6 suppressed the 
level of tissue factor mRNA. Control experiments measuring 
the levels of GAPDH mRNA indicated similar mRNA levels 
in control and P-selectin- and LPS-treated cells, indicating 
that the efficiency of reverse transcription was comparable 
among the experimental groups. For both tissue factor and 
GAPDH, an identical PCR performed in parallel without 
including reverse transcriptase gave no amplification prod- 
ucts, thus ruling out PCR carryover. 

To confiFm that P-selectin upregulates tissue factor ex- 
pression on the surface of monocytes » mononuclear ceUs 
were cultured on glass slides and incubated for 6 hr with 
P-selectin, LPS, or buffer. The adherent monocytes incu- 
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Fig. 3. Efifect of P-selectin stimulation on tissue factor mRNA 
levels. Reverse transcriptase PCR analysis of tissue foctor and 
GAPDH mRNA expression in monocytes untreated (C) or exposed 
to LPS (100 ng/ml), P-selectin (250 ng/mi) fdus GA6 (100 $M%/Bd), or 
P-selectin alone for 6 hr at 3TC, (Top) Analysis of tissue factor 
mRNA by Southern blotting. Tissue factor cDNA (317 bp) is 
indicated by the arrow and the position of the molecular weight 
markers is indicated. {Middle) Ethidium bromide staining of the same 
PCR products. The specific tissue factor cDNA product (317 bp) is 
indicsited by the arrow. iJBottom) Ethidiiun bromide staining of 
reaction products. The GAPDH cDNA product (528 bp) is indicated 
by the arrow. A parallel reaction without reverse transcriptase was 
carried out to test possible contamination (no KV), 

bated with P-selectin or LPS were reactive with an anti-tissue 
factor antibody, while control monocytes incubated witii 
buffer were not (Fig. 4). 

DISCUSSION 

The initiation of blood coagulation following tissue iiuury 
requires the exposure of tissue factor on the cell surface to 
plasma factor Vila, a vitamin K-dependent clotting enzyme 
that circulates at low concentration in the blood (42). The 
tissue factor/factor Vila complex on the cell surface acti- 
vates both factor IX and factor X, thus leading to the 
sequential conversion of the blood clotting proenzymes to 
enzymes and the generation of a fibrin clot. Tissue factor 
plays a pivotal role in this process insofar as the expression 
of tissue factor activity is required for initiation and propa- 
gation of the signals necessary for clot formation. Nonvas- 
cular cells express tissue foctor constitutively. During tissue 
ii\jury, these cells make contact with flowing blood» thus 
initiating blood clotting on a rapid time scale. In contrast, 
monocytes and endothelial cells do not express tissue factor 
constitutively but only after cell stimulation (12, 43, 44). De 
novo synthesis of tissue &ctor by cells in culture requires at 
least 2 hr before tissue factor activity is expressed on the cell 
surface (43). 

P-selectin is an adhesion molecule on activated platelets 
and stimulated endothelial cells that mediates the bindiiig of 
certain leukocytes to these cells. The P-seiectin ligand ex- 
pressed on these cells include the Le* carbohydrate struc- 
ture, sialic acid, and a protein component PSGL-1 (31). The 
binding of P-selectin to leukocytes may also up- or down- 
regulate ceU functions. Since monocytes express the P-se- 
lectin ligand and synthesize tissue factor after cell stimula- 
tion, we have evaluated whether P-selectin binding can lead 
to the expression of tissue factor on monocytes. 



Proc, Natl Acad, ScL USA 91 (1994) 



t 




Fig. 4. Immunofluorescence staining of P-selectin-stimulated 
monocytes with a tissue factor antibody. Mononuclear ceUs were 
cultured on glass slides and stimulated with P-selectin, LPS , or buffer 
control for 6 hr. The cells were stained with fluorescein isothiocy- 
anate-^^oqjugated HTFl, an anti-tissue factor antibody. (A) Buffer 
control. (0) P-selectin. (C) LPS. 

We have demonstrated that P-selectin in three distinct 
forms is c^>able of upregulating tissue factor: (i) platelets, 
which express P-selectin on the cell surface upon activation; 
(if) CHO cells stably transfected with the P-selectin; (Hi) 
purified P-selectin. Interaction of platelets or CHO:P-selectin 
with monocytes through P-selectin appears sufficient to 
initiate signal transduction for tissue factor expression since 
inhibitory antibodies to P-selectin block cell-cell interaction 
and the tissue foctor response. However, activated platelets 
induce significantiy more tissue factor expression than do 
either P-selectin or CHO:P-selectin cells. While P-selectin 
plays an important role in tethering neutrophils to activated 
endothelial cells, activation of the neutrophils, as measured 
by increases in cytosolic Ca^'*', upregulation of CD11/CD18, 
change in morphology, or priming for granule secretion, 
requires participation of a second molecule, platelet- 
activation factor, on the surfece of activated endothelial cells 
(45). It is likely that in addition to P-selectin the activated 
platelet provides a second ligand either s<^uble or on the 
surface of the platelet that augments the synthesis of tissue 
factor by bound monocytes. 
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The potency of P-selecdn alone in upregulating the syn- 
thesis of tissue factor by monocytes is small compared to 
LPS. However, the relative 16w potency of P-selectin as 
measured in our experiments does not exclude a physiologic 
role for P-selectin in thrombogenesis. For example, we have 
recently demonstrated in an ex vivo model that filnin depo- 
sition after leukocyte accumulation on a vascular graft is 
inhibited by anti-P-selectin antibodies (32). Low levels of 
tissue factor expression as mediated by P-selectin may be 
significant in vivo in fibrin generation during vessel wall 
ii\jury. Alternatively, P-selectin may be just one component 
required for the initiation of tissue factor expression; for 
example, a second molecule (e.g., l^HETE) sij^iificantly 
ampMes the signal for tissue factor expression (unpublished 
results). 

Blood coagulation is a host defense mechanism that in- 
volves an inunediate, rapid response to the loss of integrity 
of a closed vascular system and a slower, delayed response 
that leads to thrombogenesis, inflammation, and wound 
healing. Tissue factor on nonvascular cells, which becomes 
acces^le to flowing blood during vessel iiuury, is respon- 
sible for the immediate response. In contrast, tbe delayed 
expression of tissue factor in the region of tissue iqjury may 
be critical to maintenance of thrombus. In our proposed 
model, neutrophils and monocytes accumulate in the area of 
tissue injury due to the expression of P-selectin on activated 
platelets that collect in the subendothelium and on stimulated 
endothelium in the ii^ured vasculature. Upon interaction 
with P-selectin, and possibly other required ligands, on 
platelets and endothelial cells, monocytes upregulate the de 
novo synthesis of tissue factor. Within several hours, tissue 
factor expression on the cell surface supports and maintains 
the local activation of blood coagulation and the deposition of 
fibrin. It is likely that P-selectin initiates a general cell 
response, of which tissue factor biosynthesis is but one of a 
series of metabolic and synthetic pathways activated. P-se- 
lectin-mediated effector function in leukocytes will likely 
play an important role in inflammation and thrombosis. 
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We previously demonstrated that P-selectin binds 
with high affinity to a trace, homodimeric glycoprotein 
ligand on human myeloid cells. The ligand carries the 
sialyl Iiewis z (sLe^ epitope, a limited number of 
AT-linked glycans, and clustere«l, sialylated O-linked gly- 
cans. In this study we demonstrate that the polypeptide 
component of this ligand is identical to that of P-selectin 
glycoprotein ligand-1 (PSGL-1), a molecule recently 
identified by expression cloning from a human myeloid 
cell cDNA library. We have examined the effects of gly- 
cosidases on purified, radioiodinated PSGI/-1 fix>m hu- 
man neutrophils to further characterize the structure 
and function of the attached oligosaccharides. We found 
that PSGLi-l had poly-AT-acetyllactosamine, only some of 
which could be removed with endo-p-galactosidase. The 
minority of the Le* and sLe' structures were on endo-^ 
galactosidase-sensitive chains. Peptide^A^■glycosidase F 
(PNGaseF) treatment removed at least two of the three 
possible AT-iinked oligosaccharides from PSGL-1. Ex- 
pression of Le* and sLe' was not detectably altered by 
PNGaseF digestion, indicating that these structures 
were primarily on O-linked poly-iV-acetyllactosamine. 
Endo-p-galactosidase-treated PSGL-l retained the abil- 
ity to bind to P-selectin, suggesting that some of the 
oligosaccharides recognized by P-selectin were either 
on enzyme-resistant poly-iV-acetyllactosamine or on 
chains which lack poly-iV-acetyllactosamine. PNGaseF 
treatment did not affect the ability of PSGLr-1 to bind to 
P-selectin, demonstrating that the oligosaccharides re- 
quired for P-selectin recognition are O-linked. PSGLrl 
also bound to E-selectin, but with at least 50-fold lower 
affinity than to P-selectin. These data suggest that 
PSGL-1 from human neutrophils displays complex, sial- 
ylated, and fucosylated O-linked poly-JV-acetyllac- 
tosamine that promote high afGnity binding to P-selec- 
tin, but not to £-selectin. 



The selectina are a family of three Ca^-dependent mem- 
brane-bound lectins that initiate adhesion of leukocytes to 
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platelets or endothelial cells under the shear forces foimd in the 
venular circulation (1-3). L*selectin, expressed on leukocytes, 
binds to constitutive or inducible ligands on endothelial cells. 
E-selectin, expressed by cytokine-activated endothelial cells, 
and P-selectin, expressed by thrombin-activated platelets and 
endothelial cells, bind to ligands on myeloid cells and subsets of 
lymphocytes. Although the selectins interact weakly with small 
sialylated, fucosylated oligosaccharides such as sialyl Lewis x 
(sLe'^ NeuAca2-3Gaipi-4[Fucal-^]GlcNAc) (4, 5), they bind 
with higher affinity to glycans displayed on a limited number of 
glycoproteins (6-11) or proteoglycans (12). The high affinity 
ligands are potentially important as physiologic mediators of 
selectin-mediated leukocyte adhesion during inflammation. 
Tlius, understanding the structural basis for high affinity rec- 
ognition of these glycoconjugates by the selectins has attracted 
increasing interest. 

A subset of the high affinity selectin ligands consists of mu- 
cin-like glycoproteins (7, 10, 11, 13). We have previously char- 
acterized one of these molecules, a sialomucin ligand for P- 
selectin that is expressed by human neutrophils and the 
human promyelocjrtic HL-60 cell Hne (6, 13). Binding of P- 
selectin to the ligand is Ca^* dependent and is abolished by 
treatment of the ligand with sialidase. The ligcmd is a ho- 
modimer with two disulfide-linked subunits with relative mo- 
lecular masses of 120,000 as assessed by SDS-PAGE. It car- 
ries memy unmodified sialic residues as well as the sLe' 
antigen. The ligand contains at least one PNGaseF-sensitive 
^-linked glycan that P-selectin does not require for recognition. 
In contrast, it has clustered, sisdylated, O-linked oligosaccha- 
rides that render the polypeptide backbone sensitive to cleav- 
age by the enzyme O-sialoglycoprotease. IVeatment of intact 
HLf-60 cells with this enzjnme eliminates the high affinity bind- 
ing sites for P-selectin (14) and prevents cell adhesion to im- 
mobilized P-selectin (13, 15), without affecting the overall sur- 
face expression of sLe' (13). These data suggest that this 
sialomucin, which carries only a small portion of the cell sur- 
face sLe*, corresponds to functionally important, high affinity 
binding sites for P-selectin on human myeloid cells. 

Sako et al. (16) recently isolated a cDNA derived from human 
HL-60 cells that encodes a glycoprotein ligand for P-selectin in 
COS cells co-transfected with a cDNA for an al,3/4-fucosyl- 



* The abbreviations used are: Le*. Lewis x; CHO, Chinese hamster 
ovary; ConA, concanavalin A; HSA, human serum albumin; PBS, phos- 
phate-buffered saline; PAGE, polyacrylamide gel electrophoresis; 
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gand 1; sLe", sialyl Lewis x; tES, truncated soluble E-selectin; tPS, 
truncated soluble P-selectin; WGA, wheat germ agglutinin; mAbs, 
monoclonal antibodies; HBSS, Hanks' balanced salt solution; MOPS, 
4-morpholinepropanesulfonic acid; HSA, human serum albumin. 
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transferase. The recombinant glycoprotein also interacts with 
E-selectin in cell adhesion experiments. The cDNA-derived 
amino acid sequence of this molecule, termed P-selectin glyco- 
protein ligand-1 (PSGL-1), predicts a 402-residue type I mem- 
brane protein with a SerAThr/Pro-rich extracellular domain 
that contains three potential sites for addition of iV-linked oli- 
gosaccharides as well as a single cysteine that might promote 
dimerization. Like the P-selectin glycoprotein ligand that we 
identified in human neutrophils, recombinant PSGL-1 is a ho- 
modimer with a limited number of iV-Iinked glycans and many 
O-linked oligosaccharides. However, the identity of the 
polypeptide components of these two molecules has not been 
formally proven. Furthermore, the degree to which the oligo- 
saccharides on recombinant PSGL-1 resemble those on the na- 
tive glycoprotein ligand is unknown. 

In this study we have demonstrated that antibodies to a 
peptide derived from PSGL-1 also recognize the human neu- 
trophil glycoprotein ligand, establishing the identities of the 
polypeptide backbones of both molecules, lb ensure consistency 
in nomenclature, we have also adopted the term PSGL-1 to 
refer to this glycoprotein. We have studied the effects of various 
glycosidases on the structure and function of PSGL-1 from 
human neutrophils. We found that this molecule, unlike recom- 
binant PSGL-1, is largely resistant to treatment with endo-a- 
i^-acetylgalactosaminidase, suggesting that it has few simple 
core 1 Galj31-^GalNAc disaccharides linked to serine or thre- 
onine. Furthermore, human neutrophil PSGL-1 displays sLe" 
and its nonsialylated counterpart, Le', primarily on O-linked 
poly-^-acetyllactosamine. This molecule also binds to E-selec- 
tin, but with at least 50-fold lower cdfinity than to P-selectin. 
These data suggest that PSGL-1 on human neutrophils pre- 
sents an array of sialylated, fucosylated, O-hnked poly-N- 
acetyllactosamine that P-selectin preferentially recognizes. 

EXPERIMENTAL PROCEDURES 

Materials — IWton X-100 and sialidase from Arthrobacter ureafaciens 
(75 units/mg, EC 3.2.1.18) were obtained from Calbiochem. lodobead- 
3M Emphaze™ Biosupport Medium, Protein A-Sepharose CL4B, 
and Br\j-58 were from Pierce. Endo-jS-galactosidase from Escherichia 
freundii (5 units/mg, EC 3.2.1.103) was obtained from V-Labs» Inc. 
(Covington, LA). Streptomyces sp.l42 al,3/4-L-fuc08idase (EC 3.2.1.51) 
was obtained from PanVera Corp. (Madison» WI), and p-^^-acetylglu- 
cosaminidase from Jack Bean (EC 3.2.1.30, 53 units/mg) was from 
Sigma. Recombinant peptide JV-glycosidase F (EC 3.2.2.18, 25,000 
unita^mg), and endo-a-N-acetylgalactosaminidase from Diplococcus 
pneumoniae (EC 3.2.1.97, O-glycanase™) were purchased from Coen- 
zyme Corp. (Cambridge, MA). Concanavalin A-Sepharose (ConA, 10 
mg^m) resin) was from Fharmada/LKB (Uppsala, Sweden). Wheat 
germ agglutinin (WGA) agarose (7.6 mg/ml resin) and tomato lectin 
were from Vector Laboratories, Inc. (Burlingame, VT). Ibmato lectin 
was coupled to cyanogen bromide-activated Sepharose to a density of 2 
mg/ml resin in the presence of 7.5 m^ml chitotriose as described 
previously (17). 

Antibodies — The anti-human P-selectin mAbs Sl2 and Gl were pre- 
pared and characterized as described previously (18, 19). The anti- 
human E-selectin mAbs CL2 and CL37 (20) were kindly provided by C. 
Wayne Smith (Baylor College of Medicine, Houston. TX). LeuMl (CD 15, 
IgM) was purchased from Becton-Dickinson & Co. (San Jose, CA). The 
CSLElX-1 hybridoma (HB 8580) was purchased fi^m the American Type 
Culture Collection, and the IgM mAb was purified from ascites fluid by 
boric acid precipitation and gel filtration on Sepharose CL4B in PBS, 
pH 7.4, as described previously (21). Goat anti-mouse p-chain-specific 
IgG and MOPC104E (IgM) were purchased from Cappel-Organon Ibch- 
nika (Durham, NC). 

Peptide Antisera Production and Immunoprecipitations — ^Peptides 
corresponding to residues 42-56 (QATEYEYLDYDFLPEC) and 354- 
376 (GISSLLPDGGEGPSATANGGLSK) of the cDNA-derived amino 
acid sequence of PSGL-1 (16) were synthesized on an Applied Biosys- 
tems model 431 peptide synthesizer. Peptide 42—56 was coupled to 
maleimide-activated keyhole limpet hemocyanin (Pierce) through the 
added cysteine (underlined) and injected into New Zealand White rab- 
bits (Montana State University Animal Resources Center). Immune 



sera were collected and used to immunoprecipitatc ^I-PSGL-l. Protein 
A beads (25 yjX, 50% suspension) were preincubated with 1 pi of a 1;4 
dilution of either normal or immune rabbit serum for 90 min at 37 *'C. 
The beads were washed once with 0.1 m NaCl, 20 nm MOPS, pH 7.5, 
0.1% Triton X-100 (MBS/0.1% Triton). Puriaed "»I-PSGUl or '""I-WGA 
eluate was then incubated with the beads in the presence or absence of 
0,2 mg/ml of either the immunizing peptide or PSGL-1 peptide 354-376 
which served as a negative control. After a 90-min incubation at 37 '^C, 
the beads were washed five times with MBS/0.1% IViton and then 
eluted by boiling for 5 min with SDS sample buffer (22). Immunopre- 
cipitates were analyzed by SDS-PAGE (22), followed by autoradiogra- 
phy. All autoradiograms shown represent the entire gel from the stack- 
ing gel interface downward. 

Generation of CHO D' Cells Stably Expressing a Soluble Form of 
E'Selectin — An expression plasmid (ELAM-1:BBG 57, obtained from 
British Biolfechnology Products, Ltd.) containing the full-length cDNA 
for human E-selectin was used as a template in the polymerase chain 
reaction to modify the 5' and 3' ends of E-selectin for high level pro- 
duction of recombinant soluble E-selectin in CHO D^ cells. The oligo- 
nucleotide pair (5' GTC CCT CTA GAC CAC CAT GAT TGC TTC ACA 
GTTT + 5' TCC CGG TCGACTTAG GGAGCTTCACAGGT)u8edin 
the polymerase chain reaction was designed to introduce an Xbal site 
and a perfect Kozcdc sequence (23) preceding the E-selectin initiator 
codon, and to introduce a Sail site following the stop oodon introduced 
after amino add 550 located between the sixth consensus repeat and 
the transmembrane domain of full-length E-selectin. DNA sequence 
analysis verified the changes generated by the polymerase chain reac- 
tion as well as the integrity of the remainder of the E-selectin cDNA. 
The truncated E-selectin (tES) gene was then introduced as an X&al- 
Sail fragment into the mammalian expression vector pDSRa2 (Euro- 
pean patent application A20398753) that was modified to include 
unique Xbal and Sail restriction sites. The tES expression vector was 
transfected into a CHO cell line deficient in dihydrofolate reductase 
(CHO D") (24), and transfectants were selected in medium lacking 
hypoxanthine and thymidine (25). An RNase protection assay was used 
to screen for transfectants that had high levels of E-selectin-specific 
mRNA (26). 

Protein Purification — ^PSGL-1 was purified finom human neutrophil 
membranes prepared as described previously (6X with the following 
modifications. Neutrophil membranes extracted with 5% TViton X-100 
in 0.1 M NaCl, 20 mM MOPS, pH 7.5, 0.02% NaN^ were applied to 
WGA-agarose, and bound proteins were eluted with 0.5 m iV-acetylglu- 
oosamine (WGA eluate). After extensive dialysis against 0,1 M NaCl, 20 
mM MOPS, pH 7.5, 0.02% NaNg, 0.02% Bry-SB, 2 mm CaClg, 2 mM MgClj 
(equilibration buffer), the WGA eluate was loaded on a truncated P- 
selectin (tPS)-Emphaze column (10 mg lectin/ml resin). The column was 
extensively washed with equilibration buffer and eluted with 5 mM 
EDTA. ligand-containing fractions were pooled and loaded on a Mono 
Q PC 1.6/5 column equilibrated with 0.1 m NaCl, 20 mM MOPS, pH 7.5, 
2 niM EDTA, 0.02% NaNg, 0,02% Brij-58 using a SMART^Micro Sepa- 
ration System (Pharmacia/LKB). The column was developed with a 
2-ml linear gradient (0.1—1.0 m NaCl) at 50 pl/min. Aliquots of the 
fractions were iodinated with Na^^I using Iodobeads™according to the 
instructions of the manufacturer. The purity of the fractions was as- 
sessed by SDS-PAGE and autoradiography of the iodinated fractions. 

A recombinant soluble form of P-selectin (tPS) truncated after the 
ninth consensus repeat was immunoafiinity purified from conditioned 
medium of permanently transfected 293 cells as described previously 
(14). As determined by sedimentation velocity and equilibrium analysis, 
tPS is an asymmetric monomer with a molecular weight of 103,600 Da 
(14). The JSa for tPS is 12.3 (14). 

Recombinant soluble truncated E-selectin (t£S) was immunoafHnity 
purified from conditioned medium of CHO D~ cells stably secreting tES. 
Cells were cultured in equal parts of Dulbecco*s modified Eagle's me- 
dium and Ham*6 F-12 nutrient mixture, supplemented with 10% fetal 
bovine serum, 10 units/ml penicillin, and 10 mg/ml streptomycin. At 
confluence, the cells were shifted to serum-free medium, and condi- 
tioned medium was harvested every 7 days. Conditioned medium was 
clarified, sterile-filtered, using two Sartorius filters (10 and 0.2 pm) 
connected in series, and then concentrated 50-fbld (Filtron Maximate, 
10 kDa molecular mass cut off). The anti-E-selectin mAb H18/7 (27) was 
coupled to cyanogen bromide-activated Sepharose to a density of 6 
mg/ml resin. The H18/7 -Sepharose coliman was equilibrated at 4 ^C 
with PBS, pH 7.5, containing 1 mM CaCl^ and 1 mM MgCl^, and 500 ml 
of concentrated medium was loaded on the column ( V,. := 200 ml) at a 
flow rate of 10 cm/h. The column was eluted with 0.1 m glycine, pH 2.7, 
and the fractions were rapidly neutralized by addition of IVis-HCl, pH 
7.5, to a final concentration of 0.1 m. The eluate was dialyzed against 25 
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mjd sodium phosphate, pH 7.5» and loaded on a Q-High Performance 
anion-exchange column (Vj. » 200 ml, Pharmacia) equilibrated in the 
same buffer. The column waa developed with a linear gradient of NaCl 
(0.17-0.22 m). Purity was assessed by SDS-PAGE and by N-terminal 
sequence analysis. Protein concentration was determined using an 
^280 = ^^'^ that was calculated using a previously described method 
(28). The yield was 250 mg of t£S/"'60O ml of concentrated conditioned 
medium. 

Sedimentation Equilibrium Analysis ofSoiuble tES — Sedimentation 
equilibrium experiments were conducted using a four-hole titanium 
rotor, short column kel-f centerpieces, and sapphire windows in a Beck- 
man Optima XLA analytical ultraoentrifuge equipped with on-line Ray- 
leigh interference optics. Data were acquired at rotor speeds of 10,000, 
15,000, 20,000, and 25,000 revolutions/min at 20 ''C using a television- 
based camera and an on-line acquisition and anal3rsis system (29, 30). 
Samples were loaded at concentrations of approximately 1.0, 0.5, 0.25, 
and 0.13 mg/ml in PBS, pH 7.4. Data were collected at intervals after 
the estimated time to equilibrium and tested for equilibrium by sub- 
tracting successive scans (31). The data within the optical window were 
selected using the program REEDIT (kindly provided by David Yphan- 
tis). Data were analyzed to estimate the monomer molecular weight of 
tES using the NONLIN program (32). Three or more channels of sedi- 
mentation equilibrium data obtained at different loading concentration 
of tES, radial positions, and angular velocities were 6t by simultaneous 
nonlinear least-squares analysis. For molecular wei^t calculations, 
the partial specific volume of tES waa estimated to be 0.688 ml/g with 
an inaccuracy of ± 0.03 ml/g to account for uncertainty in the carbohy- 
drate composition of tES (30). 

Sedimentation Velocity Analysis of Soluble tES — Sedimentation ve- 
locity experiments were conducted in the Beckman Optima XLA ana- 
lytical ultracentrifiige using Rayleigh interference optics. Experiments 
were conducted at 60,000 revolutions/min and 20 ''C, using a four-hole 
titanium rotor, two-channel, charcoal-fLlled, epon centerpieces, and sap- 
phire windows. Sample was loaded at a concentration of 0.5 mg/ml in 
PBS, pH 7.4. Sedimentation coefficients were determined from the time 
derivative of the concentration profile as described (33). 

Protein lodinations — ^The mAbs Gl and CL2 (200 pg) were iodinated 
with 400 pCi of carrier-free Na^^l (ICN Biomedical, Inc., Costa Mesa, 
CA) using lodobeads™. Free '^I was removed by gel iiltralaon tJurough 
a Sephadex G-25 column (PD-10, Pharmacia) equilibrated in 0.1 M 
NaCl, 20 mM MOPS, pH 7.5, 0.1% TViton X-100. The labeled antibodies 
were then centrifuged for 30 min at 90,000 x ^ in a TL-100 ultracen- 
trifuge (Beckman, Palo Alto, CA). The concentration of labeled proteins 
was determined with a Micro BCA protein assay kit (Pierce) using the 
respective unlabeled proteins as standards. The specific activity of the 
labeled antibodies ranged from 0.5 to 1.0 pCi/pg protein. 

PSGL-1 ('*-0.6 pg) or WGAeluate (-100 pg) waa iodinated with 400 
pCi of Na'^I and desalted as described above. Radioabeled PSGL-1 was 
concentrated to —200 pi using a Centicon-30™ device (Pharmacia/ 
LKB), and gel filtered on a Superose 6 PC 3.2/30 column equilibrated 
with 0. 15 M NaCl, 20 mM MOPS. pH 7.5, 0.02% NaNg, 0. 1% Brij-68 using 
a SMART™Micro Sepcu'ation System. The specific activity of the radio- 
labeled ligand ranged from 14 to 30 pCi/pg protein. Labeled proteins 
were routinely >95% trichloroacetic add precipitable. 

Binding of'^J PSGL-2 to Immobilized tPS or tES-^tPS or tES (SO pi, 
10 pg^ml) in HBSS, 0.02% NaNs (HBSS/Az) were incubated overnight at 
4 in microtiter plates (Immulon I Removawell™ Strips, Pynatecb 
Laboratories, Inc., Chantally, VA). After three washes with HBSS/Az the 
wells were blocked with HBSS/Az, 1% human serum albumin (HSA) for 
2 h at 22 ""C. The wells were then washed once with HBSS/Az and 
>^I-PSGL.l diluted in HBSS/Az, 0.1% HSA was added (60 pi, 5,000- 
10,000 counta/min/well). After 1 h at 22 ''C, the wells were rapidly 
washed five times with HBSS/Az using a 12-channel plate washer 
(Nimc-Immuno Wash 12, Nunc Inc., Naperville, XL). The individual 
wells were then counted in a 7 counter. All assays were performed in 
quadruplicate. In certain assays binding of '^I-PSGL-1 to immobilized 
tPS was measured in the presence of increasing concentrations of fluid- 
phase tPS or tES. 

Binding of '^I-PSGL*! to Immobilized Anti-carbohydrate Anti- 
bodies — Goat anti-mouse p-chain specific IgG (50 pi, 10 pg/ml) in 
HBSS/Az was incubated ovemi^t at 4 ^C in Immulon I Removawell™ 
strips. The wells were washed and blocked as described above. Then, 
either CSLEX-1, LeuMl, or MOPC104E was added (50 pi, 5 pg/ml). 
Afler incubation for 1 h at 22 ^C, the wells were washed three times 
with HBSS/Az, and ^'^I-PSGI^l in HBSS/Az, 0.1% HSA (60 pi, 5,000^ 
10,000 counts/min/well) was added. After 1 h the wells were washed as 
described above, and individual wells were counted in a -y counter. All 
assays were performed in quadruplicate. 



Enzyme Digestion of ^^I-PSGL-1 — Digestions with sialidase (0,2 
unit/nU, 15 h), endo-^-galactosidase (2 unit/ml, 15 h). and al,3/4-L- 
fiicosidase (0.32 milliunit/ml, 15 h) were performed in 0. 1 m NaCl, 50 mM 
sodium acetate, pH 5.5. Digestions with PNGaseF (40 units/ml, 15 h) 
were done in 0.1 m sodium phosphate, pH 8.6. PSGL-1 was generally not 
pretreated with SDS because PNGase F had similar effects with or 
without SDS pretreatment. Sequential digestions with sialidase (0.2 
unit/ml, 2 h) followed by endo-a-A^-acetylgalactosaminidase (0.1 unit/ 
ml, 16 h) were performed in 0.1 m NaCl, 50 mM sodium acetate, pH 6.0. 
All glycosidase digestions were performed at 37 °C in the presence of 20 
pM leupeptin, 30 pM antipain, 1 mM benzamidine, and 0.02% NaNg. 

Lectin Affinity Chromatography — ^I-PSGL-1 was applied to a con- 
canavalin A (ConA)-Sepharose column iV^ =■ 1 ml) equilibrated in 0.1 m 
NaCl, 20 mM TVis, pH 7.5, 2 nui CaCl,, 0.1% Br^-58, 0.02% NaN^. The 
column was washed with five column volumes of equilibration buffer 
and eluted with 0.5 m o-methylmannoside in equilibration buffer pre- 
heated to 65 **C. Ibmato lectin chromatography was similarly per- 
formed except that the column (V7. s 1 ml) was equilibrated with 0.1 M 
NaCl, 20 mM MOPS, pH 7.5, 0.1% Brij-58, 0.02% NaNg, and step eluted 
with 20 mg/ml chitotriose in equilibration buffer. Greater than 85% of 
the counts loaded on the lectin columns were recovered. 

Site Density Determinations — ^mAbs to P-selectin (Gl) or E-selectin 
(CL2) that inhibit interactions with myeloid cells were used for site 
density measurements. Wells were coated with tPS or t£S (50 pi, 10 
pg/ml) and blocked as described above. Satiu^ting concentrations of 
>^I-labeled mAb (2 pg/ml) in HBSS/Az, 0.1% HSA were added to the 
wells and incubated for 1 h at 22 *'C. The wells were washed five times 
and then counted in a y counter. Specific binding waa defined as counts/ 
min bound to selectin-coated wells minus oounta/min bound to HSA- 
coated wells. Site densities were calculated assuming monovalent bind- 
ing of antibody at saturation. All assays were performed in 
quadruplicate. 

Neutrophil Adhesion Assay — ^Human neutrophils were isolated from 
heparinized blood irom consenting volunteer donors by dextran sedi- 
mentation, hypotonic lysis, and Ficoll-Hypaque density gradient cen- 
trifiigation as described previously (34). Quantitation of neutrophil 
adhesion to tPS or tES (50 pi, 10 pg/ml), immobilized on Immu- 
lon I Removawell'™ microtiter plates, was performed as described 
previously (19). 

RESULTS 

An Antiserum to a PSGL-l Peptide Recognizes the P-selectin 
Glycoprotein Ligand from Human Neutrophils — We wished to 
determine whether the polypeptide core of the sialomudn li- 
gand for P-selectin that we characterized in human neutrophils 
and HL-60 cells (6, 13) was immunologically related to PSGL-1, 
a glycoprotein ligand for P-selectin identified by expression 
cloning from a HL-60 cDNA library (16). We prepared anti- 
serum in rabbits against a peptide corresponding to residues 
42-56 of the cDNA-derived amino acid sequence of PSGL-1 and 
tested whether this antiserum could recognize the glycoprotein 
ligand that we isolated by P-selectin afifinity chromatography. 
Fig. lA demonstrates the radiochemical purity of the ^^I-P- 
selectin glycoprotein ligand used in this study. It exhibited a 
relative molecular mass of '^250,000 imder nonreducing con- 
ditions and 120,000 under reducing conditions. Thus, the ra- 
dioiodinated glycoprotein contained two disulfide-linked sub- 
units of 120,000, consistent with previous results using 
protein blotting or metabolic labeling (6). As noted previously, a 
small portion of the dimeric glycoprotein was resistant to re- 
duction under the conditions used. Fig. IB shows that the im- 
mune serum, but not normal rabbit serum, precipitated the 
purified *^I-P-selectin ligand. Precipitation by the antiserum 
was specific for the 42--56 peptide sequence because it was 
completely inhibited by inclusion of this peptide, but not by a 
control peptide corresponding to amino acids 354-376 of 
PSGL-1. The specificity of the antiserum was further demon- 
strated by its selective immunoprecipitation of the ligand from 
a crude mixture of radiolabeled neutrophil membrane proteins 
that were eluted from a WGA column (Fig. IC, arrows). 

We also sequenced two tryptic peptides from the P-selectin 
glycoprotein ligand purified from human neutrophils. The se- 
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Fig. 1. Immunoprecipitation of the P-selecfin glycoprotein ligand by anti-PSGirl peptide antibodies. A, PSQL-1 isolated from human 
neutrophils was radiolabeled as described under ''Experimental Procedures" and electrbphpresed iii ah SOS, 7.$% polyacrylanude gel under 
nonreducing XNR') and reducing iR) conditions. *^I-PSGL-1 was immunoprecipitated with either normal rabbit serum iNRS) or rabbit 
antiserum directed against a peptide corresponding to residues 42-56 of PSGL-1 (anti 42-56) in the presence or absence of 0.2 mg/ml of the 
immuriizirig peptide or of a control peptide corresponding to residues 35^-376 of PSGL-1. C, ^^^I-WGA eluate frbrn human neutrophil membrane 
extracts was immunoprecipitated using either normal rabbit scrum or anti 42—56 serum. The samples in B and C were electrophoresed in SDS, 
7.5% polyacryiamide gels uiider reducing conditions. All gels were then analyzed by autoradiography, DF = dye front. 



quences of these peptides, His-Met-iyr-Pro-Val-Arg and Pro- 
Gly-Lys-Thr-Pro^Olu-Proi are idientical to amino acids 340-345 
and 380-386 in the cDNA-derived sequence of PSGL-1 (data 
not shown). These data and the results from the immunopre^ 
cipitation experiments establish the identity of the polypeptide 
backbones of the P-selectin ligands studied by both groups. To 
ensure consistency in nomenclature, we will use the term 
PSGL'l to refer to this polypeptide, although the possibility 
exists that it is dUferentially glycosylated in various cell types. 

Purified '^I'PSGL'l Binds Specifically to P'Selectin—^H' 
iPSG]>l isolated &om human neutrophils bound in a time- 
dependent fashion to tPS immobilized on . microtiter wells (Pig. 
2A). Binding increased as a function of the amount of tPS coated 
on the wells (Fig. 2B ). Binding was Ca^* dependent and was 
abolished by Gl, a mAb to P-selectin that inhibits leukocyte 
adhesion to P-selectin, but not by S12, a mAb that does not block 
adhesion (Fig. 2B), In addition, 84.6 ± 2.4% (mean ± S.D., n = 
4) of the radiolabeled glycoprotein rebourid to a column of re- 
combinant, soluble P-selectin (tPS) and was eluted with buflfer 
containing EDTA. This result demonstr^Ltes that the function of 
^^I-PSGL-1 was not substantially altered 'by iodination. 

PSGL'l Contains Substituted Poly-N-acetyllactosamine — 
PSGL-1 from human myeloid ceils contains clustered, sialyl- 
ated O-linked oligosaccharides that are released from the 
polypeptide backbone by /3-elimi nation (13). To determine 
whether these O-linked glycans had simple structures, we 
treated *^I-PSGL-1 with sialidase, then with endo-a-AT-acetyl- 
galactosaminidase^ an enzyme that releases non-sialylated 
Gal01-3GalNAc core 1 disaccharides, but not more complex 
O-linked glycans, from serine or threonine residues (35). Fig. 
3A shows that treatment with sialidase slowed the electro- 
phoretic mobility of PSGL-1, confirming previous results (6). 
Subsequent addition of endo-a^AT-acetylgalactosaminidase in- 
creased the mobility of PSGL-1 only slightly more than siali- 
dase-treated PSGL-1, suggesting that very few of the O-linked 



oligosaccharides were simple structures that were susceptible 
to this enzyme. 

PSGL-1 is fucosyJated because it contains the sialylated, fu- 
cosylated tetrasaccharide antigen sLe* (13). Treatment of the 
glycoprotein with al,3/4-fucosidase, which removes fucose at- 
tached to a penultimate GlcNAc, had no effect on electro- 
phoretic mobility (Fig, 3B), although it did remove fucose resi- 
dues that were part of the Le'^ epitopes on the molecule (see Fig. 
5). We next treated the ligand with endo-0-galactosidase, which 
hydrolyzes internal pi-4 linkages between galactose and jV- 
acetylgiucosamine «]raipi-4GlcNAc) in extended unbranched 
poly-N-acetyllactosamine chains (36, 37). This enzyme slightly 
accelerated the mobility and lessened the electrophoretic hetr 
erogeneity of PSGL-1, suggesting that PSGL-1 contained some 
poly-iV-acetyllactosamine (Fig. 3B, see also Fig. 4C). 

To confirm the presence of poly-iV-acetyllactosamine on 
PSGL-1, we tested the ability of ^^^l-PSGL-l to bind to a column 
containing immobilized tomato lectin, which avidly binds to 
poly-JV-acetyllactosamine. A single poly-iV-acetyllactosamine 
chain is sufficient to confer binding of a glycoprotein to tomato 
lectin in this system (17). Greater than 95% of the PSGL-1 
bound to the column, indicating that essentially every molecule 
contained at least one poly-A^-acetyllactosamine (Table I). 
Treatment of PSGL-1 with endo-p-galactosidase modestly re- 
duced to 84% the amount of material binding to the column. 
This suggests that most if not all of the poly-iV-acetyllac- 
tosamine could be removed from a portion of the PSGL-1 mol- 
ecules by endo-0-gaiactosidase. Because substitutions of poly- 
iy-acetyllactosamine with sialic acid and/or fucose inhibit the 
efficiency of this enzyme, we pretreated PSGL-1 with sialidase 
and fucosidase before addition of endo-/3-galactosidase. IVeat- 
ment with sialidase and fucosidase had no effect on binding of 
PSGL-1 to the column, consistent with the fact that substitu- 
tions with sialic acid or fucose do not affect binding of poly-iV- 
acetyllactosamine to tomato lectin. However, subsequent addi- 
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Fig. 2. Binding of ^I-PSOI^l to immobUized tPS. A, ^^I-PSGL-l 
was incubated for varying times in microtiter wells coated with tPS (50 
10 pg^ml) and then blocked with HSA, or in control wells without tPS 
that were blocked with HSA. B, was incubated for 1 h in 

microtiter wells in which varying amounts of tPS (50 pi, 0-20 )]g/ml) 
had been immobilized. As indicated, some incubations were perfonned 
in the presence of 8 mM EDTA or 40 ng/wl of the mAbs S12 or Gl to 
P'selectin. The data represent the mean ± S.D. of quadruplicate wells 
and are representative of three independent experiments. 

tion of endo~0-galactosidase reduced to 69% the portion of the 
glycoprotein binding to the coliimn. Because tomato lectin 
binds weakly to terminal GlcNAc residues that are exposed by 
endo-^-galactosida8e treatment (38), we then added 
acetylglucosaminidase to remove these terminal residues. Only 
53% of this material bound to tomato lectin. These data clearly 
demonstrated that PSGL-l contained poly-iV-acetyllac* 
tosamine. Furthermore, at least some of these chains were 
resistant to endo*0-galacto6idase because of substitutions with 
sialic acid and/or fucose. The inability of sialidase, fucosidase, 
endo-^-galacto8ida8e, and ^-iV-acetylgIucosa^li^idase to elimi- 
nate binding of all PSGLrl molecules to tomato lectin may 
reflect enzyme resistance because of the clustered nature of 
O-linked glycans, the presence of internal fucose residues, 
and/or the presence of additional substitutions such as sulfate 
on the polyrAT-acetyllactosamine. 

PSGL'l Contains a Limited Number of Heterogeneous N- 
Linked Oligosaccharides — ^Fig. 4A shows that PNGaseF treat- 
ment slightly increased the electrophoretic mobility of 
PSGL-I, consistent with previous studies indicating that all 
molecules contained a limited number of iV-linked glycans sen- 
sitive to this enzyme (6). The cDNA-derived sequence of 
PSGL-1 indicates that the molecule contains only three poten- 
tial sites for attachment of iV-linked oligosaccharides (16). Tb 
explore possible heterogeneity of the iV-linked structures, we 
applied sham-treated and PNGaseF-treated ^^sj-PSGL-l to a 
column of ConA, a plant lectin that binds avidly to high man- 
nose iV-linked glycans, less avidly to complex biantennary and 
hybrid iV-linked oligosaccharides, and very weakly to complex 
triantennary and tetraantennary iV-linked chains (39). We 



foimd that 41.9 ^ 4.2% of the sham-treated ligand bound to 
ConA, whereas only 3.7 ± 1.0% of the PNGaseF-treated mate- 
rial bound (mean ± S.D., n = 4). Greater than 85% of the counts 
loaded on the column were recovered. 'Hiis result demonstrated 
that PNGaseF removed iV-linked glycans recognized by ConA 
from PSGL-1. 

Fig. 4B shows that the fraction of PSGL-1 not bound by ConA 
migrated slightly slower than the fraction boimd by ConA. lb 
confirm that the ConA-unbound material also contained 
linked oligosaccharides, we treated both the boxmd and un- 
bound fractions with PNGaseF (Fig. 4C). The enzyme de- 
creased the apparent molecular masses of the ConA-unboiuid 
and ConA-bound fractions by 14.5 and 13.7 kDa, respectively. 
Thus, both fractions had PNGaseF-sensitive iV- linked glycans. 
These data suggested that the ConA-unbound fraction con- 
tained triantennary and/or tetraantennary AT-linked glycans 
that ConA did not recognize. The largest linked glycan to be 
described on myeloid cells (molecular mass ^ 6,600) is a disial- 
ylated tetraantennary structure with poly-^-acetyllactosamine 
sequences on each antenna and both external and core fiioose 
(40). Therefore, the observed change in electrophoretic mobility 
after PNGaseF treatment is consistent with the removal of at 
least two and possibly three complex AT-linked glycans. Even 
after PNGaseF treatment, the CJonA-unbound material mi- 
grated slightly slower than the ConA-bound material. This may 
indicate that the ConA«unbound fraction retained a single 
linked chain that was resistant to PNGaseF. Alternatively, 
there may be other structural differences between the ConA- 
bound and -unbound fractions, of which the most likely is het- 
erogeneity in O-linked glycosylation. 

IVeatment with sialidase or endo-/3-galactosidase had simi- 
lar effects on the electrophoretic mobilities of the ConA-un- 
bound and -boimd fractions, probably because these enzjrmes 
primarily affected the abundant O-linked oligosaccharides 
(Fig. 4C). Endo-^galactosidase produced similar increases in 
the electrophoretic mobihties of PSGL-1 pretreated with 
PNGaseF (data not shown) and PSGL-1 that contained all its 
A^-linked oligosaccharides (Figs. 3B and 4C), suggesting that at 
least some of the poly-iV-acetyllactosamine was on the O-linked 
glycans. 

PSGL'l Expresses Le* and sLe* on O-Linked Poly-N-acetyl- 
lactosamine — lb determine whether PSGL-1 expresses Le* or 
sLe* on poly-iV-acetyliactosamine, we developed an assay to 
measiure the binding of ^^^I-PSGL-1 to an immobilized mono- 
clonal IgM antibody to sLe* (CSLEX-1) or Le' (LeuMl), or to an 
immobilized irrelevant IgM mAb (MOPC104E). Fig. 5 demon- 
strates that PSGL-1 bound to CSLEXl but not to MOPC104E, 
confirming previous immunoblotting data that the glycoprotein 
expressed sLe' (13). Greater than 85% of the ^'^^I-PSGL-l mol- 
eciiles bound to a CSLEX-1 affinity column, indicating that 
virtually all molecules contained sLe* (data not shown). 
PSGL-1 also bound to LeuMl, indicating that it contained Le', 
although it is possible that desialylation generated this struc- 
tiM-e during piuification of the glycoprotein (Fig. 5). Treatment 
of PSGL-1 with sialidase eliminated binding to CSLEX-1 and 
increased binding to LeuMl, consistent with the known speci- 
ficities of these antibodies. TVeatment of PSGL-1 with fucosi- 
dase had no effect on binding to CSLEX-1, in keeping with the 
inhibition of this enz3naie by terminal sialic acid (41» 42). In 
contrast, the fucosidase eliminated the binding of PSGL-1 to 
LeuMl. 

After establishing the specificity of the assay, we examined 
whether treatment of PSGL-1 with endo-/3-gaIactosidase af- 
fected the expression of Le* or sLe* (Fig. 5). This enzyme abol- 
ished binding of PSGL-1 to immobihzed LeuMl and decreased 
binding to CSLEX-1 by 64 * 14% (mean ± S.D., n = 4). These 
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Fig. 3. SDS-PAGE of glycosidase^ 
treated »*«I-PSGL-1, '-"^I-PSGL-l was 
treated with the indicated glycosidase8» 
then electrophoresed in SDS, 7.5% poly- 
acrylamide gels under reducing condi- 
tions. The gels were then analyzed by au- 
toradiography The results are represent- 
ative of four independent experiments. 
Endo-a-GalNAcase = endo-aW-acctyl- 
galactosaminidase; DF = dye front. 
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Fig. 4. Effects of glycosidase treatment on ConA-fractionated ***I-PSGL,-1. A, '^^i-PSGL-l was sham-treated or treated with PNGaseF, 
then electrophoresed in an SDS, 7,5% polyacrylamide gel under reducing conditions and analyzed by autoradiography. B, *^I-PSGL-1 was 
fractionated on ConA-Sepharose as described under "Experimental Procedures." Unfractionatcd. ConA-bound, and ConA-unbound samples were 
electrophoresed in an SDS, 7.5% polyacrylamide gel under reducing conditions and analyzed by autoradiography. C, ConA-unbound and ConA- 
bound *"I-PSGtrl was sham-treated or treated with the indicated glycosidase, then electrophoresed in an SDS, 7.5% polyacrylamide gel under 
reducing conditions and analyzed by autoradiography DF s dye front. 



data clearly indicated that poly-AT-acetyllactosamine cleaved by 
endo-p-galactosidase carried some, and perhaps ail, of the Le* 
as well as a portion of the sLe* on PSGL-1, The remaining sLe* 
structures may be on branched or substituted poly-iV-acetyllac- 
tosamine that was resistant to the enzyme or on chains that 
lacked poly-iV-acetyllactosamine. 

In contrast to the effects of endo-0-gaIactosidase» PNGaseF 
did not affect binding of PSGL-1 to immobilized LeuMl or 
CSLEX-1, indicating that few, if any, Le' or sLe' structures 
were present on PNGaseP-sensitive AT-linked oligosaccharides 
(Fig. 5). In conjunction with the other data in Fig. 5, this ob- 
servation indicated that O-linked oligosaccharides carried 
most, and perhaps all, of the Le* and sLe* structures. Further- 



more, at least some of these structures were on O-linked poly- 
N-acetyllactosamine. 

Effects of Glycosidases on Binding of PSGL-1 to P-selec- 
tin — ^We next examined the effects of the glycosidases on bind- 
ing of *^I-PSGL-1 to immobilized tPS. As shown in Fig, 6, 
treatment with sialidase, which eliminated binding to the anti- 
sLe" antibody but enhanced binding to the anti-Le' antibody, 
completely prevented binding of PSGL-1 to tPS. In contrast, 
treatment with fiicosidase, which eHminated binding to the 
anti-Le* antibody but not to the anti-sLe* antibody, had no 
effect on the interaction of PSGL-1 with tPS. These data are 
consistent with previous results from other assays that high 
afHnity binding of P-selectin to myeloid cells or to PSGL-1 
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Table I 

Effects of glycosidases on binding of ^^I-PSGL-l 
to a tomato lectin affinity column 

*^I-PSGL-1 was treated with the indicated glycosidaseCs) and applied 
to a column of Immobilized tomato lectin (V^ = I ml, 2 mg lectin/ml 
resin). After washing, the column was eluted with 20 mg/ml of chitotri- 
oso. The percentage of PSGL-1 bound was defined as the radioactivity 
eluted by chitotriose divided by the sum of the radioactivity in the 
column Oowthrough, wash, and eluate. 

Enzyme treatment Percftnt«g(i bound 

Sham >95 

Endo-/3-galactosidase 84 

Sialldase fucosidase >95 

Sialidasc -f fucosidase 69 

4- £ndo-^-gaIactosidasc 

Sialldase + fucosidase 53 

^ £ndo<^<galactosidase 

+ 0-.V-acetylglucosaminidase 

Peptide: iV-glycosidase F >95 




Fig. 5. Effects of glycosiiiases on binding of *^I-PSGLrl to im. 
mobilized anti>carbohydrate monoclonal antibodies. ^^I-PSOL-1 
was sham-treated or treated with the indicated glycosidase. Samples 
were analyzed for their ability to bind to immobilized anti-sLe' 
iCSLEX'l), anti-Le* iheuMl\ or an irrelevant control IgM mAb, 
MOPC104E. The data represent the mean ± S.D. of quadruplicate wells 
and are representative of three independent experiments. 

tOOO .i . . 




Fig. 6, jE^Efeets of glycosidases on binding of "^-PSOI/-l to im- 
mobilized tPS* '^I-PSGL-l was sham-treated or treated with the tn> 
dicated glycosidase. Samples were analyzed for their ability to bind to 
microti ter wells coated with tPS (50 pi, 10 pg/ml) and then blocked with 
HSA, or to control wells lacking tPS that were blocked with HSA. The 
data represent the mean S.D. of quadruplicate wells and are repre* 
sentative of three independent experiments. 

required sialic acid on the ligand (6, 43, 44). 

Treatment of PSGL-1 with endo-^-galactosidase resulted in 
only a modest reduction in binding to immobilized tPS (Fig. 6). 
In four independent experiments, endo•^•galactosidase re* 



duced binding by 25 ± 11% (mean ± S.D., n - 4). These data 
suggested that some, but not all, of the sialylated chains re- 
quired for P-selectin recognition were on poly-AT-acetyllac- 
tosamine that were sensitive to the enzyme. The remaining 
glycans required for recognition might be on poly-iV-acetyllac- 
tpsamine that were resistant to endo-p-galactosidase or on oli- 
gosaccharides that lacked poly-iV-acetyllactosamine. Endo-p- 
galactosidase inhibited binding of PSGL-1 to the anti-sLe"" 
antibody more than it inhibited binding to tPS. However, we 
could not accurately correlate the levels of sLe' with the inter- 
action with tPS, since we do not know how many sLe* struc- 
tures were required for PSGL-1 to bind to either the antibody or 
to tPS in these assays. 

TVeatment of PSGI^l with PNGaseF did not affect its abil- 
ity to bind to tPS (Fig. 6), supporting the conclusion that 
N-Iinked oligosaccharide played little if any role in P-selectin 
recognition. 

PSGL-1 Binds to E-selectin But with Lower Affinity Than to 
P'selectin — Both P- and E-selectin bind to human leukocytes 
and interact weakly with the tetrasaccharide, sLe* (44—48). 
Since PSGL-1 expresses sLe*, we examined whether PSGL-1 
could also bind to E-selectin. We expressed a recombinant 
soluble form of E-selectin, termed tES^ that was truncated im- 
mediately before the transmembrane domain. Sedimentation 
equilibrium measurements indicated that tES, like tPS (14), 
was monomeric, with no mass action association over the pro- 
tein concentrations examined. We determined the molecular 
mass of tES to be 85,800 Da, of which 58,400 Da was derived 
from the polj-peptide backbone and the remainder from attached 
N-linked oligosaccharides. Sedimentation velocity measure- 
ments indicated that tES was highly asymmetric. Assuming the 
molecule to be a hydrated prolate ellipsoid, it had a sedimen- 
tation coeflicient of 3.60 s and an axial ratio of 19.7. Thus tES, 
like tPS (14), was a rigid, elongated monomer. The availability 
of soluble, monomeric forms of both selectins allowed us to 
directly compare their binding aiBnities for PSGL-1. 

Fig. lA demonstrates that immobilized tES, like tPS, sup- 
ported adhesion of neutrophils, confirming its functional sta- 
tus. The degree of adhesion increased in proportion to the coat- 
ing c*oncentration of each selectin. Neutrophil adhesion to 
immobilized tES was Ca^* dependent and was inhibited by 
CL2, a mAb to E-selectin that blocks adhesion of myeloid cells 
to E-selectin, but not by CL37, a mAb to E-selectin that does 
not block adhesion (20) (data not shown). 

To examine the relative abilities of immobilized tPS and tES 
to interact with PSGL-1, we measured binding of *^I-PSGL-1 to 
each selectin coated at a concentration that promoted maximal 
neutrophil adhesion. Measurements using *^I-labeled mAbs 
indicated that the site densities of the immobilized selectins 
were identical (568 ± 29 and 561 ± 16 sites/iim* for tPS and tES, 
respectively (mean ± S.D., n = 3)). Fig. IB shows that 
PSGL-1 bound to both tPS and tES, although binding was 
much higher to tPS than to tES. In each case, binding was 
specific, as it was blocked by EDTA and by an inhibitor>' mAb 
to the selectin. The data from this nonequilibrium binding as- 
say suggested that E-selectin also recognized PSGL-1, but less 
well than did P-selectin. 

lb examine the relative affinities of tPS and tES for PSGL-1 
under equilibrium conditions, we measured the ability of fluid- 
phase tPS or tES to inhibit binding of "^si-PSGL-l to immobi- 
lized tPS. Fig. 8 shows that fluid-phase tPS inhibited binding of 
»25i.pSGL-l to immobilized tPS with an IC^o of - 0.5 pM, 
whereas fluid-phase tES inhibited binding with an ICqq that 
was estimated by extrapolation to be at least 25 \^h^, These 
results demonstrated that the affinity of PSGL-1 for tES was at 
least 50-fold lower than for tPS. 
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Ptu. 7. Interactions of tES and tPS with neutrophils or with 
***I-PSGL-1. A, neutrophils <2 x lOVwell) were allowed to adhere for 30 
min at 22 ^^C to microti ter wells coated with the indicated amounts of 
tES or tPS. The plates were then seated and centrifuged inverted for r> 
min at 200 x g. Bound neutrophils were quantitated using a mycloper- 
oxidaKe nmtxy, *^'^I*PSGL-I was incubated for 1 h in microtiter wells 
blocked with HSA or precouted with cither tPS or tES (0.5 pg/well) and 
then blocked with HSA. The samples were incubated in the presence or 
absence of 8 mM EDTA or 40 Mg/ml of the indicated mAb. The data 
represent the mean ± S.D. of quadrupUcate wells and are represent^i- 
ttve of three independent experiments. 
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Fm. 8. Inhibition of '""l-PSGL-l binding to immobilized tPS by 
fluid-phase tPS or tES. A fixed concentration of *^I.PSGI*-l was 
incubated for 1 h in microtiter wells coated with tPS (50 pi, 10 pg/ml) in 
the presence of increasing concentrations of either fluid-phase tPS or 
t£S. The diamond represents basal adhesion of ''''^I-PSCl^l to wells 
blocked with HSA but not coated with cither selectin. The data repre- 
sent the mean ± S,D. of quadruplicate wells and are representative of 
two independent experiments. 



DISCUSSION 

We previously described a P-selectin glycoprotein ligand 
from human myeloid cells that is a disulfide-linked homodimcr 
with few yV-linked glycans but many clustered, siaiylated O- 
linked oligosaccharides. The glycoprotein expresses the sLe' 
antigen, and removal of sialic acid eliminates recognition by 
P-selectin (6, 13). In this study, we have established that the 
polypeptide backbone of this ligand is identical to that of 



PSGL-1, a glycoprotein ligand for P-selectin that was recently 
described by expression cloning from an HL-60 cell cDNA li- 
brary (16). We have further characterized the structure and 
function ofPSGL-l from human neutrophils, using as.says with 
the purified, radioiodinated glycoprotein. Our results suggest 
that this molecule presents an array of siaiylated and fucosyl- 
ated, O-linked poly-A^-acetyllactosamine that creates high af- 
finity binding structures for P-sclectin, but not for E-selectin, 

Although the great majority of the oligosaccharides on 
PSGL-1 are O-linked, there are a few A'-linked glycans which 
we detemiined to he heterogeneous. All PSGL-l molecules ccm- 
tained PNGaseF-sensitive A^-linked oligosaccharides, but only 
42% of the glycoprotein molecules bound to ConA, a lectin that 
binds well to high mannose N-linked glycans and less well to 
complex biantennary and hybrid N-linked chain.s. Based on the 
reductions in apparent molecular weight observed in SDS gels. 
PNGaseF removed at least two of the three po.ssible /V-linked 
glycans (16) from both the ConA-lxmnd and -unbound fractions 
of PSGL-1- One chain on the ConA-unbound fraction of PSGl-^1 
may be resistant to PNGa.seF, since the ConA-unl>ound mate- 
rial still migrated slightly slower in SDS gels than the ConA- 
bound material following treatment with PNGaseF. Alterna- 
tively, PNGaseF may have removed all the A'-linked chains 
from both the ConA-bound and -unbound fractions, in which 
case the mobility difference may reflect heterogeneous O-linked 
glycosylation. The PNGaseF-sensitive A^-linked glycans carried 
little or no Le* and sLe* and did not contribute measurably to 
the binding interaction with P-selectin. These results strongly 
suggest that the O-linked glycans display the relevant struc- 
tures that P-selectin recognizes. It is highly improbable that a 
single PNGaseF-resistant A?-l inked glycan on a subpopulation 
of PSGL-1 is responsible for P-selectin recognition. 

Because at least some of the O-linkcd oligosaccharides on 
PSGL-1 carry (a2-3)siaHc acid and (al™3)fucose, they must 
have relatively large, branched structures. In agreement with 
this prediction, we found that PSGL-1 carries few of the Ser/ 
Thr-Hnked Gal/3 l-3GalNAc core 1 disaccharides that endo-or- 
N-acetylgalactosaminidase cleaves following removal of termi- 
nal sialic acids. These data are consistent with previous 
observations that less than 10% of the ^H-lalxjled O-linked 
oligosaccharides released by ^-elimination from PSGL-1 were 
small as assessed by gel filtration (13). 

The O-linked glycans on PSGL«l include a heterogeneous 
group of poly-N-acetyJIactosamine, As measured by mobility on 
SDS gels and by binding to tomato lectin, endo*P-galactosidase 
cleaved .some of the poly-A^-acetyllactosamine at the internal 
pi-4 linkage between the Gal and GlcNAc. Treatment with this 
enzyme also reduced the amount of Le* and six**, indicating 
that some of these .structures were on poly-N-acetylhic- 
tosamine. As measured by binding to tomato lectin, endo-3- 
galactosidasc cleaved other poly-fV-acetyllactosamine only af- 
ter treatment with sialidase and fucosidase, demonstrating 
that they had sialic acid and fucose moieties at or near their 
terminus that prevented access to endo-(3-galactosidase. Be- 
cause a significant portion of the PSGL-1 molecules still bound 
to tomato lectin after treatment with all three cnz>Tncs, there 
may be other O-linkcd poly-A'-acetyllactosamine chains that 
remained resistant to endo-p-galactosidase because of branch- 
ing or additional internal substitutions. Such structures might 
be responsible, at least in part, for the ability of PSGI..-1 to hind 
to immobilized tPS afler digestion with endo-0-galactosidase. 

Previous studies have shown that human myeloid cells ex- 
press poly-N-acetyllactosamine on some O-linked glycans. 
These structures occur almost exclusively on the branch at- 
tached to the C-6 of GalNAc as part of a core 2 O-linked oligo- 
saccharide (49, 50). Of the --SO O-linked glycans on the major 
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sialomudn of human myeloid cells, leukosialin (CD43), only ^-galactosaminidase, and concluded that O-linked oligosaccha- 

one or two contain poly-iV-acetyllactosamine, and approxi- rides were important for recognition by P-selectin. Based on the 

mately half of these have a terminal sLe' moiety (42), as shown data presented here, this conclusion appears to be correct, 

in the following structure (Structure 1): 
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Structure 1 



In contrast, PSGL-1 from neutrophils appears to contain a 
higher percentage of core 2, poly-iST-acetyllactosamine. Direct 
stnictixral analysis will be required to determine the specific 
monosaccharides, linkages » and possible modifications such as 
sulfation, that constitute the O-linked glycans of PSGL-1. How- 
ever, the current data suggest that some of these glycans may 
differ from those in leukosialin, implying that humian myeloid 
cells can differentially glycosylate the polypeptide backbones of 
two distinct sialomucins. This differential glycosylation is func- 
tionally important, because P-selectin binds with high affinity 
to PSGLrl, but not to leukosialin (6). 

The selectins require sialylation and fucosylation of most 
carbohydrate ligands for recognition (1-^). Similarly, PSGLi-l 
requires sialic acid (6) and fucose (16) to interact with P-selec- 
tin. Although PSGIrl expresses the terminal sLe* tetrasaccha- 
ride, it is not clear whether it requires this structure per se for 
recognition. A simple model is that PSGLi-l presents many 
terminal sLe^ moieties that increase the avidity with which it 
interacts with P-selectin. However, previous observations indi- 
cate that high concentrations of cell siirface sLe* are not suffi- 
cient to confer optimal interactiona with P-selectin (13, 44). It is 
more likely that additional modifications on each oligosaccha- 
ride chain are essential for optimal recognition. Specific fea- 
tures of several oligosaccharides in proximity may also present 
a clustered saccharide patch that promotes high affinity bind- 
ing to P-selectin (13). 

Sako et al, (16) generated a recombinant form of PSGL-l that 
boimd to P-selectin when expressed in COS cells co-transfected 
with an al— 3/4-fucosyltransferase, but not in COS cells lacking 
the fucosyltransferase. The relative moleciilar mass of the re- 
combinant molecule was similar to that of PSGL-1 in htunan 
myeloid HL-60 cells, and the authors apparently concluded 
that the structure and function of recombinant PSGLrl were 
equivalent to those of the native molecule. However, the spe- 
cific oligosaccharide structures attached to native PSGL-1 and 
the glycosyltransferases required to construct these structm-es 
are unknown. Moreover, the al-3/4-fuco8yltransfera8e ex- 
pressed in the COS cells to generate recombinant PSGL-1 dif- 
fers from the fucosyltransferases in human myeloid cells (51, 
52). Thus, it is not clear that the glycosylation, and hence the 
function, of recombinant PSGL-1 is identical to that of the 
native glycoprotein in human myeloid cells. Indeed, following 
sialidase digestion of recombinant PSGL-1, addition of endo-a- 
iV-galactosaminidase caused a substantial increase in mobility 
of the glycoprotein (16), indicating that recombinant PSGL-1, 
in contrast to the native glycoprotein, contains many simple 
O-linked core 1 GaI^l-3GalNAc disaccharides that are suscep- 
tible to this enzyme. 

Sako et al. (16) fovmd that recombinant PSGL-1 no longer 
bound to P-selectin after treatment with sialidase and endo-a- 



However, the experiment of Sako et al, cannot be used to sup- 
port a role for O-linked glycans in recognition. Prolonged di- 
gestion with siahdase is required to eliminate binding to native 
PSGL-1 (6), and the conditions for sialidase digestion of recom- 
binant PSGL-1 used by Sako et al, reduced, but did not elimi- 
nate, binding of P-selectin. The loss of P-selectin binding after 
addition of endo-a-i^-galactosaminidase, which releases only 
non*sialylated core 1 Gal01--3GalNAc disaccharides, was more 
likely due to the previously observed contamination of this 
enz3mie with sialidase, which would remove the remaining 
sialic acids required for recognition (6). 

Sakoe^ aL (16) also suggested that theiV-linked oligosaccha- 
rides contributed to binding to P-selectin because recombinant 
P-selectin did not reprecipitate all of the PNGaseF-treated re- 
combinant PSGL-1. However, these authors did not demon- 
strate whether the reprecipitation assay was quantitative by 
testing whether additional precipitation steps increased recov- 
ery of PSGL-1. The same group also concluded that iV-linked 
glycosylation was important for P-selectin recognition because 
they observed that myeloid HL-60 cells treated with tunicamy- 
cin, an inhibitor of iV-linked glycosylation, did not adhere to 
P-selectin-expressing cells (53). However, tunicamycin exerts 
indirect effects on cells, such as inhibiting the exit of some 
proteins from the endoplasmic reticulum, that make it difficult 
to interpret its effects on cell adhesion. We find no evidence 
that the iV-linked glycans of PSGL-1 from human neutrophils 
play a role in recognition by P-selectin (this paper and Ref. 6). 

E-selectin binds to small oligosaccharides containing sLe' as 
well as or better than P-selectin (54). Virtually all of the 
PSGL-1 molecules displayed sLe*, as measured by their ability 
to bind to a CSLEX-1 mAb affinity column. If terminal sLe' 
itself is the key recognition element, then E-selectin should 
bind to PSGL-1 as well as or even better than P-selectin. How- 
ever, our results indicated that E-selectin bound to PSGLrl 
with at least 50-fold lower affinity than did P-selectin. This 
finding demonstrates that E-selectin, like P-selectin, requires 
more them mere presentation of sLe* to bind with high affinity. 
It is possible that a small subset of the PSGL-1 molecules 
studied bound as well to E-selectin as to P-selectin, accounting 
for the overall weak binding. If so, the remainder of the PSGL-1 
would have even less, if any, affinity for E-selectin. 

Sako et al. (16) observed that recombinant PSGL-1 supported 
adhesion of transfected COS ceUs to cells expressing E-selectin 
as well as to cells expressing P-selectin, although the site den- 
sities of the adhesion molecules on the cells were not reported. 
Recently, Lenter et al. (55) reported that a molecule in murine 
neutrophils that appears to represent PSGL-1 was precipitated 
by both bivalent E-selectin and P-selectin Ig chimeras when 
coupled to protein A beads. Neither of these assays is sensitive 
to differences in the equihbrium affinities of E- and P-selectin 
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for PSGL-1. One interpretation of these studies is that PSGL-1 
interacted with lower affinity with E*selectin than with P-se- 
lectin, but was still able to mediate static adhesion of PSGLi-l- 
transfected COS cells to E-selectin-ezpressing cells or to bind to 
E-selectin immobilized on beads tmder the conditions used. 
Another interpretation of the studies of Sako et al, is that 
recombinant PSGL-1 is glycosylated significantly differently 
than the native glycoprotein, changing, and perhaps even re- 
versing, the relative affinities of the molecule for E- and P- 
. electin. In either scenario, a difference in the affinities of 
I'SGLrl for E- and P-selectin might not be detectable in static 
cell adhesion experiments, particularly if the cells expressed 
high levels of PSGL-1 and the selectins. Whatever the affinity 
differences, it remains to be determined whether PSGL-1 on 
neutrophils forms physiologically relevant interactions with E- 
or P-selectin under the shear forces of the venular circulation. 

In conclusion, our data indicate that PSGL-1 from human 
neutrophils displays complex, sialylated and fucosylated 
O-linked oligosaccharides, at least some of which appear to 
contain poly-iV-acetyllactosamine with varying degrees of sub- 
stitution. Presentation of these structures allows high affinity 
binding to P-selectin, but not to E-selectin. The sequences of 
these oligosaccharides, as well as their points of attachment to 
the polypeptide backbone, must be determined to understand 
the structural basis for the specific high affinity interaction of 
PSGL-1 with P-selectin. Comparison of the structure and func- 
tion of recombinant PSGL-1 with that of the native glycopro- 
tein in human myeloid cells will determine the extent of their 
similarities. 
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A 

mf H bstract. The interaction of substance P with 
human blood T4ymphocytes» which stimulates T-lym- 
phocyte proliferation, was quantified by both flow 
cytometric and direct binding assays. Fluorescence- 
detection flow cytometry recorded the binding of di- 
chlorotriazinylamino-fluorescein-labeled substance P to 
21±10% (mean±SD, n = 6) and 35±8% (n = 2) of hu- 
man blood T-lymphocytes before and after stimulation 
with 10 fig/m\ of phytohemagglutinin, respectively. The 
suppressor-cytotoxic (leu 2a) and helper-induoer (leu 3a) 
subsets identified by phycoerythrin-labeled monoclonal 
antibodies contained substance P-reactive T-lymphocytes 
at respective mean frequencies of 10 and 18%. 
[^H]substance P bound rapidly and reveisibly to a mean 
of 7035±2850 sites/T-lymphocyte, which exhibited a 
dissociation constant (Kd) of l.85±0.70 X 10"^ M 
(mean±SD, n = 5), [D-Pro^D-Phe^D-Trp*]substance P 
inhibited the binding of dichlorotriazinylamino-fluores- 
cein-labeled substance P and[^H]sub5tance P to T-lym- 
phocytes at concentrations that suppressed the prolifer- 
ative response to substance P. Substance P (4-11), 
eledoisin, and substance K (a-neurokinin), which all 
share with substance P the carboxy-terminal substituent 
-Gly-Leu-Met-NH2, were more potent than substance P 
(1-4) in inhibiting the binding of [^H]substance P to T- 
lymphocytes, suggesting the importance of this sequence 
in the interaction. Purified human blood B-lymphocytes, 
monocytes, polymorphonuclear leukocytes, and platelets, 
and cultured Hut 78 cutaneous lymphoma T-cells, Jurkat 
cells, Molt-4 lymphoblasts, and HL-60 and U-937 
monocyte-like cells all showed only minimal specific 
binding of ['H]substance P. The recognition of substance 
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P by T-lymphocytes provides one mechanism for selec- 
tive modulation of immunity by sensory nerves. 

Introduction 

Modulation of immunologic responses by elements of the 
central and peripheral nervous system appeais to be attributable 
in part to the bidirectional effects of neuropeptides on distinct 
functions of lymphocytes. The resulu of in vitro studies of 
lymphocyte proliferation have shown that the responses to 
mitogens are suppressed by somatostatin (I, 2) and enhanced 
lyy substance P (SP)' (3) and /^endorphin (4). That different 
aspects of neuropeptide structure deteraiine specifically the 
net effect on other lymphocyte functions was suggested by the 
capacity of a-endoiphin but not j9-endorphin to inhibit T- 
lymphocyte-dependent antibody production (5), and of methi- 
onine-enkephalin and Endorphin but not morphine or a- 
endorphin to enhance the natural killer activity of lymphocytes 
(6). The interactions of neuropeptides with lymphocytes have 
been defined by direct analyses of the binding of methionine- 
enkephalin to T-lymphocytes (7) and of vasoactive intestinal 
polypeptide to both Molt-4 lymphoblasts bxmA T-lymphocytes 
(8, 9). 

SP is an undecapeptide of amino add sequence Aig-Pn>- 
Lys-Pro-Gln-Gln-Phe-Phe<jly-Lcu-Met-NH2, which has been 
identified in the central and peripheral nervous system and 
intestinal tract (10-13) and implicated in the mediation of 
hypersensitivity reactions by the detection of elevated levels of 
SP in sensory nerves supplying localized sites of chronic 
inflammation (14. 15). SP elicits or enhances functional re- 
sponses of human mast cells, polymorphonuclear (PMN) 
leukocytes, T-lymphocytes, and guinea pig peritoneal macro- 
phages at concentrations ranging from 10"** to 10"' M (3, 16- 
19). Furthermore, the stimulation of T-lymphocyte proliferation 
by SP was inhibited specifically by the otherwise immunolog- 
ically inactive analogue (I>Pro^.I>Phe\D-Trp'lSP (3. 20). The 



I. Abbreviations used in this paper: DTAF, dichlorotriazinylamino 
fluorescein; FACS, fluoresoenoe-activated cell sorter, LTB4, leukotriene 
B4; M199-HPS and RPMI-HPS. medium 199 containing Hepes (25 
mM, pH 7.4). penicillin (ICQ U/mlX and streptomycin (100 ng/ml); 
PE, phycoerythrin; PHA. phytohemagglutin-M: SP, substance P; SP*. 
substance P coupled to DTAF. 
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preparatioii and purification of fluorescein-labeled substance 
P (SP*) now has pennittod an evaluation by fluorescence- 
detection flow cytometry of the recognition of SP by a small 
subset of human blood T-lymphocytes, which manifest a 
specificity for SP* and ['H]SP similar to that observed initially 
in studies of the effects of SP on T-lymphocyte function (3). 
Furthermore, the concurrent aj^lication of SP* and of phy- 
coerythrin-labeled monoclonal antibodies specific for antigenic 
determinants on Ainctionally distinct subsets of T-lymphocytes 
now indicate that the SP* reactive T-lymi^ocytes are distributed 
in both the suppressor-cytotoxic and belper-inducer subsets. 

Rtothods 

Medium 199. RPMI-1640, sheep erythrocytes (Microbiological Asso- 
ciates Bioproducts, WallcereviUe, MDX 6 g% macromolecular dextran- 
70 in 0.15 M saline (MacrodexX FiooU-Hypaque (Phannada Fine 
Chemicals, Inc., Piscataway, NJ), 4-<2-hydn>xy-ethy])-l-pipeiBzine eth- 
anesulfonic acid (HqxsX penicillin (1,000 U/ml), streptomycin (1,000 
Mg/ml; Giboo Laboratories, Grand Island, NY), pbytchcmasglutin-M 
(PHA) (Difco Laboratories, Inc., Detroit, MI), n-butyl phthalate (Fisher 
Scientific Co., Pittsburgh, PAX dinonyl phthalate (ICN Phannaoeuticals, 
Inc., Plain view, NY), synthetic SP, SP (1>4X SP (4-1 1), eledoisin, a- 
neurokinin (substance K), [I>Pro',D-Phe^J>-Trp']SP, somatostatin 
(UI4) (Peninsula Laboratories, Inc., Belmont, CAX [^HISP (25-53 O/ 
mmol) (New England Nuclear, Boston, MA), 1.5 ml conical polypro- 
pylene tubes (Saistedt, Inc., Princeton, NJ), silica gel H of 250 /tm 
thidmess on 20 X 20<m plates (Anahech, Ina, Newaric, DEX phy- 
coerythrin (PE)-oonjugated monoclonal mouse antibodies to leu-ia 
and leu-2a (Becton-Diddnson, Inc., Mountainview, CA), diciilorotri- 
azinylaminofluoresoein (DTAF) (Research Orsanics, Inc., Cleveland 
OHX oiKanic solvents which bad been redistilled from glass (Burdick 
and Jadcson Laboratories, Inc., Muskegon, MIX reagents for determining 
the amino add composition of pcriypeptides (Beckman Instruments, 
Inc., Palo Alto, CA, and Pierce Chemical Co., Rockford, ILX htunan 
promydocytic leukemia HL-60 and Jurkat cells (Dr. J. Stobo, University 
of California at San Frandsoo), cultures of Moh-4 lymphoblasts, U- 
937 monocyte-like cells. Hut 78 T-cells (American Type Culture 
Collection, Rockville, MD), and 1,25-dihydroxy vitamin Pi 
[l,25(OH)2D)l (Hoflfaum-La Roche, Nutley, NJ) were obtained from 
the designated supplierSb 

Isolation tf human T- and B-tymphocytes, PMN leukocytes, mono- 
cytes, and pUaelets. Mixed leukocytes from beparin-anti-ooagulated 
blood of noimal sulgects were oentrifusBd on FiooU-Hypaque cushions 
to resolve mononuclear leukocytes from PMN leukocytes. The PMN 
leukocytes in the pellet were resuspended in 1 ml of AB-positive 
human serum that was diluted to 10 ml with distiUed water at room 
temperature to lyse contaminating erythrocytes. After 20 sec, 40 ml of 
Medium 199 containing 25 mM Hc^ (pH 7.4X 100 U/ml of 
penicillin, and 100 ^ml of streptomydn (M199-HPS) was added, 
and the PMN leukocytes were reooveied by centiifugation and washed 
twice with M199-HPS; the purity of the PMN leukocytes was >95% 
(2 IX The mononuclear leukocytes at the buffer and Fiooll-Hypaque 
inter&oe were washed twice in MI99-HPS and then were incubated 
with neuraminidase-trBated fieah sheep erythrocytes to achieve rosetting 
of the T-tymphocytes (2IX The mixtures were oentiifuged on FtooU- 
Hypaque cushions to separate the T>lymphocyte rosettes from less 
dense nonrosetting monocytes and B-lymphocytes. The erythrocytes 
were lysed by hypotonic exposure, and the T-lymphocytes were washed 



and resuspended in MI99-HPS. The purity of the T-lympbocytes was 
>95%. with <3% monocytes detected by a nonspeci fic esterase stain, 
and T-lymphocyte viability was ahvays >97%, as determined by the 
exdusion of trypan blue dye (21). 

The nonrosetting mixture of monocytes and B-lymphocytes was 
resuspended at a concentration of I X lO^/ml of M199-HPS with 10% 
(vcd:v(4) human AB serum and incubated in 75 cm' plastic tissue 
cultiue flasks for 1 h at 37''C in 5% C02:95% air to remove the 
adherent monocytes. The contents of the flasks were decanted; the 
nonadherent B-lymphocytes were washed and resuspended in M199- 
HPS and the incubation in plastic flasks repeated to remove residual 
adherent monocytes. The purity of the B-lympbocytes ahvays was 
>90%, with <5% T-lymphocytes and 8% monocytes; B-Iymphocyte 
viability was >94%, as determined by the exdusion of trypan blue dye 
(21X Monocytes were obtained by scraping the plastic 75 cm' flasks 
with a rubber spatula and washing the detached oeDs in M199-HPS. 
The purity of the monocytes was always >85% as assrssed by a 
nonspecific esterase stain, aiid monocyte viability was always >5N)%. 

Human platdets from normal sut^iects who had not taken aspirin 
or other platelet-active medication for at least 7 d were isolated from 
dtratP-antiooBgiitotfd venous blood and washed three times on cushions 
of autologous erythrocytes, as described (22). The washed platdets 
were freed of erythrocytes by centrilugation at 10 g for 5 min and 
were resuspended in MI99-HPS (pH 7.4) containing 0.1% (wt'vol) 
hunum serum albumin. 

Maintenance cf Molt-4. U'937, Hut 78, Jurkat, and HL^O cell 
cultures. Cells were cultured at a density of ««5 X lO^/ml in 75 cm' 
plastic flasks in RPMI-1640 with L-glutamine, 15% (vol: vol) heat- 
inactivated fetaKcalf serum, 25 mM Hepes (pH 7.4X penicillin (l(X) 
U/mlX and streptomycin (100 iig/ml) (RPMI-HPS) at 37'C in 5% 
CO!2:95% air. The cultures were divided every 48-72 h and the cells 
used within 2 d of a subdivision. The cells were washed twice and 
resuspended in M199-HPS; viability was always >95%, as determined 
by the exclusion of trypan blue dye (21X 

T-Iymphocytes were cultured in RPMI-HPS for 48 h with and 
without 10 M8/ml of PHA and the dqsree of proliferation was quantified 
with a 0.2 ml portion of each su^>ension. The T-lymphocytes were 
diluted to 1 X 10^/ml and transferred to wells of microtiter plates 
containing 1 tiCi of ('H]thymidine (New England Nudear) and the 
incubation continued for 8 h at 37 "C in 5% C02:95% air. The uptake 
of I^lthynudine was quantified by trapping and washing the T- 
lymphocytes on gtass fiber fiheis in a PHD oeQ harvester (Cambridge 
Technology. Inc., CambridBB, MAX and then counting the radioactivity 
as described (23X In two separate eaveriments^ the uptake of 
I'H]thynudine by the unstimulated T-lymphocytes was 253±67 
cpm (mcan±SD) and by the PHA-stimulated T-lymphocytes was 
1 7,64 1 ±2345 cpm. 

Induction of differentiation of U-937 and HLr60 cells by 
I,25(OH)iDj, U-937 and HL-60 cells were stimulated to difierentiate 
into monocyte-like cells by l,25(OH)3D) as described (24, 25) by 
incubation of suspensions of 5 X lO'/ml of RPMI-HPS in 75 cm' 
tissue culture flasks with 10~^ M \,25(OHyOy at 37''C in 5% C02:95% 
air for 72 h. The contents of the flasks were decanted and the adherent 
differentiated cells removed with a rubber spatula were washed and 
resuspended in M 199-HPS; vialnlity of the adherent <lifeentiated oeUs 
always exceeded 85%, as awciwrrt by the exdusion of trypan blue. In 
two successive experiments, the extent of diffinentiation into monocytes 
was 65±I0% (mean±SD) and 80±5% for the HL^ and U-937 cells, 
respectivdy, as hcttsw^ by the increased percentage of adherent cells 
and the percentage which expressed nonspecific esterase activity (25). 
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Preparation of DTAFSP fSP*). To prepare fluorescent SP, 2.5- 
5.0 mg of DTAF and 0.5-1.0 mg of SP were reacted in 0.4-O.8 ml of 
0.2 M sodium carttonate-bufiered 0.15 M NaQ (pH 9.0) for 2 h at 
37**C. The mixture was ai^lied to one end of a 20 X 20<m i^te of 
250 fitn thick silica gel H that was developed in a sealed chamber with 
cfaloiofoiinrmethanoUglacial acetic add (15:5:1, vol:vol:voi). Unreacted 
DTAF and SP, which were detected by ultiaviolet tight-induced 
fluorescence and ninhydrin staining, respectively, migrated as sepnrate 
spots with Rf values of 0.91 and 0.05, respectively. SP coupled to 
DTAF (SP*) migrated as a single spot with an R«r of 0.23. The SP*- 
containing silica gd was scraped from the plate and eluted with four 
portions of 1 ml of methanol. The eluate then was dried with a 
continuous stream of Nj and resiispended in 0.4-0.8 ml of Ml 99- 
HPS. To verily the amino acid composition of the SP*, replicate 
portions were lyophilized in 9 X 150'mm glass test tubes, resuspended 
in 0.3 ml of constant boiling (5.7 M) HQ, and hydrolyzed as described 
(26). The amino adds were quantified with a Dumim D500 analyzer, 
utilizing norleudne as an internal standard (Dionex Co., Sunnyvale, 
CA) (26). For two difierent preparations, the relative amino add 
composition of the SP* was Lys 0.91 (1), Aig 0.99 (1), Gin 1.93 (2), 
Pro 2.1 1 (2), Gly 0.94 (1), Met 0.97 (1), Leu 1.02 (1). and Phe 2.2 (2), 
as comp a red with that expected for the sequence Af^-Pro-Lys-Pro- 
Gln-Gln-Phe-Ph^Gly-Leu-Met-NHj and indicated in the paientbeses 
after each value. 

Assessment of the interaction of SP-DTAF (SP*) with T-lymphocytes 
and other celb. In each experiment, 1 X 10^ T-lymphocytes, monocytes, 
PMN leukocytes, or cultured cells in 0.2 ml M 199-HPS were incubated 
for 40 min at 4'C with 1-3 X 10"' M SP* or unconjugated DTAF. 
The labeled T-lymphocytes were washed and resuspended in 1 ml of 
M 199-HPS at 4''C immediately before introduction into a Becton- 
Dickinson fluorescence-activated cell sorter (FACS IV) equipped with 
a 2 W Aigon laser (Spectra Physics, Mountainview, CA) operating at 
400 mW that exdted at 488 nnu The emitted light was passed through 
a long-pass filter and analyzed at 515 nm. Fluorescence intensity was 
measured on a scale of logarithmic ampUtude which was calibrated so 
that a change of 57 channels was the equivalent of a lO-foM increase 
in fluorescence intensity. For each measurement of fluorescence, 2 
X lO' T-lymphocytes were counted. The interaction of SP* with the 
T-lymphocytes was analyzed in terms of either two parameters (relative 
cell number, and relative fluorescence intensity) or three parameters 
(forward light scatter, relative cell number, and relative fluorescence 
intensity). 

The interaction of SP with a specific subset of T-lymphocytes was 
quantified by dual color FACS analysis after sequential exposure of 
mixed T-lymphocytes to PE-labeled monodonal leu-3a and leu-2a 
antibodies, which identify the hdper-induoer and suppressor-cytotoxic 
subsets, respectivdy, and to SP coupled to DTAF (SP*). 1 X 10' T- 
lymphocytes in 0.2 ml of M 199-HPS were incubated at 4*C for 45 
min with 0.05 ml of either ]eu-3a or Ieu-2a bearing PE. The subset-, 
labeled T-Iymphocytcs then were washed once and incubated with SP* 
as described above. Dual color FACS analysis was carried out with a 
single 2 W Aigon laser operating at 400 mW. Ruorescein staining was 
exdted at 488 nm and the emitted light was passed through a long- 
pass filter and analyzed at 515 nm, while PE staining exdted at 488 
nm and was analyzed with a 500 nm beam splitter and two six-cavity 
band-pass filters (530/30 and 575/25) (Becton Dickinson, Inc.). 

Measurement ef the binding of {'H^ubstance P to blood T- 
lymphocytes and other celh. In each experiment, a concentration of 
[*H]SP ranging finom 3 to 30 nM without and with difierent conoen- 
trations of SP, [I>Pro',D-Phe\D-Trp^P, SP substituent peptides. 



substance K, eledoisin, .or somatostatin was incubated with duplicate 
suspensions of 1 X 10' T-lymphocytes or other cells in a final volume 
of 0.3 ml for 40 min at 4^C. The amount of bound radioactivity was 
determined by sedimenting the T^ymphocytes in each suspension 
through a 0.3 ml layer of jdithalate oils (27) in a 1.5 ml conical 
polypropylene tube that was oentrifugied for 30 s at 9000 ; in a 
Beckinan microfuge B (Beckman Instruments, Inc.). The tip of the 
polypiopylene tube containing the T-lympbocyte peDet was cut off 
with a razor blade, and the contents of the tip were resuspended with 
a pasteur pipette in 3 ml of Hydrofluor for the quantification of 
radioactivity in the pellet 

The total number of moles of SP bound to the T-lympbocytes was 
determined by dividing the cpm bound to the cells by the spedhc 
activity of ['H]SP. The amount of radioactivity bound in the presence 
of 3.0 X 10"* to 1.0 X 10~* M nonradioactive SP was divided by the 
same value for the spedfic activity of [^H]SP to determine the level of 
nonspedfic binding of SP. The number of moles of SP spedfically 
bound to the T-lymphocytes was calculated by subtracting the nonspe- 
dfic binding from the total binding. 



Results 

Characteristics of the interaction of SP-DTAF (SP*) with 
human blood T-lymphocytes, T-lymphocytes were labeled with 
SP* and the distribution of labding analyzed with a FACS to 
assess whether SP is recognized by all of the T-lymphocytes 
or only a limited subpopulation. The flow cytometric analyses 
demonstrated a small peak of positively fluoxesoent cells (Fig. 
: 1), which represents the T-lymi^ocytes that bind SP*. In six 
consecutive experiments with T-lymphocytes purified from 
different normal subjects, the number of cells labeled with SP* 
accounted for 2I±10% (mean±SD) of the total T-lymphocytes. 
Id contrast, DTAF alone failed to reveal a spedfically staining 
population of cells. The spedfidty of labding of T-lymphocytes 
with SP* was examined using a similar experimental design, 
but including replicate su^nsions incubated with a mixture 
of SP* and 10~* M (D-Pro*.I>Phc',I>Trp']SP, which prevents 
the enhancement of T-lymphocyte function by SP (3). In the 




Figure I. Flow cytometric analysis of the trinding of SP* to human 
T-lymphocytes. I X 10^ T-lymphocytes in 0.2 ml of M 199-HPS were 
incubated for 40 min at 4**C with 1-3 X I0~^ M SP*, washed, and 
analyzed in a FACS IV. Fluorescence intensity is depicted on a log 
amplitude scale. The subset of T-lymphocytes reactive with SP is 
de^gnated SPt. 
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absence of tD-Pro*,I>Phe',D-Trp^SP, a positively fluorescent 
peak of T-lymi^ocytes labeled with SP* is detected, representing 
15% of the total T-lymphocytes (Fig. 2 A), The peptide 
antagonist of the effects of SP on T-lymphocytes eliminated 
completely any labeling of T-lymphocytes with SP* (Fig. 2 B). 

The binding of SP* to T-lymphocytes, which had been 
stimulated by PHA to proliferate, was evaluated in order to 
examine the responsiveness of the SP^reactive subset and the 
effects of expansion of the subset on the specificity of binding 
of SP. At the end of the period of exposure to PHA, T- 
lymphocyte viability was 85±S% (mean±SD, n » 2) and 81 ±7% 
for the iiii<ifti|inifatf^ and PHA-stimulated T-lymphocytes, 
req;)ectively, as assessed by the exclusion of trypan blue. The 
labeling of PHA-stimulated T-lymphocytes with SP* increased 
to 35±8% (mean±SD, n » 2), as compared with 23±6% for 
unstimulated T-lymphocytes from the same donors. The iMnd- 
ing of SP* by PHA-stimulated T-lymphocytes was revened 
completely by 10"* M [D^Pro'.D-PhcM^Trp'lSP, as with 
unstimulated T-lymphocytes. In two consecutive experiments 
with SP*, PMN leukocytes, monocytes, Molt<4 lymphoblasts. 
Hut 78 cells, and Jufkat cells failed to develop specific fluores- 
cence, indicating a lack of recognition of SP. 

Dual immunofluoresoent staining of T-lymphocytes with 
SP* and subset-specific monoclonal antibodies conjugated with 
PE demonstrated SP*-reactive T-lymphocytes in both subsets 
(Figs. 3 and 4). In the suppressor-cytotoxic subset identified 
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RELATIVE FLUORESCENCE INTENSITY 

Figure 2. Flow cytometric analysis of the specificity of binding of 
SP* to human T-lymphocytes. 1 X 10^ T-lymphocytes were incu- 
bated with SP* as described in Fig. L In {A\ the binding of 1 X 10'* 
M SP* was subjected to two parameter analysis; in T-lympho- 
cytes were incubated with both 1 X 10~* M SP* and 10~' M ID- 
Pro* JD-Phc',D-Trp')SP and an identical analysis performed. With the 
addition of [D-Pro*,DPhe^J>-Trp^SP, the small positively staining 
peak in ^ is no longer present in B. 




RELATIVE FLUORESCENCE INTENSITY 

Figure 3, Two-color flow cytometric analysis of Icu 2a-PE (suppress 
sor-cytotoxic) expression and binding of SP-DTAF (SP*) to human 
blood T-lymphocytcs. T-lymphocytes were stained seqiientially with 
anti-leu 2a-PE and SP* and analyzed with a FACS IV with two 
parameters. The positively-staining anti-leu la-PE T-lymphocytes 
(left) were analyzed selectively (gated peak) and examined for the 
presence of SP* reactivity. Within the leu 2a-positive peak, there are 
both nonreacUve and SP*-reactive (9.5%) (right) T-lymphocytes. The 
positively-staining anti-leu 2a-PE T-lymphocytes constituted 26±5% 
(mean±SD, n = 3) of the total. 

by leu 2a-PE. 9.5±3.0% (mean±range, n = 3) of the T-lym- 
phocytes bound SP* (Fig. 3), while in the helper-inducer sub- 
set identified by leu 3a-PE, 18.0±5.0% (mean±range, « 
= 3) of the T-lymphocytes bound SP* (Fig. 4). 

Characteristics of the binding of['H]SP to human blood 
T-lymphocytes, The time-course of association of ['H]SP with 
purified T-lymphocytes was assessed by incubating duplicate 
suspensions of 1 X 10^ T-lymphocytes with 10 nM ['H]SP for 
0.5-40 min at 4*C The specific landing of ['H]SP by T- 
lymphocytes increased with time and reached a plateau after 
'^9-12 min (Fig. 5). The time-course of dissociation of q^edf- 
ically bound ['H]SP from T^ymphocytes was examined by 
incubating replicate suspensions of T-lymphocytes for 40 min 
at 4*C with 10 nM [^H]SP to achieve saturation, washing with 
cold buffer to remove fluid phase [^H]SP, and resuspending in 
buffer with IO~^M (IO,000>-fold excess) nonradioactive SP. 
Specifically bound [^H]SP diminished ra(»dly with time, so 
that "-'SO and 80% were dissociated after S and 30 min, 
respectively (Fig. S). 

A representative Scatchard plot (Fig. 6) of the ooncentration- 
dependenoe of ['H]SP tnnding to T-lymphocytes reveals a 
linear relationship with a single dissociation constant (JfTp) of 
0.83 X 10'^ M and a mean of 4820 specific binding sites for 
SP/T-lymphocyte. The data firom this and four other identical 
analyses of the binding of ['H]SP to T-lymphocytes from 
different normal subjects exhibit some donor-dependent vari- 
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RELATIVE FLUORESCENCE INTENSITY 

Figure 4. Two-color flow cytometric analysis of leu 3a-PE (helper- 
inducer) expression and binding of SP-DTAF (SP*) to peripheral 
blood T-lymphocytes. T-lymphocytes were stained with anti-leu 3a- 
P£ and SP* and analyzed with a FACS IV as in Fig. 3. The 
positively staining anti-leu 3a-PE Trlymphocytes (left) were analyzed 
sdectively (gated peak) and examined fxx the presence of SP* reactiv* 
ity. Within the leu 3a-p08itive peak, there were both oonreactive and 
SP* reactive (18%) (right) T-lymphocytes. The positively staining 
anti-leu 3a-PE T-lymphocytes constituted 60±8% (mean±SD, a 3) 
of the total. 



ation (Table I). The mean Xb±SD was 1,85±0.70 X 10"' M 
and the mean density of binding sites for SP±SD was 
7035±2850/T-lymphocytc, The level of nonspecific binding of 
[^H]SP after a 40-min incubation at for five separate 
experiments was 35± 1 5% (mean±SD) of the total binding. 

Structural determinants of the binding of SP to human T- 
lymphocytes. The peptide requirements for the binding of 
['H]SP to T-lymphocytes was evaluated by examining the 
effects on binding of structurally distinct neuropeptides and 



substituents of SP (Fig. S). The SP antagonist (19) [D-Pro^^ 
Phe\D-Trp']SP competitively inhilrited the binding of [^H]SP 
to T-lymphocytes (Fig. 7) at concentrations known to suppress 
the stimulatory effixts of SP on T-lymphocyte function (3). 
SP (4-11), which stimulates ['H]thymidine incorporation by 
T-lymphocytes (3), also competitively inhibited the binding of 
['H]SP to T-lymphocytes. Eledoisin, which has the same 
amino adds as SP at positions 2, 7, and 9-1 1, and substance 
K (a-neurokinin) (28), which has the same amino adds as SP 
at positions 3. 7, and 9-11, exhibited '■^ 10-20% of the potency 
of SP assessed at 20% displacement of Innding of ['H]SP and 
10% and 0.1%, respectively, of the potency of SP at 40% 
displacement (Fig. 7). In contrast, similar concentrations of 
the tetrapeptide SP (1-4) and of somatostatin failed to inhibit 
significantly the binding of ['H]SP; the concentration of so- 
matostatin required for 50% inhibition of spedfic binding of 
[^H]SP was 1,000-fold greater than for SP. 

Cellular specificity of the binding of SP. The binding of 
['H]SP by purified human bipod B-lymphocytes, monocytes, 
PMN leukocytes, and platelets, and by cultured Jurkat, Hut 
78, Molt-4, and differentiated U-937 and HI.-60 cells was 
examined in two experiments (Table II). None of the non-T- 
lymphocytes demonstrated specific binding of [^H]SP which 
exceeded 21% of the total binding. In contrast, the total 
binding of [^H]SP by PHA-stimulated and unstimulated blood 
T-lymphocytes, which achieved a level similar to that of the 
cultured cells, consisted of over 60% specific binding as assessed 
by the addition of a 10,000-fold molar excess of unlabeled SP 
(Table II). 



Discussion 

The capadty of neuropeptides and other mediators to modify 
immune function specifically is dependent on recognition of 
the mediators by T-lymphocytes which express immunoregu- 
latory activity. The application of two distinct techniques 
employing SP labeled by different methods demonstrated the 
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Figure 5. lune-oouxse of qiedfic bind- 
ing of CHISP to human T-lymi^ocytes. 
['H]SP was incubated with I X 10^ T- 
lymphocytes for different periods of 
time and the leveb of spedfic binding 
were quantified, Each point and bracket 
r ep r e s ent the mean±SD of three experi- 
ments done in duplicate. 
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Figure 6. Scatchard plot of the cosoentration' 
dependence ofI'H]SP binding to human T*lym* 
phocytes. The data are based on total ['H]SP 
bound by 1 X 10^ T-lymphocytes in the pnar 
ence dt increasittg qiiantitks of nonradioactive 
SP. Each point repfcsents the mean of duplicale 
nxauurements for caqxriment number I (see 
Table I). The horizontal portion of the curve 
represents the amount of nonspecific binding of 
['HJSP to T-lymphocytes. 



q)ecificity of the interaction of SP with human T-lymphocytes. 
A fluorescent derivative of SP, dfsagnatcd SP*, labeled 21±I0% 
(mean±SD, n = 6) of human T-lympbocytes as assessed by 
flow cytometry (Fig. 1). The Innding of ('H]SP by similarly 
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M Ko 


Experiment* 


T-lympliocytel 


(Molar X l(r')| 


1 


4820 


0.83 


2 


7230 


3.33 


3 


9033 


1.40 


4 


10480 


2.22 


5 


3614 


1.51 


Mean±SD 


7033±2850 


1.85±0.70 X 10-' M 



* Experiments were done on blood T-lymphocytes from five different 
normal donors. 

% The ATo and number of Unding sites weie determined by Scatchaid 
analysis as described in the text Each experiment was done in dupli- 
cate. 



purified human T-lymphocytes revealed a single class of recep- 
tors with a iCo of 1.85±0.70 X 10"^ M (mean±SD, n » 5) 
(Fig. 6, Table I). The binding of pHJSP to T-lymphocytc 
receptors achieved equilibrium rapidly and was reversed rapidly 
by the addition of excess nonradioactive SP (Fig. 5). The 
specificity of the interaction of SP with T-lymphocytes was 
confirmed by complete inhibition of both SP* fluorescent 
staining and pH]SP binding by [I>'Pro',I>PheM>-Trp']SP 
(Fig. 7) at concentrations which inhibit the effects of SP on T- 
lymphocyte function (3). The competitive inhibition of ['H]SP 
binding by other peptides revealed that SP (4-11), which 
stimulates T-lymphocyte proliferation with a concentration 
dependence similar to SP (3), displaced ['H]SP at equimolar 
concentrations (Fig. 7). Two peptides of the tachykinin family, 
eledoisin and substance K, which share the same carboxy- 
terminal tripeptide and two other amino adds with SP, also 
inhibit the binding of [^H]SP to T-lymphocytes at higher 
concentrations (Rg. 7). The structural requirements for binding 
of SP to T-lymphocytes suggest that the caiboxy-terminal 
sequence -G]y-ljeu-Met-NH2, which contributes to the stabi- 
lization of the structure of SP (29) and Phe' but not the 
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MOLAR CONCENTRATION OF LIGAND 

Figure 7. Structural determinants of binding of ['HISP to human T- 
lymphocytes. Each point and bracket depicts the mean±SD of the 
percentage of total binding of ['H)SP inhibited specifically by increas- 
ing concentrations of the nomadioecdve peptides listed below, and it 
equals the number of experiments. n — 4, SP Arg-Pro-Lys-Pro- 
Gln-Gln-Phe-Phe-Gly-Leu^Met-NHj; o. » = 2. SP (1-4) Arg-Pro-Lys- 
Pro; A. n « 2, SP (4-1 1) Pro-Gln-Gln-Phe-Phc-Gly-Leu-Mct-NHj ; o, 
n - 2» eledoisin pGlu-Pro-Ser-Lys-Asp-Ala-Phe>lle-Gly-Leu-Met- 
NHa; n 2, substance K His-Lys-Thr-Asp-Ser-Phe-Val-Gly-Leu- 
Mct-NHj; A, n = 2, somatostatin (1-14) Ala-Gly- Cys-Lys-Asn-Phe- 

Pbe-Trp-Lys-Thr-Phe-Thr-Ser-Cys; n « 2, [D-Pro^ J>-Phc^,I>- 

TrpTSP. 

amino-terminal tetiapeptide, determine the affinity and sped- 
licity of the interaction. 

Concurrent identification of helper4nduoer and suppressor- 
cytotoxic subsets of T-lymfriiocytes with PE-coi^ugated mono- 
clonal antibodies to different leu antigens of the respective 
subsets permitted assignment of the SP-DTAF (SP*) fluores- 
cence to T-lymphocytes in both subsets (Fig. 3 and 4). SP* 
was recognized by a mean of 18 and 9.5% of the helper- 
inducer and suppressor-cytotoxic subsets of T-lymphocytes, 
respectively. The cellular specificity of the binding of SP was 
established by demonstrating a greater d^ree of specific binding 
of ['H]SP to T-lymphocytes than to B-lymphocytes, monocytes* 
PMN leukocytes, platelets, and differentiated monocytes of the 
U-937 and HL-60 lines (Table ID. SP, at concentrations of 
10~"-l(r^ M had no effect on the uptake of ['H]thymidine 
by the lymphcrid cells Molt-4, Hut-78, aikl Jurkat in culture, 
and ['H]SP failed to bind the cultured lymphoid cells. 

The demonstration of labeling of only a mean of 21% of 
mixed T-lymphocytes with SP* permitted an estimation of the 
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Toul bifMjins 


SpeciAc 


Cdl type 


(10-* M l»HJSI*) 


binding!! 


T-lymphocytes* 


1273±126 


766±96 


T-lymphocytes stimulated 






with PHA* 


1626±80 


994±87 


B-Iymphocytes* 


966±]07 


19±63 


Monocytes^ 


1451±16l 


97±130 


PMN leukocytest 


972±60 


138±67 


Platelets^ 


2817±90 


119±120 


Molt-4 lymphoblasts 


3810±368 


369±248 


Juricat cells 


264I±2I0 


174±190 


Hut 78 T-cells 


2185±162 


189±141 


U-937 monocytes. 






differenttated§ 


846±3S 


196±86 


HU60 cells. 






differentiated! 


724±117 


178±54 



* Experiments utilized T- and B-lymphocytes isolated from five sepa- 
rate normal donors. T-lymphocytes were stimulated with 10 ^ml of 
PHA for 48 h. 

X Experiments utilized PMN leukocytes, monocytes, and platelets 
from two separate normal donors. 

§ Two separate experiments were performed with cells stimulated to 
differentiate by l,25(OH)2D3. 

" Mean cpm bound±SD/l x 10^ celb for duplicate measuremenu 
with cells from each donor, except in the case of platelets where 1 
X 10* cells in duplicate were used. 

f Specific binding = total binding (ID-* M ['H]SP) minus nonspecific 
binding (I0-* M ['H]SP + 10~* M SP). 



number of SP receptois. The Scatchaid plot of the results of 
Innding of ['H]SP to T-lymphocytes revealed a mean of 7035 
SP receptors/T-lymphocyte, if aU cells were assumed to bind 
SP (Fig. 6, Table I). Utilizing the flow cytometric data to 
define a subset of SP-reactive T-lymphocytes vfbich amount 
to 21% of the total population led to an estimate of '^35,000 
receptors/T-lymphocyte in the subset which recognized SP. 
The relevance of the Ixnding data to the effects of SP on T- 
lymphocytes also was suggested by the similarity of the JCd to 
the concentration of O.S X 10''' M SP required to achieve 50% 
of the maximal stimulation of T-lymphocyte proliferation (3). 

The selective interaction of SP with a quantitatively small 
subset of T-lymphocytes is reminiscent of the binding of C3a 
peptides (2 1) and leukotriene B4 (LTB4) (30) to only '-^40 and 
1 1%, respectively, of human blood T-lymphocytes. The distri- 
bution of SP*-reactive T-lymphocytes between the helper- 
inducer Oeu 3a+) and suppressor-cytotoxic (leu 2a-l-) subsets 
resembles more that of LTB«-reactive lymphocytes within the 
same general subsets (30). Although LTB4 reacted with t>oth 
subsets, a greater percentage of LTB4-reactive lymphocytes 
were suppressor-cytotoxic cells (14%) than helper-inducer cells 
(8%), with the net effect of LTB4 on proliferation being 
inhibition (23). In contrast, a greater percentage of SP-reactive 
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T-lymphocytes were helper-inducer cells (18%) compared with 
suppiessor-cytotoxic cells (9%), and the net effect of SP on 
proliferation is stimulation (3). However, the presence in the 
helper-induoer subset of suppressor-induoer T-lymphocytes 
emphasizes the difficulty of meaningfully explaining a functional 
outcome from the results of analyses with a limited number 
of monoclonal antibody reagents. The recognition of C3a 
peptides and somatostatin by a minority of T-lymphocytes 
also has substantial functional consequences, as manifested by 
significant suppression of proliferation (1, 21) and other activ- 
ities (21). The in vitro findings suggest that the net effect on 
immunological responses of neuropeptides released from pe- 
ripheral nerves may depend, in part, on the balance of the 
stimulatory effects of SP and the inhibitory effects of somato- 
statin. 
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Regulation of Calcium Homeostasis in Sensory Neurons by 
Bradykinin 

Stanley A. Thayer, Teresa M. Pemey, and Richard J. Miller 
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The nonapeptide bradykinin (BK) activates sensory neurons 
and stimulates the transmission of nociceptive information 
into the CNS. We investigated the effect of this peptide on 
rat dorsal root ganglion neurons (DRQ) grown In vitro, BK 
stimulated the synthesis of inositol trisphosphate (IP,) and 
the breaicdown of phosphatidylinositol bisphosphate, the 
synthesis of diacyiglycerol, and the release of arachidonic 
acid from DRG cells. The release of IPs and arachidonic acid 
was not inhibited by pretreatment of the cells with pertussis 
toxin. BK also mobilized intracellular Ca^-^ stores in DRG cells 
as assessed by fura-2-ba8ed microfluorlmetry. Two types 
of Ca'-*- stores appeared to exist in DRG neurons. One type 
could be mobilized by caffeine (10-* m), and this effect could 
be bloclced by ryanodine in a use-dependent manner. These 
stores occurred primarily in the cell soma and were virtually 
absent from cell processes. A second type of store could 
be mobilized by BK, presumably through the mediation of 
IP,. These latter stores were distributed equally between the 
cell soma and processes. Experiments with combinations of 
caffeine and BK suggested that the stores mobilized by these 
2 agents may be separate entities. Both the caffeine and BK 
sensitive Ca*-^ storage sites appeared to participate in buff- 
ering a Ca^-^ load induced in DRG neurons by cell depolar- 
ization. The relevance of these observations to the mech- 
anism of action of BK on sensory neurons is discussed. 

The nonapeptide bradykinin (BK) is released from its precursors 
the kininogens by the action of the enzyme kallikrein in response 
to trauma (Erdos, 1979). BK production is one of a number of 
responses to injury initiated by the activation of Hagemann 
factor— other examples being the blood clotting and comple- 
ment fixation cascades. Receptors for BK exist in a number of 
tissues, including the nervous system (Manning and Snyder, 
1 983), gastrointestinal and vascular smooth muscle (Manning 
etal., 1986;Benyetal., 1987), the gastrointestinal mucosa (Man- 
ning et al., 1982) and many others. When such receptors are 
activated, a multitude of responses are initiated which help to 
orchestrate the body's defenses against injury. 

BK powerfully activates sensory neurons (Higashi et al., 1 982), 
leading to the increased release of neurotransmitters such as 
substance P (Yaksh and Hammond, 1982). These neurotrans- 
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mitters mediate the transmission of nociceptive information 
into the spinal cord. Although such a role for BK is well estab- 
lished, the molecular mechanisms underlying these actions are 
obscure. In non*neuronal tissue, BK receptor activation leads 
to the stimulation of both phospholipase C and phospholipase 
A2 and the subsequent release of a host of powerful lipid-derived 
second-messenger molecules (Hong and Deykin, 1982; Miller, 
1987b). These include inositol trisphosphate (IP3), diacyiglycer- 
ol (DAG), and arachidonic acid (Miller, 1987b). These sub- 
stances can now initiate Ca^*^ mobilization, activation of protein 
kinase C (PKC), and eicosanoid biosynthesis, which can, in turn, 
lead to the synthesis of further second messengers. Studies on 
neuronal clonal cell lines have indicated that BK can also stim- 
ulate the production of lipid-derived intermediates in neuronal 
tissues (Yano et al., 1984; Francel and Dawson, 1986; Francel 
et al., 1 987; Jackson et al.. 1 987; Miller, 1 987b; Van Calker and 
Heumann, 1987). Furthermore, various electrophysiological ef- 
fects of BK and of lipid-derived second messengers have been 
demonstrated both in neuronal cell lines and authentic sensory 
neurons (Baccaglini and Hogan, 1 983; Fowler et al., 1 985; Reiser 
and Hamprecht, 1985; Higashida and Brown, 1986a, b, 1987; 
Higashida et al., 1 986; Osugi et al., 1986a, b; Rang and Ritchie, 
1987; Weinreich, 1986). 

The release of neurotransmitters from sensory neurons can 
be triggered by increases in [Ca^+Jj. In dorsal root ganglion (DRG) 
and other nerve cells, the resting [Ca'"^]; is normally very low 
(—10"' m) (Miller, 1987a). [Ca^^]i can potentially increase either 
through Ca^^ influx from the cell exterior via a variety of mem- 
brane channels or by mobilization from intracellular Ca^^ stores 
(Miller, 1987a). In the present series of studies, we have ex- 
amined the effects of BK on lipid metabolism and authentic 
sensory neurons in vitro. We find that BK activates phospho- 
lipases C and Aj and also modulates intracellular Ca^-^ stores. 
Furthermore, sensory neurons seem to possess a Ca^* store that 
can be mobilized by methylxanthines such as caffeine. The prop- 
erties of the BK and cafieine-sensitive stores are compared. 

Materials and Methods 

Cell culture. Pure populations of DRG neurons were cultured as de- 
scribed by Pemey et al. (1986). Briefly, DRG neurons were dissected 
from thoracic and lumbar segments of 1 - to 3-d-old Sprague-Dawley 
rats, incubated for 15 min at 3 7*^1: in collagenase/dispase (0.8 and 6.4 
units/ml), and then dissociated into single cells by trituration through 
a Pasteur pipette. The cells were then plated on laminin-fibronectin- 
coated coverglasses (no. 1, 25 nun diameter). Cells were fed every 2-3 
d with Ham*s nutrient mixture F-12 supplemented with 5% heat-in- 
activated rat serum, 4% 1 7-d embryonic rat extract, 50 ng^ml NGF, 44 
mM glucose, 2 niM L-glutamine, 1% MEM 100 x vitamins (GIBCO), 
and penicillin/streptomycin (100 units/ml and 100 Mg/nil, respectively; 
GIBCO). Cultures were maintained at 37'*C in a water-saturated at- 
mosphere with 5% carbon dioxide. Sympathetic neurons were cultured 
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from rat superior cervical ganglia (Pcmcy ct al., 1986) and central neu- 
rons from rat striatum (Murphy et al., 1 987). Cultures were treated with 
10-^ M cytosine arabinoside for 48 hr, 12 hr after plating to suppress 
the growth of non-neuronal cells. 

Measurement oflCc^^h [Ca**], was determined using a microfluorim- 
ctcr to monitor Ca**-sensitive fluorescent chelator, fura-2 (Grynkiewicz 
et al., 1985). Neurons were loaded with the dye by incubation with 2 
MM fura-2 acetoxymcthyl ester (Molecular Probes Inc., Eugene, OR), 
which is membrane permeant, for 1 hr at yVC in HEPES-buffered 
Hank's balanced salt solution, pH 7.45, containing 0.5% BSA. The 
HEPES Hank's solution was composed of (in him) the following: HEPES, 
20; Naa, 137; CaQ^, 1.3; MgSO*, 0.4; MgQ^ 0.5; Ka, 5.0; KH^PO,, 
0.4; NaHPO^, 0.6; NaHCO,, 3.0; and glucose, 5.6. Following the loading 
incubation, during which time tiie dye ester is hydrolyzed by cytosolic 
esterases to the membrane-impermeant polycarboxylate anion that is 
fura-2, the cells were washed twice in the HEPES-Hank*s solution and 
incubated for 30 min. It is difficult to measure precisely the amount of 
dye loaded into a cell. However, in cells loaded with a 100 mm fura-2 
via internal dialysis with a patch-clamp pipette, the fluorescence signal 
was similar to the very bri^test cells used in this study. Thus, 100 mm 
would be an upper limit for the intracellular fura-2 concentration. 

The coverglasses containing the loaded and washed cells were then 
mounted in a flow-through chamber for viewing. Briefly, thie chamber 
consisted of a Plexiglas block machined to accommodate the coverslip 
as a bottom. Three reservoirs were cut into the block sucdi that a thin 
sheet of buffer flowed out of the inlet reservoir, across the cells in the 
experimental chamber, and was drawn up across nylon mesh to the 
efllux reservoir for evacuation by suction. Solutions in the chamber 
could be completely exchanged within 15 sec, including the time taken 
to flow through the tubing delay between the large media reservoirs and 
the inlet to the chamber. Experiments were run at 22*<^. 

The perfusion chamber was mounted on an inverted microscope (E>ia- 
phot, Nikon, Garden City, NY), and cells and processes were localized 
by standard phase-contrast illumination. The perfusion chamber as well 
as the microfluorimeter are described in detail elsewhere (Thayer et al., 
1 987a, 1 988). For exdtion of the funi-2, the collimated lig^t beam from 
a 200 )v Hg arc lamp was passed through a dual-beam spectrophotometer 
(Phoenix Instruments), which alternated wavelengths from 340 to 380 
nm by means of a wheel spinning at a frequency of 60 Hz. In place of 
the original sample chamber, a collimating beam probe was placed for 
focusing the light onto the end of a liquid guide (3 mm x 1 m. Oriel, 
Stratford, CT). On the other end of the liquid light guide, a similar probe 
was positioned for directing light throujgh the epifluorescence illumi- 
nator of the microscope. The light was reflected off a dichroic mirror 
(Nikon, DM 400) and through a x70 phase-contrast oil-immersion 
objective (E. Leitz nc, Rockleigh, NJ, numerical aperature, 1.15). The 
emission fluorescence was selected for wavelength with a 480 nm barrier 
filter, and recordings were defined spatially with a rectangular dia*- 
phragm. The fluorescence emission was analyzed with a photomultiplier 
tube and discriminator (Thorn EMI Gencom Inc., Plain view, NY). The 
discriminator output was converted to pulses, which were then inte- 
grated by passing the signal through an 8-pole low-pass Bessel filter at 
500 Hz. The photomultiplier signal was fed into one channel of an 
analog-to-digital converter computer system (C-lab, Indec Systems, 
Sunnyvale, CA). The signals from 2 photodiodes, each placed in a small 
portion of the light beam directed to the monochromators, were fed 
into 2 additional channels of the analog-to-digital converter. 

The photomultiplier output was sorted into signal from 340 and 380 
nm excitation by using the photodiode output as synchronizing signals. 
In the typical traces described here, 30 ratios were determined per 
second, the average ratio was displayed on-line, and the average intensity 
values for each wavelength were stored. After completion of a given 
experiment, the microscope stage was adjusted so that no cells or debris 
occupied the field of view defined by the diaphragm, and then back- 
ground light levels were determined. Background light' levels ranged 
firom less than 5% for lai^e cell bodies to close to 50% for very &ie 
processes. Autofluorescenoe from cells that had not been loaded with 
fura-2 was not detectable. Records were later corrected for background 
and the ratios recalculated. Ratios were converted to free {Ca^^J by using 
the equation [Ca^*] = K{R-R^^f{R^^'R\ in which R is the 340/380 
nm fluorescence ratio (Grynkiewicz et al., 1985). The maximum ratio 
{R^y the minimum ratio (/J„,i„), and the constant (/T is the product 
of the dissociation constant for fura-2 and the ratio of the free and 
bound forms of the dye at 380 nm) were determined from a standard 
curve to which the above equation was fit using a nonlinear least-squares 



fit computer program. The system was recalibrated following any ad- 
justment in the apparatus. Values for the constants and K 
ranged firom 0. 1 2 1 to 0.334, 4.02 to 5.06, and 2034 to 23 73, respectively. 
The standard curve was determined from the fura-2 pentapotassium 
salt in calibration bufler (which contains, in mM, HEPES, 20; KQ, 120; 
Naa, 5; pH 7. 1) containing 10 mM EGTA, = 3.969 x 106 m (Fabiato 
and Fabiato, 1979), and varying amounts of added Ca^*, which were 
calculated to give finee Ca^* concentrations ranging from 0 to 2000 um. 
Records were digitally filtered with an algorithm that added the value 
of each data point with % of the value of each of the 2 neighboring 
points. The data were cycled through this routine 5 times. Results are 
presented as means ± SEM. 

All experiments were performed on cells continuously perfused with 
HEPES Hank*s solution. Depolarization-induced CA'* influx was pro- 
duced by changing the perfusing solution from low K* (5 mM) to high 
K+ (50 mM) with exchanged for Na^ reciprocally. Ca^^-free solutions 
were prepared by replacing Ca^^ with 20 mm EGTA. 

Lipid metabolism, DRG cultures were incubated with either *H-myo- 
inositol (5 /iCi/ml) for 48 hr or m-arachidonic acid (1 ^Ci/ml) for 10 
hr in normal growth medium. In some experiments, cells were also 
incubated with 1 00-350 ng/ml pertussis toxin for 24 hr. Unincorporated 
isotope was removed by washing cultures 2 times with 2.4 ml of HEPES- 
buffered Eagle's MEM containing 0.5 mg/ml fatty acid-fi^ BSA. Cells 
were then incubated for 30 min in 2.5 ml HEPES Eagle's MEM at 37"C. 
A final wash was performed, and the experiment was initiated after 5 
min by the addition of BK (100 nM except as indicated). The reaction 
was terminated after various intervals by rapid aspiration of the buffer 
and addition of either 2 ml of 10% trichloroacetic acid (TCA) or 2 ml 
of methanol for ^H-myoinositol- or ^H-arachidonic acid-labeled cells, 
respectively. In experiments in which arachidonic acid release was stud- 
ied, the aspirated medium was saved. Cells were then scraped off* the 
plate and added to 1 5 ml glass centrifuge tubes. The TCA extracts were 
spun at 1000 rpm for 5 min. The supernatant was saved for inositol 
phosphate analysis and the pellet for phospholipid analysis. Chloroform, 
2 ml, and HQ, 100 m1, were added to the methanol extracts and phase- 
separated by the addition of 1.2 ml of 1 N HCl/5 mM EGTA. The 
aqueous phase was removed, and the organic phase evaporated under 
Na and saved for DAG analysis. 

Analysis of inositol sugars. ^H-labeled inositol sugars were assayed by 
the method of Berridgc et al. (1982). Briefly, TCA extracts containing 
the inositol sugars were extracted 5 times in an equal volume of ether 
to remove the acid and then neutralized with 50 mM tetraborate. The 
water-soluble inositol phosphates were then separated by ion-exchange 
chromatography on Dower- 1 X 8 (formate form). The samples were 
applied to 1 ml columns of Dowex, and free inositol was washed through 
the columns with ten 2 ml rinses of glass-distilled water. Subsequent 
elution of glycerol phosphoinositol, inositol monophosphate (IP), ino- 
sitol bisphosphate (IP^), and IP, was accomplished by five 2 ml rinses 
with the following buffers: 60 mM sodium formate/5 mM sodium tetra- 
borate; 0. 1 M formic acid/0.2 m ammonium formate; 0. 1 m formic acid/ 
0.5 M ammonium formate; and 0.1 m formic acid/ 1.0 m ammonium 
formate. 

Phospholipid analysis. TCA pellets were resuspended in 1.5 ml of 
chloroform/methanol/HCl (100:100:1, vol/vol), sonicated, and spun at 
1000 rpm for 5 min. The supernatant was saved and the pellet extracted 
twice more in a similar manner first with 1.5 ml of chloroform/meth- 
anol/HCl (100: 100:1) and then with 1 ml of chloroform/methanol/HCl 
(200:100:1). The supematants were pooled and phase-separated by the 
addition of 1 .5 ml chloroform and 1 .5 ml 0. 1 m HQ. The aqueous phase 
was removed and the organic phase washed twice with the theoretical 
upper phase [chloroform/methanol/water (3:48:47)] before drying under 
Nj. After evaporation the residues were redissolved in 50 ml of chlo- 
roform and applied to TLC plates (silica G). The phospholipids were 
separated on oxalate-impregnated silica plates and developed in chlo- 
roform, methanol, and 4 N NH^OH (45:35:9.5) according to the method 
described by Billah and Lapetina (1982). Individual Upids were visu- 
alized by exposure to 1 2 vapors. Zones corresponding to lipid standards 
were scraped and assayed for radioactivity by liquid scintillation meth- 
ods. 

DAG analysis. For »H-arachidomc acid-labeled DAG determination, 
the dried-down lipids were redissolved in 1 ml chloroform and applied 
to a 2 ml silicic acid column. Neutral lipids were eluted from the column 
by a 10 ml rinse with chloroform. After evaporation of the column 
eluate under N2, the residue was redissolved in 50 m1 of chloroform and 
applied to TLC plates. Separation of DAG was achieved by developing 
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Figure 1 . Time course of BK-stimulated phospholipid breakdown and 
inositol phosphate production. Cultured DRG neurons were prelabeled 
with ^H-myoinositol (5 MCi/ml) for 48 h and then exposed to BK (100 
um) for various intervals (see Materials and Methods) A, Inositol phos- 
phate production. The data are expressed as percentage of the basal 
value. Each point represents the mean of duplicate determinations. This 
time course is representative of 3 experiments. The basal levels were 
488 cpm/plate; ^H-IPj, 532 cpm/plate; and IP, 2448 cpm/plate. 
B, Phospholipid breakdown. The data are the mean of duplicate deter- 
minations and are expressed as percentage of the basal levels. Basal 
levels were PIP2, 8.83 x 10* cpm/plate; PIP, 8.75 x 10^ cpm/plate; and 
PI, 1.6 X 10' cpm/plate. 



the plates in ether/hexane/acetic acid (70:30: 1) as described by Griend- 
linget al. (1986). 

Arachidonic acid release. The release of ^H-arachidonic acid and its 
metabolites into the medium was determined by assaying 200 ml ali- 
quots of the medium for radioactivity by liquid scintillation spectro- 
photometry. In some experiments, ^H-arachidonic acid in the organic 
phase was separated from its metabolites by TLC [developed in chlo- 
roform/isopropanol/ethanol/formic acid (45:5:0.5:0.3)] after extraction 
of the medium in 6 ml chloroform/methanol (2:1). Tlie 2 methods 
yielded similar results. 

Results 

Phospholipid metabolism 

In many cell types, BK has been shown to activate both phos- 
pholipase C and phospholipase A2 (Miller, 1987b). For example, 
this has been demonstrated in some neuronal clonal cell lines, 
although not in authentic DRG cells (Yano et al., 1984; Francel 
and Dawson, 1986; Francel et al., 1987; Jackson et al., 1987; 
Van Calker and Heumann, 1987). We therefore began by ex- 
amining the effects of BK on phospholipid metabolism in cul- 
tures of rat DRG neurons in vitro, BK, in the range IQ-'-lO"* 
M, rapidly stimulated the production of IP, IP,, and IPj (Figs. 
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Figure 2. Dose dependence of BK-stimulated inositol phosphate pro- 
duction. Cultured DRG neurons were prelabeled with ^H-myoinositol 
(5 /iCi/ml) for 48 hr and then exposed to BK at various concentrations 
for 30 sec (see Materials and Methods). Data are means ± SE of triplicate 
determinations and are expressed as percentage of the control value. 

1 and 2). The synthesis of IP, was seen most rapidly followed 
by that of IP2 and then IP. Large increases in the synthesis of 
IP3 were observed (up to 20-fold). As reported for a variety of 
other systems, the agonist stimulated synthesis of IP3 and IP2 
was relatively transient, declining to basal levels over a few 
minutes. In contrast, the slower and less profoimd synthesis of 
IP was more sustained and was still apparent at the termination 
of experiments (5 min). As would be expected diuing the period 
over which the synthesis of IP3 and IPj occurred, the concen- 
trations of PIP2 and PIP fell (Fig. 1). BK-stimulated IP3 pro- 
duction was not blocked after pretreatment of cells with pertussis 
toxin (PTX) (Fig. 3). BK also increased the production of DAG 
by DRG cells (Fig. 4). Interestingly, the time course of DAG 
production was considerably longer than that observed for IP3. 
An initial peak of DAG production occurred that correlated in 
time with the production of IP3. However, this was followed by 
a second extended peak of DAG production that lasted for many 
minutes. Such biphasic agonist-induced DAG production has 
now been noted in many cases and may reflect multiple sources 
of DAG (Miller, 1988). These observations make it clear that 
BK does activate phospholipase C in DRG neurons. We also 
found that BK activated phospholipase A,. BK stimulated the 
release of ^H-arachidonic acid and its metabolites from labeled 
DRG cells. In the presence of BK (10-^ m), the levels of radio- 
activity released into the culture medium rose to 1 15.8 ± 7.9% 
of control after 30 sec and 127.4 ± 8.5% of control after 5 min 
of incubation (n = 1 8). This release was not significantly reduced 
by pertussis toxin treatment (n = 10). In one experiment, we 
also observed that BK (10"' m) stimulated the production of 
lyso-PI by the ceils. 

Intracellular Ca^^ stores 

IP3 has been shown to mobilize Ca^*^ from intracellular stores 
in a number of cell types (Berridge, 1987). We therefore ex- 
amined the effects of BK on [Ca^^], in different portions of rat 
DRG neurons and compared it with the effects of other stimuli 
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Figure 3. Effects of pertussis toxin on 
BK-stimulated inositol phosphate pro- 
duction. Cultured DRG neurons were 
prelabeled with ^H-myoinositol (5 fiCi/ 
ml) and then exposed to BK (100 nM) 
for 30 sec. For some of the cxUtures, 350 
ng/ml pertussis toxin was added 24 hr 
after labeling had begwi. Data are means 
± SE of triplicate determinations in A 
experiments and are expressed as per- 
centage of the control value (see Ma- 
terials and Methods). A similar exper- 
iment in which the time of exposure 
to BK was varied also showed no per- 
tussis toxin sensitivity to the response. 
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that affect [Ca^*]j. In the cell bodies of DRG cells, depolarization 
induced by raising [K*]p induced a rapid rise in (Ca^^l. This 
occurred in 100% of the cells examined (Figs. 5, 9). [Ca^*^], rose 
from a resting concentration of 77 ± 7 nM (n = 30) to 513 ± 
6 nM (/I = 27) (Figs. 5, 9). In Ca^^-frec mediiun, the response 
to depolarization was completely blocked (Fig. 5^. A striking 
feature of the depolarization-induced Ca^"- transient in DRG 
ceUs was that it was buffered very slowly. Following washout of 
the depolarizing stimulus, [Ca^^jj initially fell rapidly but then 
stabilized and only declined very slowly even in Ca^^-free me- 
dium (Figs. 5, 6). This slow buffering of Ca^* stands in contrast 
to other types of peripheral and central neurons we have ex- 
amined, which buffer similar Ca^^ loads much more quickly 
(Fig. 6) (Murphy et al., 1987; Thayer et al. 1987a). 

Methylxanthines such as caffeine have been shown to mo- 
bilize Ca^-" from intracellular stores in several types of cells, 
including peripheral neurons (Neering and McBimiey, 1984; 
lipscombe et al., 1 987; Thayer et al., 1 987b). Caffeine was also 
very effective in DRG cell bodies. In Ca^^-free medium, 10-^ m 
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Figure 4. Time course of bradykinin stimulated diacylglycerol for- 
mation. Cuhured DRG neurons were prelabeled with [*H]arachidonic 
acid (1 /iCi/ml) for 10 hr and then exposed to bradykinin (100 nm) for 
various intervals. This time course is representative of three sixnilar 
experiments. (See Materials and Methods). 
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Figure 5. Depolarization-induced increases in [Ca^'^li in sensory neu- 
rons. [Ca^^jj was measured in single DRG cell bodies (A and B) and 
processes (C) as described in Materials and Methods. Cells were de- 
polarized during the time indicated by the horizorUcU bars by changing 
the perfusing solution from low (5 nuf) to high (50 him) media. In 
contrast to the 3 control responses generated in A, depolarization in 
Ca'^-free (20 mm EGTA) media (horizortlal bar) failed to elicit a response 
(B). Depolarization of a neuronal process (Q produced [Ca^'^li transients 
that were smaller and more rapidly buffered than those elicited in cell 
bodies. 
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Figure 6. Comp^ison of Ca''*^ buffering in sensory, sympathetic, and 
oential neurons. Single somata from the DRG {A), the SCG {fi), and 
the striatum (O were depolarized during the time indicated tyy the 
horizontal bars by changing the perfusing solution from low (5 mM) to 
high (50 mM) media. [C^^'^li was measured as described in Materials 
and Methods. 



caffeine produced Ca^* transients in the vast majority of cells 
examined (83%, Figs. 7, 9). In order to elicit multiple responses 
to caffeine, the stores reqtiired "refilling" after each exposure. 
This could be achieved by depolarizing the cell briefly in C^^**^- 
containing medium and allowing Ca^^ influx to elevate [Ca^^J^ 
(Fig. 7). The caffeine-sensitive stores in DRG neurons seem to 
be similar to those seen in muscle cells, as the caflfeine-induced 
increases in [Ca^'^Jj could be completely blocked by ryanodine 
(Fig. 7), The block produced by ryanodine was "use dependent." 
Thus, after addition of ryanodine, a caffeine response could be 
obtained initially; however, after this first response, all subse- 
quent attempts to elicit a caifeine response, even after normal 
"refilling," were completely blocked. We have observed that the 
ryanodine block of the caffeine response had precisely the same 
characteristics in rat sympathetic neurons (Thayer et al., 1 987b). 
BK also produced Ca^*^ transients in DRG cell bodies. These 
were still observed in €]!a'^-free medium. The magnitude of the 
C^^-^ transients produced by BK were similar to those produced 
by caffeine, although they were observed in considerably fewer 
cells (33%; Figs. 8, 9). The effects produced by BK desensitized. 
Following the production of a BK response, subsequent appli- 
cations of the peptide produced progressively smaller responses 
even if attempts were made to refill stores as with caffeine (Fig. 
8). 

The effects of these various stimuli on {Csi^^]i in individual 
DRG cell processes were also examined. Several interesting dif- 
ferences were observed in comparison with results obtained in 
cell bodies. First, we found that elevating [K'^Jo was less effective 
in raising [C;^^*]i in cell processes (Figs. 5, 9). Although this 
depolarizing stimulus was still effective in every process ex- 
amined, the net rise in (Ca^*! to 219 ± 37 nM (/i = 23) was 
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Figure 7. Caffeine-sensitive Ca^^ stores in sensory neurons. In DRG 
neurons perfused with Ca^^-frec (20 mm EGTA) medium, adding 10 mM 
caffeine to the solution at the times indicated, produced a rapid and 
transient increase in [Ca^^jj. Multiple responses to caffeine could be 
elicited only if the intracellular stores were allowed to refill with Ca^, 
accomplished here by perfusing with depolarizing (SO mM K*^) media 
containing normal Ca'* levels. When 1 ioa ryanodine was add^ to the 
perfusing medium prior to and during the second application of caffeine 
{B), the response to the third application of caffeine was always blocked. 
[C^^^], was measured as described in Materials and Methods. 



considerably smaller than that observed in the soma (Figs. 5, 
9). This is in contrast to sympathetic neurons, for example, in 
which we observed that raising [K^]o increased [Ca^'^^li in cell 
bodies and processes to a similar extent (Thayer et al., 1987a). 
The observations on DRG cells suggest some variability in the 
density of Ca^*^ channels in different parts of the cell. In addition, 
increases in [Qq}*\ produced by depolarization in cell processes 
were much more transient than the long-lasting increases ob- 
served in cell bodies (Fig. 5). Striking differences were also ob- 
served with respect to the effects of caffeine in cell processes. 
Whereas caffeine was very effective in virtually all cell bodies 
tested, it rarely produced any effects in cell processes. We did 
observe occasional small responses to caffeine in processes, but 
they were very infrequent (Fig. 9). These observations are sim- 
ilar to others made in rat and bullfrog sympathetic neurons, in 
which the effects of caffeine on [Ca^^Ji are also restricted to the 
cell soma (Lipscombe, 1987; Thayer et al., 1987a) (Figs. 9, 10). 
In contrast to caffeine, BK was at least as effective in cell pro- 
cesses as in the cell soma. The average increases in [Ca^*]j pro- 
duced by BK in cell processes (126 ± 57 nM, /i = 10) were 
similar to those observed in cell bodies (108 ± 37 nM, 10), 
and the response was seen more frequently (43 versus 33%; Fig. 
9). These differential effects could indicate that the stores acti- 
vated by caffeine and BK are different. That this may be so is 
further suggested by results such as those shown in Figtu^ 10. 
There, we illustrate a cell body that failed to respond to BK but 
showed a large caffeiue reponse. In contrast, a DRG process in 



4094 Thayer et al. • Bradykinin and Neuronal Ca^ Homeostasis 



500-1 



400- 



S. 300- 

CO 200- 



100- 



Figure 8. BK-induced increases in 
[Ca^^li. The cell body of a sensory neu- 
ron was perfused with either 100 nM BK 
or depolarizing (SO niM K^) medium at 
the times indicated by the horizontal 
bars. [Ca^^li was measured as described 
in Materials and Methods. 
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whicb caffeine failed to elicit a response showed a lai^e BK 
effect. Such differentia] sensitivity was frequently observed. Al- 
though lack of a response to BK may indicate the absence of 
BK receptors in some instances, taken together these results 

strongly suggest that caffeine and BK (IPj?)-sensitive stores are 
separate entities. 
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Figure 9, Comparison of the increases in [Ca'^ls produced by depo- 
larization, caffeine, and BK in somata and processes of sensory neurons. 
The frequency (A) and magnitude (B) of the responses in either DRG 
somata or processes to iwrfiision with media containing 50 mM K* (open 
bars), 10 mM caffeine (hatched bars), or 100 nM BK (solid bars) are 
shown. Basal [Ca^^]: levels were subtracted from the peak responses and 
displayed as means ± SE. 



Another observation we have made concerns the potential 
role of intracellular Ca^-*^ storage sites in Ca^^ buffering. As dis- 
cussed above, the shape of the C^^'*' transient evoked by de- 
polarization in the cell bodies of DRG neurons is very char- 
acteristic. After a rapid initial decline, the Ca^* signal decays 
only very slowly. We found, however, that the shape of this 
transient could be radically altered if either caffeine- or BK- 
sensitive stores were first discharged. After such treatment, the 
long-lasting tail of the Ca^* transient was abolished or at least 
very greatly reduced (Fig. 1 1). Following washout of the caffeine 
or BK stimulus, the depolarization-evoked Ca^+ transients re- 
turned to their original form. Indeed, following discharge of 
intracellular stores, the Ca'*^ transients obtained by depolariza- 
tion of DRG cells resembled those obtained in other peripheral 
or central neurons (see above). We also observed that acute 
treatment of cells with phorbol esters or down-regulation of 
protein kinase C following chronic phorbol ester treatment 
(Matthies et al., 1987; Ewald et al., 1988) did not alter the 
buffering of Ca^^ by the cells. This indicates that the effects of 
BK on Ca^+ buffering do not involve protein kinase C but are 
presumably related to the discharge of IPj-sensitive stores. 

We have noted above that the activation of IPy synthesis 
produced by BK in DRG neurons was not blocked by pertussis 
toxin. We therefore examined the effect of the toxin on BK- 
ihduced Ca^* transients in cell processes. Quite surprisingly, we 
observed that, although BK was still effective in some instances, 
its ability to increase [Ca^*]i in pertussis toxin-treated cells seemed 
to be substantially decreased (13% of the processes responded 
after treatment, n - 19 with a mean response of 60 ± 24 um; 
n = 3). Caffeine, as noted, produced very little effect in cell 
processes; if anything, however, it was even less effective fol- 
lowing the toxin (11% responded, w - 19; mean 37 nM; n = 2). 
In contrast, pertussis toxin had no effect upon the size of the 
Ca^^ transients produced in cell processes by depolarization (net 
increase 261 ±47 um, n = 19). 

Discussion 

The action of BK on virtually all its target cells leads to the 
activation of both phospholipase C and phospholipase Aj. This, 
in turn, leads to the production of a variety of lipid-derived 
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Figure 10, Separation of BK and caffeine sensitivity of the intracellular 
Ca'"^ stores in sensory neurons. A DRG soma and process (B) were 
perfused with Ca^*-free (20 mm EGTA) solution containing either 100 
nM BK or lOmM caffeine as indicated. Depolarization-induced Ca^^ 
influx was produced by perfusion with 50 mM K^ in normal Ca'*^- 
containing medium. The BK response in the process {B) is an especially 
laige [Ca'^^lj transient chosen because of the low noise in the record. 



second messengers (Hong and Deykin, 1982; Miller, 1987b). 
Such substances could mediate the known effects of BK on DRG 
excitability (Miller, 1987b). In the present study we have dem- 
onstrated that in DRG cells, as in other cases, BK does indeed 
stimulate the production of several phospholipid-derived sec- 
ond messengers, suggesting that it activates phospholipases C 
and A2. As in other cases, the IP, produced probably plays an 
important role in mediating the BK-induced changes in Ca^^ 
metabolism we have observed. The role of arachidonic acid in 
this respect is imclear, and we have not investigated it further. 
However, as we shall discuss, DAG and eicosanoids derived 
from the metabolism of arachidonic add are clearly essential 
in prpducing the overall excitatory effects of BK on DRG cells. 
This is primarily due to their effects on Ca^* and K'^ conduc- 
tances (see below). 

The general pattern of phospholipid metabolism produced by 
BK in- DRG cells is similar to that pi^iously reported in other 
tissues including a DRG x neuroblastoma clonal cell line, Fl 1 
(Francel et al., 1 987). One difference concerns the effects of PTX. 
In DRG cells, PTX had no effect on IP, production. However, 
in F- 1 1 cells a partial inhibitory effect was observed (Francel et 
al., 1987). Although it is clear that agonist-induced IP, synthesis 
utilizes a G-protein of some sort, the PTX sensitivity of this 
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Figure J L Intracellular stores and [Ca^*l buffering in sensory neurons. 
Increases in the [Ca**li in DRG cell bodies were induced by perfusion 
with depolarizing (50 mM K^) medium in normal Ca^^ as indicated by 
the hoiizontai bars. The cells were perfused with Ca'^-fiee (20 mm EGTA) 
solution duriog the recovery to basal [Ca'ii levek. Caffeine, 10 mM (4), 
or BK, 100 nM was added to the peifusion solution prior to and 
during the depolarization as indicated by the iumzontal bars. 



entity (or entities) has been found to vary widely (Miller, 1 988). 
Actually, it is only in a minority of cases that PTX sensitivity 
of the IP3 generating process has been observed (Berridge, 1987; 
Miller, 1988). In the particular case of BK, both PTX-sensitive 
and -insensitive IP, production has been reported (Miller, 1 988). 
Whether this means that BK receptors can be coupled to dif- 
ferent G-proteins (Burch and Axelrod, 1987) or vtliether it re- 
flects the known heterogeneity of BK receptors is as yet imclear 
(Manning et al., 1986). 

In many cell types, IP, generation subsequently leads to the 
mobilization of intracellular Ca'* (Berridge, 1987), This also 
appears to be true in DRG cells. BK clearly produced increases 
in [Ca^-^li in many cells even in Qi**-free medium. It is inter- 
esting to note that reponses to BK appeared to be of about the 
same magnitude even if Ca''*'-containing medium was used (S. 
Thayer and R. Miller, unpublished observations). This indicates 
that BK does not produce extensive Ca^* influx. Thus, any sub- 
sequent metabolism of IP3 to IP4 and associated Ca^^ influx in 
these neurons may not be of great importance (Higashida and 
Brown, 1986a; Irvine and Moore, 1986; Morris et al., 1987). It 
should be pointed out that, although we imagine that the in- 
creases in [Ca'^1 we have observed in response to BK involve 
IP3, the ability of IP, to actually mobilize Ca'* in DRG cells (or 
indeed in any other type of vertebrate neuron) remains to be 
directly demonstrated (see, however, Freedman and Aghanja- 
nian, 1987; Shah et al., 1987). BK did not increase [Ca^i, in all 
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neurons studied. This is probably a reflection of some degree 
of heterogeneity in the cultures. BK receptors probably exist 
primarily on small poly modal nociceptors and may not be pres- 
ent on the cell bodies or processes of all DRG neurons. 

In our studies, it appeared that BK-sensitive Ca^* stores ex- 
isted in both the cell soma and processes. These stores may 
difler from those that can be mobilized by high concentrations 
of methylxanthines. It is well known that caffeine can mobilize 
an intracellular Ca^*^ store in various types of muscle (Carafoli, 
1 987). It is thought that this type of store can participate in the 
phenomenon of "Ca^^-induced Ca^* release," which may be a 
way of amplifying the Ca^^ signal in these cells (Barcenas-Ruiz 
and Weir, 1987; Carafoli, 1987). By implication, such stores 
have also been thought to exist in bullfrog sympathetic neurons 
(Kuba, 1980), and this fact has now been directly demonstrated 
using imaging techniques (Lipscombe et al., 1987). Studies we 
have carried out using rat superior cervical ganglion neurons 
have also demonstrated the existence of caifeine-sensitive stores 
which can be blocked by both ryanodine and dantrolene (Thayer 
et al. 1987b). Presumably, therefore, these stores are similar to 
those found in muscle. The present studies demonstrate that 
methylxanthine-sensitive stores also seem to exist in DRG cells 
where again they can be blocked by ryanodine. It has been 
observed that in both rat and bullfrog sympathetic neurons caf- 
feine-sensitive stores seem to exist primarily in the cell soma 
(Lipscombe et al., 1987; Thayer et al., 1987b). This is also true 
in DRG cells. We also noted that in many instances it was 
possible to produce BK responses from a cell process that was 
completely insensitive to caffeine. If we make the assumption 
that the BK-induced rise in [Ca^'^li results from BK-stimulated 
IP} synthesis and subsequent mobilization of Ca^*^ stores, then 
it appears likely that the caffeine and IPj-sensitive stores are 
separate entities. Indeed, such a separation of IP,- and caffeine- 
sensitive stores has also been observed in non-neuronal tissues 
(Kanaideet al., 1987). 

It is clear from our data that the manner in which DRG cells 
buffer Ca^^ is quite extraordinary. In comparison with all other 
types of peripheral and central neurons we have studied, the 
shape of the Ca'^ transient produced upon depolarization is 
remarkable. Following a rapid phase of buffering, [Ca^^l^ falls 
very slowly. The tardiness of Ca^* buffering in these cells is also 
observed on a much shorter time scale when voltage-clamp steps 
are used to depolarize the cells (Thayer et al., 1 988). It is in- 
teresting to note that Jia and Nelson (1986) predicted that DRG 
cells might buffer Ca^* slowly on the basis of the paucity of 
mitochondria they observed in DRG nerve terminals. However, 
it is not clear that this is actually the reason involved. Thus, the 
Ca^^ transients observed in cell processes seem considerably 
faster than those observed in the cell soma. Our results indicate 
that the rate of Ca^^ buffering may be critically dependent on 
the availability of intracellular Ca^^ storage sites. Thus, if we 
discharged either the BK- or caffeine-sensitive stores, Ca^* buff- 
ering in the cell soma became relatively fast. This seems rea- 
sonable. K* depolarization, BK, and caffeine all raise [Ca^^L into 
the submicromolar range but seldom higher. Thus, a relatively 
high-affinity Ca'^ bufier is probably most important for the rapid 
removal of the accumulated Ca^*. Mitochondria and exchange 
systems are generally of lower affinity than intracellular Ca^-^ 
storage sites or the cell membrane Ca^* ATPase (Baker and 
DiPolo, 1984; Carafoli, 1987). We presume that after loading 
during elevated K"" depolarization, the Ca** buffering that occurs 
rapidly mostly reflects Ca^* entry into vacant stores. However, 



when these are full, the DRG soma appears to be able to cope 
with the remaining load only very slowly. In contrast, if storage 
sites have been vacated by using caffeine or BK, then once the 
Ca2+ released has been disposed of, presumably by expulsion 
from the cell, buffering of a depolarization-induced Ca^* load 
can occur more rapidly. 

Another observation we have made relates to the degree of 
increase in [Ca^^jj produced by depolarization of different por- 
tions of the DRG neuron. In rat superior cervical ganglion neu- 
rons we observed that K* depolarization increased [Ca^^Ji to the 
same extent in all portions of the cell (Thayer et al., 1987a, b). 
However, this is clearly not the case in DRG neurons. In DRG 
cells the overall density of Ca^^ channels in the cell processes is 
apparently lower than in the cell soma. However, it may well 
be that high concentrations of of Ca*^ channels exist in certain 
portions of the DRG processes from which neurotransmitter 
release actually occurs and that they are scarce in other portions 
of the process. Such a question is most easily answered using 
imaging techniques. It is also possible that Ca^^ buffering in 
processes is more efficient than in the cell soma. 

The increase in [Ca^^], produced by BK may be important in 
several respects. Thus, in NG108-15 cells mobilization of Ca'* 
from intracellular stores does lead to neurotransmitter release 
(Higashida, 1987). Furthermore, as a number of ionic conduc- 
tances are sensitive to Ca^*, BK-induced increases in [Ca^^L may 
help to regulate DRG excitability. It should further be noted 
that BK stimulates the release of DAG and arachidonic acid. 
The former has been observed to inhibit Ca^**^ currents in DRG 
cells (Rane and Dunlap), and the latter can be metabolized to 
various eicosanoids that have been shown to alter K*^ channels 
in DRG cells through the production of further second messen- 
gers such as cyclic AMP (Weinreich, 1986; Wonderlein and 
Weinreich, 1986). 
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Abstract Human Factor IX (Christmas factor) 
was isolated from the plasma of a patient with mild 
hemophilia B. The patient's plasma contained 5% 
Factor IX clotting activity but 100% Factor IX antigenic 
activity as determined by immunological assays, which 
included inhibitor neutralization and a radioimmuno- 
assay for Factor IX. This abnormal Factor IX is called 
Factor IX Chapel Hill (Factor IXch). Both normal 
Factor IX and Factor IXch have tyrosine as the NH^- 
terminal amino acid. The two proteins have a similar 
molecular weight, a similar amino acid analysis, the 
same number of gamma-carboxyglutamic acid residues 
(10 y-carboxyglutamic acid residues), and a similar 
carbohydrate content. Both exist as a single-chain 
glycoprotein in plasma. The major difference between 
normal Factor IX and Factor IXch is that the latter 
exhibits delayed activation to Factor IXa in the 
presence of Factor XIa and Ca'***. Thus, Factor IXch dif- 
fers from other previously described abnormal Factor 
IX molecules. 

INTRODUCTION 

Factor IX (plasma thromboplastin component, Christ* 
mas factor) is a clotting factor which participates in 
the middle phase of intrinsic blood coagulation. Di- 
minished activity of this &ctor results in hemophilia B. 
Factor IX is a glycoprotein that has been isolated 
from both^the human and bovine species and many 
of the characteristics of the normal factor have 
been published (1-8). In the circulation. Factor 
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IX exists as a single-chain zymogen which is converted 
to an active serine protease in the presence of Factor 
XIa (plasma thromboplastin antecedent) and Ca*'*' ions 
(9). During activation, the Factor IX zymogen under- 
goes proteolysis in a two-stage reaction that results in 
a two-chain molecule composed of a light and heavy 
chain linked by disulfide bonds. The first step in the 
proteolysis is the cleavage of a specific arginine- 
alanine peptide bond which results in an inactive 
intermediate product. The second cleavage splits an 
arginine-valine peptide bond with the release of a car- 
bohydrate-rich activation peptide and an active Factor 

IX enzyme (10, 11). The active site serine resides 
on the heavy chain (11). 

Activated Factor IX (Factor IXa), in the presence of 
phospholipids, thrombin-modified Factor VIII, and 
calcium ions, converts Factor X to Factor Xa (2). How- 
ever, the precise molecular interactions of Factor IXa 
with calcium, phospholipid. Factor VIII, and Factor 

X are not completely understood. Elucidation of the 
nature of the defect in abnormal Factor IX molecules 
found in some patients with hemophilia B (Christmas 
disease) may provide insights into frinction of normal 
Factor IX in hemostasis and its interaction with other 
clotting factors. Roberts et al. (12), and other investi- 
gators (13-19) have demonstrated that some patients 
with hemophilia B possess an abnormal Factor IX 
molecule which has low to undetectable clotting ac- 
tivity, but which is present in relatively normal 
amounts when measured by immunological techniques 
(12, 20-22). Because they contain cross-reacting ma- 
terial (CRM)' to specific homologous and heterologous 



* Abbreviations used in this paper: CRM. cross-reacting ma- 
terial; CRM*. CRM positive variants; CRM". CRM negative 
variants; CRM", CRM reduced variants; Factor IXch* Factor 
IX Chapel Hill; SOS, sodium dodecyl sulfate. 
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anti-Factor IX antibodies, these patients have been 
termed CRM^ (CRM positive variants). They can be 
distinguished from other hemophilia B patients who 
have reduced, but not absent, levels of both Factor 
IX antigen and Factor IX clotting activity (CRM*^; 
CRM reduced variants). The CRM**" patients can also 
be distinguished from those hemophilia B patients 
in whom Factor IX antigen and Factor IX clotting ac- 
tivity are absent or undetectable (CRM~; CRM nega- 
tive variants) (22). 

The purpose of this paper is to describe the isolation 
and characteristics of an abnormal Factor IX molecule 
from a mildly affected CRM"*^ hemophilia B patient. 
This abnormal Factor IX has been named Factor IX 
Chapel Hill (Factor IXch) (23). Factor IXch is compared 
with purified normal human Factor IX in terms of its 
molecular weight, NHj-terminal amino acid, content of 
y-carboxyglutamic acid residues, carbohydrate content, 
and activation by Factor XIa and Ca''*'. The major 
abnormality in Factor IXch appears to be defective 
activation by Factor XIa and Ca^"**, although abnormali- 
ties in interactions with other clotting factors cannot 
be excluded. 

METHODS 

Materials. The patient with Factor IXch is a 30-yr-old 
white male who has been followed at the University of 
North Carolina Hospital for 16 yr* He has a mild bleeding 
tendency and a Factor IX level of 5% of normal when 
tested in one-stage clotting assays. With a si>ecific antibody to 
Factor IX, however, the patient was found to have 100% of 
Factor IX when measured by an antibody neutralization 
test and a radioimmunoassay for Factor IX (vide infra). All 
other clotting factors were normal. Plasma for purification of 
Factor IXch was obtained from this patient who was previ- 
ously shown to be CRM'*' (24). The patient was plasma- 
pheresed every 2 wk to I mo, after informed consent, until 
sufficient plasma was obtained for the studies oudined b.elow. 
The blood was handled in a manner similar to that for 
normal blood as described below. The resultant plasma was 
stored at — TO'C and thawed at 37**C just before use. Normal, 
pooled human plasma was obtained from the Blood Bank of the 
North Carohna Memorial Hospital, Chapel Hill, N. C. Human 
blood was collected in plastic bags that contained standard 
. citrate phosphate dextrose anticoagulant and centrifuged at 
3,000 g for 30 min at 4^C to render the plasma platelet poor 
and free of erythrocytes. The plasma was then recentrifuged 
under the same conditions. Plasmas used for the purification 
of normal Factor IX were either fresh or stored for <2 
mo at -7(rC. 

Alumina gel was purchased from Calbiochem, San 
Diego, Calif. Benzamidine hydrochloride and cyanogen bro- 
mide were purchased from Eastman Organic Chemicals, Div. 
Eastman Kodak Co., Rochester, N. Y. Heparin sodium salt, 
bovine serum albumin, dithioerythritol, dansyl chloride, 
dansyl amino acids, galactose, mannose, glucose, fucose, N- 
acetylneuraminic acid, and amino sugars were obtained from 
Sigma Chemical Co., St. Louis, Mo. DEAE-cellulose was 
Whatman DE-52 (Whatman Chemicals, Div. W & R Balston, 
Maidstone, Kent, England). DEAE-Sephadex (A-50), Sephadex 
G-150, Ribonuclease A, chymotrypsinogen A, and ovalbumin 
were purchased from Pharmacia Fine Chemicals, Uppsala, 



Sweden. Bovine brain cephalin (Thrombo&x) was purchased 
from Ortho Pharmaceuticals Corporation, Raritan, N. J. Throm- 
boplastin reagent \(Simplastin) was purchased from General 
Diagnostics, Warner Lambert Company, Morris Plains, N. J. 
Agarose (Bio-Gel A-15 m, 100-200 mesh), Aminex A-4 Resin, 
/3-mercaptoethanol, and materials used in gel electrophoresis 
were obtained from Bio-Rad Laboratories, Richmond, Calif. 
Celite 545 was purchased from Johns-Manville, Denver, Colo. 
All other chemicals were of reagent grade or better. 

Protein concentrations. The Lowry method (25), with 
bovine serum albumin as a standard, was used for the estima- 
tion of protein concentration for the early stages of purifica- 
tion of Factor IX. Protein concentration on the final product 
was estimated by optical adsorption at 280 nm with experi- 
mentally determined extinction coefficients (vide infra). 

Coagulation assays, 1 U of Factor VII, VIII, IX, or X clot- 
ting activity was defined as that amount in 1 ml of normal, 
pooled human plasma. A one-stage assay, based on the partial 
thromboplastin time, was used for the measurement of Factors 
VIII and IX (26, 27). Factors X and VII were measured by a 
one-stage assay based on the prothrombin time (28). Prothrom- 
bin was assayed with a two-stage method (29). Factor XI was 
assayed with artificially depleted human plasma (30) as well 
as plasma congenitally deficient in Factor XI (31). Factor IX 
immunological activity was assayed by an inhibitor-neutraliza- 
tion technique (12, 21). Factor IX was also measured by a 
radioimmunoassay with purified normal human Factor IX and 
a human anti-Factor IX antibody (32). 

Preparation ofheparin-agarose. Heparin-agarose was pre- 
pared by a modification of the method described by Porath 
(33) and Cuatrecasas (34). 

Electrophoresis. Disc gel electrophoresis was performed 
in 0.2 M N,N-bis[2-hydroxyethyl]glycine-imidazole buffer 
at pH 7.8 for nondenatured protein and 0.025 M Tris-glycine 
buffer at pH 8.3 for sodium dodecyl sulphate-treated protein 
(35). 7.5 and 10% polyacrylamide gels were used for analytical 
separation of the proteins with 3% polyacrylamide as the 
stacking gel. /3-Mercaptoethanol and dithioerythritol were 
used as reducing agents. Immuno-electrophoresis was per- 
formed on 1% agarose gel in 0.075 M Tris-barbital buffer 
at pH 8.6 which contained 2 mM of calcium lactate. 

Purification of Factor IX ch, 60 ml of cold alumina gel was 
stirred into 600 ml of cold platelet-poor plasma, obtained from 
the patient with Factor IXch> for 5 min at 25*'C. The plasma 
was centrifuged and the supemate decanted. The adsorbate 
was washed twice with cold, normal saline and then eluted 
with 30 ml 0.3 M (NHJ, HPO4 buffer at pH 8.5. The elution 
was repeated at least once. Pooled eluate was dialyzed against 
4 liters of 0.01 M sodium phosphate buffer at pH 7.0. The 
dialysate was then applied to a DEAE-cellulose (DE-52, 
Whatman Chemicals) column (2.5 x 25 cm) which was washed 
extensively with sodium phosphate buffer containing 0.13 
NaCl until the OD (280 nm) of the effluent was <0.01. The 
column was eluted with a linear gradient of NaCl (from 0.13 M 
to 0.30 M) with an automatic level sensor. Factor IXcH-rich 
fractions were pooled and dialyzed against 4 liters of 0.05. M 
sodium citrate at pH 7.0. The dialysate was applied to a DEAE- 
Sephadex column (2.3 x 12 cm) (Pharmacia Fine Chemicals) 
which was washed with 0.08 M sodium citrate (pH 7.0) until 
the OD (280 nm) was <0.01. A linear gradient of sodium 
citrate from 0.08 to 0.30 M (pH 7.0) was then used to elute 
the column. Factor IXcH-nch firactions were pooled and dialyzed 
against 4 liters of 0.05 M imidazole-HCl buffer (pH 6.0) over- 
night, and the dialysate was brought to 2.5 mM CaCls concen- 
tration with a 1-M CaClx stock solution. A heparin-agarose 
column (1.1 X 4.5 cm) was used for the last step of purification. 
After the Factor IX solution was applied, the column was 
washed with 250 ml of imidazole-HCl buffer which contained 
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0.25 M NaCl and 2.5 mM CaCU. Factor IX was eluted with 
0.4 M NaCl and 2.5 mM CaClt in the same buffer. The pro- 
cedures were carried out at 4**C except when stated other- 
wise. All solutions used in the purification procedure con- 
tained 1 mM benzamidine hydrochloride. Purification was 
monitored by a two-stage Factor IX neutralization assay as 
well as clotting assays for Factors 11, VII, VIII, IX, and X. 

Purification of normal human Factor IX, Purification of 
normal Factor IX was similar to that described for Factor 

IXcH- 

Preparation of Factor XIa . 10 ml of normal, pooled plasma 
was adsorbed twice with alumina C'j gel for 30 min at 25°C. 
After centrifugation, the supernatant plasma was incubated 
with 200 mg of prewashed Celite 545 Qohns-Manville) in a 
plastic tube at 37**C for 10 min. The tube was inverted every 
1-2 min. The Celite suspension was collected by centrifuga- 
tion and then washed 3 times with 10 ml of distilled water. 
Factor XIa was eluted firom the Celite with 10 ml of 10% saline, 
and dialyzed against buffered citrated saline (pH 7.2). Factor 
XIa was also purified by the method of Lundblad and Kingdon 
(36) and used for die activation of purified Factor IX. 

Molecular weight. The molecular weight was estimated 
by the electrophoretic mobility method widi sodium dodecyl 
sul&te (SDS) disc gel electrophoresis as described above (35). 

Inhibitor neutralization assay and radioimmunoassay for 
Factor IX, The inhibitor neutralization assay was performed 
according to the method of Roberts et al. (12). A radioimmuno- 
assay was carried out according to the method described by 
Reisner et al. (32). 

Activation of Factor IX by Factor XIa and Ca*'*^. 3 mg of 
purified, normal human Factor IX were incubated with 100 
/Ag of partially purified Factor XIa at 3T'C in 0.05 M imidazole 
buffer (pH 6.0) in a final volume of 3 ml which contained 
5 mM of CaClf. At selected time intervals an aliquot of the 
incubation mixture was added to platelet-poor, unactivated 
Factor IX-deficient substrate plasma obtained from a severely 
affected CRM~ hemophilia B patient. A nonactivated, partial 
thromboplastin time and a Factor IX assay were performed 
on die final mixture. At the same time intervals, aliquots fiom the 
first incubation mixture were evaluated on SDS gel electro- 
phoresis to determine the time of appearance of lower molecular 
weight components. Factor IXch was also activated in the same 
manner, and monitored in a similar &shion. Appropriate controls 
were run by substituting imidazole buffer for Factor XIa» Fac- 
tor IX, and Ca*'*' in the incubation mixture. 

NHrterminal amino acid analysis. The NHfterminal 
amino acid was determined by Edman degradation (37) with 
a Beckman model 890-C Sequencer (Beckman Instruments, 
Inc., Fullerton, Calif.). Phenyl isothiocyanate was coupled to 
the free amino terminal group of the protein by incubation 
at 55°C for 15 min. Anhydrous heptafluorobutyric acid was 
used to release the phenyl isothiocyanate derivative of the 
NHt-terminal amino acid which was identified by gas chroma- 
tography. The NHs-terminal amino acid was also determined 
by thin layer chromatography with dansyl chloride (38). 

Amino acid composition of human Factor IX , Amino acid 
analysis was performed by the method of Spackman et al. 
(39). Samples in 6 N HCl were sealed in Pyrex (Coming Glass 
Works, Coming, N. Y.) tubes (Research & Development Prod- 
ucts, Berkeley, Calif.) at reduced pressure and hydrolyzed 
at llO^C for 24 and 72 h. y-Carboxyglutamic acid was quan- 
titated from alkaline hydrolysates of protein with a modifica- 
tion of the method of Hauschka et al. (40). 1 mg of each of 
these samples was dissolved in 2 ml of 2 N KOH and sealed 
at reduced pressure in glass tubes lined with polypropylene. 
Hydrolysis was carried out at llO^C for 24 h. After alkaline 
hydrolysis, samples were brought to pH 7.0 by drop-wise addi- 
tion of 35% HCIO4. The solutions were evaporated and the 



amino acids were eluted from the precipitate cake with ali- 
quots of 0.2 N sodium citrate buffer, pH 2.2. Amino acid analy- 
ses were performed on either a modified Phoenix model 
VG6000-B (Phoenix Precision Instrument, Div. Virtis Co., Inc., 
Gardiner, N. Y.) or a Beckman model 116 analyzer (Beck- 
man Instruments, Inc.). The determination of cystine as cysteic 
acid was performed after Moore (41) with bovine prothrom- 
bin as a standard to determine the yield of the reaction. The 
relative amount of tryptophan to tyrosine in Factor IX was 
determined by a spectrophotometric method (42) in 0.1 N 
NaOH with a Cary 118 spectrophotometer (Cary Instruments, 
Fairfield, N. J.). Tryptophan residues per molecule of Factor 
IX were calculated firom the tyrosine residues obtained in 
the acid hydrolysis. 

Carbohydrate composition. Neutral sugars were deter- 
mined by quantitative gas liquid chromatography (43) after 
being released from the protein by hydrolysis in 2 N HtSO^ 
at lOO^C for 5h and converted to alditol acetates (44). A 
colorimetric method with the phenolsuliiiric acid reaction and 
a 1:1 mixture of galactose and mannose as standard was also 
employed to confirm die result of total hexoses (45). Total 
hexosamine was determined according to Gardell (46). 
Sialic acid was assayed with 2-thiobarbituric acid (47). 

Extinction coefficient. Purified normal Factor IX and Fac- 
tor IXcH were rendered salt-free and lyophilized. The lyoph- 
ilized proteins were then dissolved in 0.05 M imidazole- 
HCl buffer (pH 6.0) and the optical absorption at 280 nm 
was measured and extinction coefficients were calculated. 

RESULTS 

Summaries of the purification results of Factor IXch 
and normal Factor IX are shown in Tables I and II, 
respectively. Both normal Factor IX and Factor IXch 
were purified from 17,0(X)- to 18,000-fold with a yield 
ranging from 47 to 53%. 

When the final products were electrophoresed on 
SDS gels only one band was seen for normal Factor 
IX and Factor IXch as shown in Fig. 1. The apparent 
molecular weights according to this technique were 
70,0(X) for both the normal product and Factor IXch* 
When both normal Factor IX and IXch were run in 
SDS gels in the presence of /3-mercaptoethanoI or 
dithioerythritol, only one band was seen, which in- 



Table I 
Purification of Factor IXch 



Stage 


Vol- 
ume 


Total 

protein 


Clotting 

activity 


Yield 


Purification 




ml 


mg 


U 


% 


-fold 


Plasma 


600 


3.1 X 10* 


28.2 


100 


1 


Al (OH), 


60 


155 


26.8 


95 


190 


DE>52 


150 


5.89 


19.5 


69 


3,640 


DEAE- 












Sephadex 


60 


2.65 


17.7 


63 


7,340 


Heparin- 












agarose 


35 


0.96 


16.3 


58 


18,700 










(54)* 


(17,300)* 



* Antigenic determination in parentheses. 
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Table II 

Purification of Normal Human Factor IX 





Vol- 


Total 


Clotting 




PuriB- 


Stage 


ume 


protein 


activity 


Yield 


cation 




ml 


tng 


U 


% 


-fold 


Plasma 


2.000 


15.3 X 10* 


1.7 X 10» 


100 


I 


Al(OH), 


203 


7.3 X 10* 


1.66 X 10» 


97.2 


204 


DEAE- 












Sephadex 


90 


1.1 X 10» 


1.6 X 10* 


94.2 


1.301 


DE-52 


180 


10.3 


1.0 X 10> 


59.0 


8.769 


Heparin- 












agarose 


45 


4.2 


0.8 X 10* 


47.0 


17.163 



dicates that the zymogen form of both molecules is 
composed of a single polypeptide chain (Fig. 1). To 
show that the bands on SDS gels were indeed Factor 
IX, the purified normal Factor IX and Factor IXch were 
electrophoresed on .non-denatured polyacrylamide 
gels. Two gels for each product were used. One gel 
was stained with Coomassie Blue (Bio-Rad Labora- 
tories, Richmond, Calif.) to identify the position of the 
protein band while the other gel was sliced into 1—2 
mm lengths and eluted with buffered citrated saline 
as shown in Figs. 2 and 3. As can be seen in Fig: 2, 
normal Factor IX clotting activity was eluted in the 
same position as the protein band. In Fig. 3 both clot- 
ting and immunologic assays were used to detect Fac- 
tor IXcH* On immunoelectrophoresis and immunodif- 
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12 3 4 

Figure 1 SDS gel electrophoresis of: (i ) nonreduced Factor 
IXch; (2) nonreduced normal Factor IX; (3) reduced Factor 
IXch; ) reduced normal Factor IX. 




PROTEIN MIGRATION 
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Figure 2 Disc gel electrophoresis of normal human Factor 
IX. 15 /xg of purified normal human Factor IX was electro- 
phoresed in 7.5% polyacrylamide as shown on the top panel. 
Protein was identified by staining with Coomassie Blue. The 
nriigration was toward the anode. A duplicate gel was sliced 
into 2- mm segments, as shown in the lower panel, and each 
segment was crushed and incubated with 0.2 ml 0.15 M NaCI 
in 0.05 M imidazoIe-HCl buffer (pH 6.0) at 4*^0 overnight. 
The sample was centrifuged and the supemate assayed for 
Factor IX. 
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Figure 3 Disc gel electrophoresis of Factor IXch- 15 Mg of 
purified Factor IXch was electrophoresed on 7.5% polyacryl- 
amide as shown in the top panel. The protein was identified 
by staining with Coomassie Blue. The migration was toward 
the anode. A duplicate gel was sliced into 1-mm segments 
and each segment was crushed and incubated with 0.1 ml 0.15 
M NaCl in 0.05 M imidazole buffer (pH 6.0) at 4*^0 overnight. 
The Factor IX clotting activity was assayed with the one- 
stage method and the antigenic activity was assayed by the 
techniques described in Methods: (O) antigenic activity; 
(•) clotting activity. 
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fusion, purified normal Factor IX and IXch were not 
recognized by. antibodies to whole human serum, al- 
bumin, prothrombin, and Factor X but were reactive 
with anti-Factor IX antibodies. 

Amino acid analyses of both the normal Factor IX 
and Factor IXch are shown in Table III. The amino acid 
compositions of both Factor IX proteins are similar. 
Both normal Factor IX and Factor IXch contain 10 
y-carboxyglutamic acid residues per molecule of 
Factor IX based on our observed molecular weight 
for the peptide. They have been subtracted from the 
glutamic acid content as reported in Table III. 

The NHs-terminal amino acid of both the normal and 
abnormal molecule is tyrosine, as determined by both 
automatic and manual methods on several determina- 
tions. 

The carbohydrate content of normal Factor IX and 
IXcH is also shown in Table III. The carbohydrate con- 
tent of each protein is nearly identical except for a 



Table III 

Amino Acid and Carbohydrate Compositions of Normal 

F€ictor IX and Factor IXch 



Components 


Factor IX 


Factor IXcm 






no. of- 


Amino acid 




residues 


Asp 


51.9 


50.4 


Thr 


32.8 


30.9 


Ser 


32.6 


33.7 


Glu 


53.8 


56.1 


Pro 


17.6 


20.4 


Gly 


43.3 


43.1 


Ala 


28.6 


30.1 


Cys/2 


24.0 


26.5 


Val 


32.1 


31.5 


Met 


6.3 


3.9 


He 


19.9 


17.9 


Leu 


32.2 


32.2 


Tyr 


18.3 


18.1 


Phe 


23.0 


19.8 


Lys 


29.7 


29.1 


His 


10.0 


10.7 


Arg 


22.1 


20.5 


Trp 


10.0 


10.1 


Ola* 


9.8 


10.2 


Total residues 


498.0 


495.2 


Mol wt 


56,600 


56,600 



Carbohydrates 
Galactose 
Mannose 
Glucose 
Fucose 
Hexosamines 
Sialic Acid 

Total 



1.81 
1.72 
0.31 
0.39 
7.30 
6.40 
17.93 



1.93 
1.65 
0.41 
0.51 
7.00 
7.60 
19.10 



slighdy higher sialic acid content of Factor IXcHt but 
this is within the limits of experimental error. 

The results of activation of both normal Factor IX 
and Factor IXch are shown in Fig. 4. In this experi- 
ment, when normal Factor IX is activated by Factor 
XIa and Ca**, the clotting times are progressively 
shortened. When purified Factor IXch is activated by 
Factor XI and Ca'* under the .same conditions, very 
little shortening of the unactivated partial thrombo- 
plastin time is observed. If Factor XIa, Factor IX, or 
Ca*'*' is omitted from the incubation mixture as a 
control, there is again very little shortening of the 
clotting time. 

When the activation of normal Factor IX is followed 
by SDS gel electrophoresis, as shown in Fig. 5, the 
70,000-mol wt band is converted to a lower molecular 
weight component (estimated mol wt 50,000) within 
30 min. When Factor IXch is activated under the same 
conditions, no change in molecular weight is observed. 
Even after incubation for 20 h, only very litde lower 
molecular weight component was formed. These find- 
ings were consistent on several determinations. 

The extinction coefficients (EiSnm) were similar, 
11.8 for Factor IX and 12.5 for Factor IXch- 

DISCUSSION 

An abnormal Factor IX molecule obtained firom a pa- 
tient with mild hemophilia B has been isolated and 
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* y-Carboxyglutamic acid. 



INCUBATION TIME (min) 

Figure 4 Activation of Factor IXch and normal human Fac- 
tor IX by activated human Factor XI and Ca. The incubation 
mixture consisted of 3 mg Factor IX protein (or buffer con- 
trol); 100 /Ag Factor XIa (or buffer control); 5 mM CaCls in a 
filial volume of 3 ml (0.05 M imidazole buffer, pH 6.0). At 
the indicated time intervals, 0.1 ml of the incubation mix* 
ture was diluted with imidazole buffer (1:200 for Factor IX 
and 1:20 for Factor IXch) and was added to 0.1 ml unactivated 
Factor IX-deficient substrate. Clotting was achieved by the 
addition of 0.2 ml of a 1:1 mixture of 32 mM CaCIs and 10% 
Thrombofax in normal saline. 
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INCUBATION TIME (min) 

Figure 5 Activation of Factor IXch and normal human Fac^ 
tor IX by activated human Factor XI and Ca. The activation 
conditions were the same as those described in Fig. 4 except 
that at the indicated time interval, an aliquot of the incuba- 
tion mixture which contained as 15 fig protein was subjected 
to SDS gel electrophoresis. Gels 1—3 show normal Factor IX 
which was converted to a lower molecular weight component. 
Gel 4 shows Factor IXch which showed no change in mo- 
lecular weight. 

characterized. The patient's plasma contains 5% Factor 
IX clotting activity but 100% Factor IX antigen as de- 
termined by radioimmunoassay and by antibody neu- 
tralization experiments as previously described. This 
variant Factor IX (Factor IXch) was first described by 
Aronson (23) who noted that addition of SrCU to the 
patient's plasma did not result in activation of the 
patient's Factor IX, while addition of SrClj to normal 
plasma caused rapid activation of normal Factor IX. 
After activation promoted by SrCU, the apparent mo- 
lecular weight of the patient's Factor IX did not change, 
while that of normal Factor IX, as estimated on a Sepha- 
dex column (Pharmacia Fine Chemicals, Inc.), had a 
smaller molecular weight (23). However, Aronson used 
a plasma preparation, not a purified Factor IX. 

We have purified Factor IXch 18,000-fold. The prod- 
uct is homogeneous by several criteria. In the purifica- 
tion procedure, the DE-52 ion exchange and the heparin- 
agarose affinity chromatography are critical. In the 
former, the flow rate must be maintained at least at 60 
ml/h. On heparin-agarose chromatography, calcium 
ions were used to enhance the binding of both 
normal Factor IX and Factor IXch to the column. In 



the presence of 2,5 mM CaCla, prothrombin does not 
bind to the column and can be collected in relatively 
pure form after the sample application. The column 
is washed extensively with imidazole buffer, which 
contains 0.25 M NaCl and 2.5 mM CaCl2, to remove 
Factor X and prothrombin remaining in the column. 
Factor IXch <-*an then be eluted with 0.4 M NaCl in 
the same buffer. 

Factor IXch resembles normal human Factor IX in 
several respects. Although several other values of mo- 
lecular weight for normal human Factor IX have been 
reported (2-6, 8) including one as low as 57,000 as de- 
termined by sedimentation equilibrium (8), higher 
molecular weights are found with the Laemmli SDS 
gel method (35). Nevertheless, it is apparent that both 
Factor IX moleculars have a similar molecular weight. 
In addition, both normal Factor IX and Factor IXch 
have tyrosine as the NH2-terminal amino acid. Amino 
acid analyses of both proteins are very similar. Com- 
pared to analyses for normal human Factor IX pub- 
lished by other investigators (2, 3, 6, 8), by normalizing 
molecular weights to 50,000, our analyses were found 
to be in good agreement with some reports in terms 
of most of the individual amino acid (3, 8) as well as 
the total residues (8). This type of analysis, however, 
obviously does not exclude a single amino acid sub- 
stitution in Factor IXch- Furthermore, the carbohydrate 
content of the normal and abnormal molecules is similar 
and is in agreement with that reported by others for 
the normal molecule (3, 6, 8). Moreover, Factor IXch 
contains the same number of y-carboxyglutamic acid 
residues as normal Factor IX. Thus, there is no evi- 
dence that the abnormality in Factor IXch is the re- 
sult of defective y-carboxylation of the molecule dur- 
ing postribosomal modification in the hepatic parenchy- 
mal cells. 

Of particular interest are the studies on the activa- 
tion of Factor IXch* Activation of this abnormal Factor 
IX by Factor XIa and Ca*"*^, when followed by both 
coagulation assays and SDS gel electrophoreses, strongly 
suggests that the major abnormality in Factor IX< h is 
defective activation. These findings are in agreement 
with those reported by Aronson (23), except that activa- 
tion of the purified protein is even slower than in a 
plasma system. Thus, it is possible that the major defect 
of Factor IXch is delayed activation under physiological 
conditions. Factor IXch may be the result of a single 
amino acid substitution governed by gene(s) on the X 
chromosome. Such a substitution could retard the cleav- 
age of bonds that result in the two-chain Factor IXa 
molecule. Prolonged incubation of Factor IX< h with 
Factor XIa and Ca*"*" for up to 20 h does result in the 
formation of a lower molecular weight component, but 
this process is very slow and most of the Factor IX< h 
remains in the zymogen form. Activated Factor IX<*h 
has not yet been isolated in amounts sufficient to com- 
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pare specific activity of Factor IXacn with that of normal 
Factor IXa. Fujikawa et al. (10) and Lindquist et al. 
(48) have shown, with purified bovine Factor IX, that 
an Arg-Ala bond is selectively cleaved by Factor XIa 
resulting in an inactive intenmediate; a second proteo- 
lytic step cleaves an Arg-Val bond, releases a carbo- 
hydrate-rich activation peptide, and results in activated 
Factor IX. Whether the amino acid substitution in Fac- 
tor IXcH is close to sites of cleavage by Factor XIa 
and Ca*"*", or whether the substitution is remote and 
exerts its effect through tertiary conformational changes 
that hinder the proteolytic action of Factor XIa is not 
yet elucidated. 

While it seems clear that Factor IXch is not activated 
normally, it is difficult to determine whether this is 
the only defect. If Factor IXch can be activated by sub- 
stances such as Russell's viper venom or kallikrein, 
it would be possible to study the interaction of Factor 
IXacH with calcium, lipid, and Factors VIII and X. These 
studies are currently in progress in our laboratory. 

It could be postulated that the patient with Factor 
IXcH is synthesizing both a normal protein (that is con- 
tributing the 5% coagulant activity) and a nonfunction- 
ing protein that retains antigenic activity but not clot- 
ting activity. This explanation seems unlikely on a 
genetic basis because hemophilia B is inherited as an 
X-linked recessive characteristic and it would be un- 
likely to have a gene coding for normal Factor IX and 
another for abnormal Factor IX. Moreover, in the activa- 
tion experiment with clotting assays, one would expect 
5% normal Factor IX to activate normally which was 
not the case. The Factor IX protein purified from the 
affected patient is probably composed entirely of a 
Factor IX protein that does not function normally in 
coagulation reactions. 

Other abnormal Factor IX molecules almost certainly 
exist as initially postulated by Fantl et al. (49), Roberts 
et al. (12), and Hougie and Twomey (50). The latter 
authors described a hemophilia Bm kindred which was 
shown to be CRM+ by Brown et al. (16) and which 
has recendy been purified and characterized by Osterud 
et al. (51). Factor IXbm differs from Factor IXch in that 
the former is associated with a prolonged ox-brain 
prothrombin time and can be activated by Factor XIa 
in the presence of Ca*"^ (51). Furthermore, it should 
be noted that Factor IXch is different from the Factor 
IX abnormality in hemophilia B Leyden (52). The Fac- 
tor IX level in hemophilia B Leyden increases as the 
patient gets older. This is not the case with the pa- 
tient with Factor IXch who has been followed since 
childhood. Factor IXch is also different from CRM"* 
and CRM" kindred previously reported by us (53). 
Furthermore, Factor VII levels are normal in the pa- 
tient with Factor IXch who is thus different from the 
Factor IX variants reported by Kasper et al. (14) and 
Girolami et al. (15). 

Studies of abnormal Factor IX molecules should help 



in resolving some of the structure function relation- 
ships of normal Factor IX because the putative amino 
acid substitutions in the abnormal forms may affect 
different functions of the Factor IX molecule. For ex- 
ample, some abnormal Factor IX proteins may exhibit 
delayed activation by Factor XIa and calcium, while 
others may exhibit defective interaction with other 
blood clotting factors. The molecular basis of the inter- 
action of Factor IX with other blood clotting factors, 
calcium, and phospholipid should provide better in- 
sight into the pathophysiology of the clinical disease 
state, hemophilia B. 
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Human immunodeficieiicy virus type 1 (HIV-1) entry is governed by the interaction of the viral envelope 
glycoprotein (Env) with its receptor. The HIV-1 receptor is composed of two molecules, the CD4 binding 
receptor and a coreceptor. The seven-membrane-spanning chemokine receptor CCR-5 is one of the coreceptors 
used by primary isolates of HTV-l. We demonstrate that the mouse homolog of CCR-5 (mCCR-5) does not 
function as an HIV-1 coreceptor. A set of chimeras of human CCR-5 and mCCR-5 was studied for £nv-induced 
ceil fusion and HIV-1 infection. Using the HTV-l^^ envelope glycoprotein in a syncytium formation assay, we 
show that replacement of any fragment containing extracellular domains of mCCR-5 by its human counter- 
parts is sufficient to allow £nv-induced fusion. Conversely, replacement of any fragment containing human 
extracellular domains by its murine counterpart did not lead to coreceptor function loss. These results show 
that several domains of CCR-5 participate in coreceptor function. In addition, using a panel of primary 
nonsyncytium-inducing and syncytium-inducing isolates that use CCR-5 or both CXCR-4 and CCR-5 as 
coreceptors, we show that the latter dual-tropic isolates are less tolerant to changes in CCR-5 than strains with 
a more restricted coreceptor use. Thus, different strains are likely to have different ways of interacting with the 
CCR-5 coreceptor. 



Human immunodeficiency virus type 1 (HIV-1) enters target 
cells by fusion of its envelope with the membrane of the cell. 
This process is initiated by the high-affinity binding of the 
envelope glycoproteins (Env and gpl20) to the CD4 molecule. 
Subsequent to the Env-CD4 interaction, a number of confor- 
mational changes in gpl20 and probably in CD4 take place, 
leading to virus entry or syncytium formation (reviewed in 
references 58 and 83), These post-CD4 binding events are not 
completed in most nonhuman cell lines and in some human 
cell lines expressing human CD4, and membrane fusion fails to 
occur (5, 20, 23, 53), It was suggested that these cells lack a 
component necessary for the fusion and entry processes to be 
completed (13, 35, 41). Many of these experiments were done 
with syncytium-inducing (SI) T-cell-line-adapted (TOLA) 
strains of HIV-1, but later evidence suggested that different 
molecules may be required for the entry of primaiy non-SI 
(NSI) isolates (2, 12). The second components of the receptor 
complex, or coreceptors, have been identified (3, 22, 32, 34, 36, 
38) and belong to the seven-transmembrane G protein-cou- 
pled chemokine receptor family (reviewed in reference 69). 

As predicted, the coreceptors for TOLA isolates and pri- 
mary isolates are different: the CXCR-4 chemokine receptor 
(previously known as LCRl, LESTR, HUMSTR, or fusin) 
functions as an HIV-1 coreceptor for TCLA as well as for 
primary SI or T-cell-tropic strains, whereas all primary NSI or 
macrophage-tropic (M-tropic) strains tested so far use the 
CCR-5 chemokine receptor (3, 22, 32, 34, 36. 38, 78). Other 
members of the chemokine receptor family, including CCR-2b 
and CCR-3, can also be used by a restricted number of HIV-1 
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isolates (22, 34, 42, 71). This extension of the use of the core- 
ceptor by primary isolates has been shown to correlate with 
disease progression (30). However, CXCR-4 and CCR-5 seem 
to be the main coreceptors used by HIV-1 in vivo, irrespective 
of the viral genetic subtype (89). The importance of CCR-5 as 
an HIV-1 coreceptor in vivo has been confirmed with the 
identification of a CC/?-5-defective allele in some multiply 
exposed uninfected individuals (31, 43, 51, 73). 

Chemokines are small proteins involved in cellular recruit- 
ment and activation. They show some sequence homology and 
can be divided into four subgroups according to the spacing of 
their amino-terminal cysteine residues. The two main groups 
are CXC (or a) chemokines and CC (or p) chemokines (re- 
viewed in references 69 and 84). CXCR-4 binds the CXC 
chemokine stromal cell-derived factor 1 (9, 60), and CCR-5 
binds RANTES, macrophage inflammatory protein la (MIP- 
la), and MIP-ip (29, 70, 72), which are CC chemokines. These 
chemokine ligands are able to block HIV-1 entry and fusion (9, 
26, 27, 60, 63, 65). In addition, receptor antagonists based on 
these chemokines can also block HIV-1 entry (4, 76). The exact 
mechanism of this inhibition is not yet known but most likely 
involves steric blockade of the coreceptor by the ligand. Alter- 
natively, chemokine-mediated inhibition of HIV entry may be 
due to down-regulation of the receptor on the surfaces of 
permissive cells. 

In fact, it has recently been suggested that a ternary complex 
consisting of gpl20, CD4, and the coreceptor forms on the 
surfaces of permissive cells (50, 81, 88), The molecular details 
of the interaction between gpl20, CD4, and the coreceptor are 
still not clear. It has been demonstrated that the third hyper- 
variable region (V3 loop) of the gpl20 envelope glycoprotein is 
likely to be a component of the coreceptor binding site (81, 88) 
and to influence coreceptor choice (22, 33, 67), but the inter- 
action between gpl20 and the coreceptor molecule has not yet 
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been completely characterized. Using molecular chimeras 
composed of two members of the CC chemokine receptor 
family, CCR-5 and CCR-2b (6, 71), or of mouse and human 
CCR-5 (mCCR-5 and hCCR-5, respectively) (6, 8), three 
groups of researchers have recently shown that the amino- 
terminal and possibly the first and second extracellular loops of 
CCR-5 are important for coreceptor function and specificity. 
Two of these studies, however, assessed coreceptor use by 
cell-cell fusion only (8, 71), while the third (6) used cell-free 
virus infection, but with a limited number of isolates only. 
Although cell-cell fusion usually correlates well with cell-free 
virus infection, some exceptions to this rule have been reported 
(55, 77). 

To map the regions important for CCR-5 to function as an 
HIV-1 coreceptor for cell-cell fusion and virion-cell fusion, we 
took advantage of the existence of the mouse homolog of 
CCR-5 (mCCR-5) (.10, 57). We confirm here that this molecule 
does not function as a coreceptor for HIV-1. By generating 
molecular chimeras composed of hCCR-5 and mCCR-5, we 
have further defined regions in the coreceptor that are impor- 
tant for HIV-1 entry and fusion with Env-expressing cells. 
Several domains of CCR-5 are shown to be involved in HIV-1 
entry. Moreover, different HIV-1 isolates showed various pat- 
terns of interaction, implying a differential use of the CCR-5 
coreceptor for NSI M-tropic versus SI dual-tropic strains. 

MATERIALS AND METHODS 

Cells and viruses. CCC-CD4 cells are cat kidney cells stably transfected with 
a human CD4 expression vector (24). HeLa-CD4-LTR/flfcZ (HeLa-P4) cells are 
HeLa cervical epithelial cells stably transfected with human CD4 and a long 
terminal repeat (LTR)'/acZ construct (25). Upon infection by HIV or fusion 
with Tat-expressing cells, transcription of the lacZ gene directed by the LTR is 
stimulated and leads to the accumulation of ^-galactosidase in the cell. HeLa- 
Env cells are HeLa cells stably transfected with a dkgag-pol provirus containing a 
dhfr gene in place of nef ^ which confers resistance to methotrexate. HeLa-Env^^ 
(74) and HeLa-Env^i^A (a gift from A. Brelot, Institut Cochin de G6n6tique 
Moleculaire, Paris, France) (68) respectively express the HIV-1 LAI and ADA 
envelope glycoproteins. All these HeLa-Env cells also constitutively express the 
HIV-1 transactivator Tat All the adherent cell lines were maintained in Dul- 
becco modified Eagle medium supplemented with 10% fetal calf scrum (FCS) 
(or 5% FCS for CCC-CD4), antibiotics, and 0.5 rag of G418 (CCC-CD4 and 
HeLa-P4) per ml or 5 p.M methotrexate (HeLa-Env). 

All the HlV-1 isolates used in this study have been described previously. LAI 
(82) is a TOLA strain, and GUN-1 (80) is also TOLA but dual tropic, as it can 
also infect macrophages (56, 77). 89.6, a gift from R. Collman (University of 
PennsyNania, Philadelphia), is a primary HIV-1 strain that infects macrophages 
and certain CD4'^ T-cell lines and is therefore also dual tropic (28). ADA (86), 
SF-162 (19), M23, and E80 are primary NSI M-tropic strains of HIV-1 (78). All 
isolates are of subtype B except E80 and M23, which arc unclassified. Viral stocks 
from primary isolates and TCLA strains were produced in peripheral blood 
mononuclear cells (PBMCs) and H9 cells, respectively, as described previously 
(78). ADA viral stocks were produced by transfection of the molecular clone in 
HeLa cells (1). The ADA-producing plasmid, a gift from M. Alizon (Institut 
Cochin dc G^n6tique Moldculaire, Paris, France), is a recombinant between the 
90. 1 molecular clone of HIV-Ilai and the Kpnl-Aval fragment encompassing the 
ADA envelope protein (68). 

hCCR-5 and mCCR-5 chimera construction. The hCCR-5 (70) (a gift from P. 
Gray, ICOS Corp., Bothell, Wash.), mCCR-5 (57), and CXCR-4 cDNAs were 
subcloned by PCR into the pcDNA3 expression vector (Invitrogen, Leek, The 
Netherlands) between the //mdlll and Notl sites. The sequences, determined by 
automated sequencing (Applied Biosystems) with a Perkin-Elmer sequencing kit 
and analyzed with the Genetics Computer Group package, were found to differ 
at a few positions from the published sequences (Fig. 1). The differences resulted 
in conservative amino acid (aa) changes: F80L in mCCR-S and F166L in 
hCCR-5. The cDNAs encoding the receptors were fused at the carboxy-tcrminus 
to the 12-aa-long c-myc epitope tag: NH,-GG REOKLISEEDLA -COOH (37). 
We decided to tag all the constructs at the end of the C-terminal intracellular 
domain to avoid interference with i>otenttal interactions between CD4 and/or 
gpl20 and extracellular segments of the coreceptor. To generate hCCR-5 and 
mCCR-5 chimeras, a unique EcoRW restriction site was engineered by site- 
directed mutagenesis (Transformer kit; Clontech, Palo Alto, Calif.) in the 
hCCR-S sequence at the same position as in the mCCR-5 sequence (Fig. 1). 
Using the same technique, we also introduced a unique Kpnl restriction site in 
the mCCR-5 sequence at the same position as in the hCCR-5 sequence (Fig. 1). 
Introduction of these new common sites did not introduce changes in the amino 
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FIG. 1. Amino acid sequence alignment of hCCR-5 and mCCR-5. The two 
sequences were aligned with the gap program. The sequences display 91% 
similarity and 83% identity. Potential transmembrane segments (listed under 
SwissProt accession no. F51682) are shaded and numbered with Roman numer* 
als. Extracellular loops arc also numbered. Restriction endonuclcase sites are 
labeled and overlined. Only changes at the amino acid level are shown, while 
identical amino acids are indicated by dashes. Asterisks identify differences from 
the published sequences. N-TERM., N terminus. 



acid sequences of the proteins. Tlic different chimeric constructs were then 
generated by exchanging fragments by using the four common unique restriction 
sites, //i/idlll, EcoRV, Kpnl, and Wo/I. The WmdlU-EcoRV fragment spans the 
N terminus, transmembrane 1 (TMl), and intracellular loop 1 (II) of CCR-5; the 
EcoRV-Kpnl fragment spans TM2, extracellular loop 1 (El), and TM3; and the 
Kpnl-Notl fragment spans 12. TM4, E2, TM5. 13, TM6, E3, TM7, and the C 
terminus. The. structures of the hCCR-5-mCCR-5 chimeras are schematically 
depicted in Fig. 2. 

CCR-5 cell surface expression. The constructs encoding the different CCR-5 
chimeras were transfected into subconfiucnt HeLa-P4 or CCC-CD4 cells (5 p-g 
per 50-mm-diameter petri dish) by the calcium phosphate precipitation method. 
After 16 to 20 h of incubation, the cells were washed in phosphate-buffered saline 
(PBS), fed, and incubated for another 48 h. The cells were then fixed in meth- 
anol-acctone (1:1) for 5 min at room temperature and subsequently washed in 
PBS-1% FCS. The cells were then incubated with purified anti-m>'c (9E10) 
monoclonal antibody (1:100 in PBS-1% FCS) for I h at room temperature. After 
four washes in PBS-1% FCS, fluorescein isothiocyanaie-conjugatcd goat anti- 
mouse immunoglobulin Gl antibody (Southern Technologies Associates Inc., 
Birmingham, Ala.) was added for 1 h at room temperature (1:100 in PBS-1% 
FCS). After two washes in PBS-1% FCS and two washes in PBS, the cells were 
mounted in Mowiol (Calbiochem) and observed on a Zeiss ftuoresccncc micro- 
scope. Surface expression was estimated by randomly counting 10 fields. Cells 
were scored as positive if the signal was associated with the cell plasma mem- 
brane. 
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FIG. 2. Schematic representation of hCCR-5-mCCR-5 chimeras. Segments 
of human origin are filled, while segments of murine origin arc open. Trans- 
membrane domains are shaded, and inverted triangles indicate junction sites. 
The relative level of surface membrane-associated expression of each chimeric 
receptor, as estimated by immunofluorescence 2 to 3 days after transfection of 
HeLa-P4 or CCC-CD4 cells (see Materials and Methods), is indicated on the 
ri^t-hand side and expressed as the percentage of hCCR-5 expression as fol- 
lows: + , 20 to 50%; ++, 50 to 70%; and ++ +, 70 to 100%. These estimations 
are representative of three independent experiments. 



Fusion assay. HeLa>P4 cells were plated in 12-well (or 24-wel]) trays and 
transfcctcd by the calcium phosphate precipitation method the next day with the 
different CCR-5 constructs with 1 (or 0.5) fxg of DNA per well. After 16 to 20 h, 
the cells were washed in PBS, split into four replicates in 96-wcll plates, and 
cocultivated with HeLa-EnvAr>yi^ or HcLa-Env^^ (10* cellsAvell) for 24 h. An in 
situ X-Gal (5-bromo-4-chloro-3-indoIyl-^-D-galactopyranoside) assay was then 
done in duplicate. In brief, cells were washed in PBS, fixed in PBS-0.5% glutar- 
aldchyde for 5 min, washed in PBS, and incubated with the X-Gal substrate (0.5 
mg of X-Gal per ml in PBS containing 3 mM ferricyanide, 3 mM ferrocyanide, 
and 2 mM MgCl2; Calbiochcm) for 2 h at 37''C. The two other wells were assayed 
for ^-galactosidase production by a colorimetric assay (59). After being washed 
in PBS, the cells were lysed in 100 p.! of PBS-O.S% Nonidct P-40 for 5 min at 
room temperature. Fifty microliters of that lysate was then mixed with 50 m,I of 
2X CPRG substrate (16 mM chlorophcnol rcd-^-D-galactopyranoside [Boehr- 
ingcr Mannheim], 0.12 M Na2HP04 • THjO, 0.08 M NaHzPOa • H2O, 0.02 M 
KCl, 0.02 M MgS04 • 7H2O, 0.01 M p-mercaptoethanol) in a 96-well plate and 
incubated for 2 to 3 h in the dark at room temperature. These conditions were 
checked for linearity of the en^matic colorimetric reaction. The absorbance at 
540 nm was then read on a Dynatech microplatc reader. 

Infectlvity assay. Subconfluent CCC-CD4 cells in 100 or 50-mm -diameter petri 
dishes were transfcctcd with 10 or 5 ji.g of the different CCR-5 constructs by 
using Lipofectamine (Gibco BRL, Life Technologies) or calcium phosphate 
precipitation for 5 or 16 to 20 h, respectively. The cells were then split into 
48-well plates at 2 X 10* to 4 x 10^ cells/well. They were challenged the next day 
with HIV-1 (~10^ to 10** focus forming units per well) for 2 to 5 h, refed, 
immunostained 3 days postinfection with an anti-p24 antibody as the primary 
antibody, and subsequently incubated with a secondary antibody conjugated to 
p-galactosidase (Genosys Biotechnologies, Inc.) as described previously (24). 
Tlic blue-stained syncytia were then scored after an in situ X-Gal assay. 



RESULTS 

The murine homolog of CCR-5 is inactive as a coreceptor 
for HIV-1. The murine homolog of CCR-5 (mCCR-5) was 
recently cloned and has 83% identity and 91% similarity at the 
amino acid level to hCCR-5 (57). The differences are not 
located in a particular region of the molecule but are distrib- 
uted throughout the sequence (Fig. 1). The murine protein is 
a structural and functional homolog of hCCR-5, since it acts as 
a receptor for mouse MlP-la, mouse MlP-ip, and mouse 
RANTES (10, 57). Because of its closely related sequence and 
chemokine binding profile, we decided to test its functionality 
as an HIV-1 coreceptor. 

hCCR-5 and mCCR-5 were tagged at their C termini with 
the c-myc epitope (9E10), cloned into a pcDNA3 expression 
vector, and transfcctcd into HeLa-P4 cells. This CDA^ HeLa 
cell line is stably transfected with a lacZ gene under the control 
of the HIV-1 LTR and is therefore inducible by the HIV-1 
transactivator Tat, Because of their human origin and the 
endogenous expression of the CXCR-4 receptor (38), 
HeLa-P4 cells are permissive for entry and fusion of TCLA 
and primary SI HIV-1 strains but not for primary NSI M-tropic 
strains. After transfection of hCCR-5 or mCCR-5 into 
HeLa-P4 cells, the cells were cocultivated with HeLa-EnvLAj 
or HeLa-Env^y,^ cells. As expected, HeLa-Env^Ai were able 
to form syncytia with untransfected cells as well as with 
hCCR-5- and mCCR-5-transfected cells (not shown). HeLa- 
Env^y^y however, could form syncytia only with HeLa-P4 cells 
transfected with hCCR-5, as mCCR-5 did not serve as a core- 
ceptor for HIV-1 Env-induced cell fusion (Fig. 3 and 4). 
The surface expression level of the mCCR-5 protein in 
HeLa-P4 cells was fivefold less that of hCCR-5 (Fig. 2). How- 
ever, overexpression of mCCR-5 in CD4'^ CCC cat kidney 
cells or 3T3 mouse fibroblasts by vaccinia virus/T7 RNA poly- 
merase infection did not result in syncytium formation upon 
cocultivation with HeLa-Env^DA while hCCR-5 did (data not 
shown). Thus, despite its similarity with hCCR-5, mCCR-5 is 
not functional as an HIV-1 coreceptor. We therefore used this 
property to map the regions of CCR-5 important for HIV-1 
entry and fusion by using chimeric CCR-5 receptors. 

Mapping of CCR-S extracellular domains involved in HTV- 
^ADA envelope-induced fusion. To map the extracellular do- 
mains involved in coreceptor function for HIV-1 NSI M-tropic 
strains, we chose to use the ADA isolate as an Env prototype. 
Since hCCR-5 and mCCR-5 are conserved at the amino acid 
level, it was possible to construct functional molecular chime- 
ras between the two receptors (described in Materials and 
Methods). The nomenclature that we used for the different 
regions of chemokine receptors is as follows: El, E2, and E3 
for extracellular loops 1 to 3, amino-terminal extracellular do- 
main, II, 12, and 13 for intracellular loops 1 to 3, carboxy- 
terminal intracellular domain, and TMl to TM7 for the seven- 
membrane-spanning segments. We reasoned that to be used as 
an HIV-1 coreceptor, the important functional domains of the 
molecule were likely to be the four extracellular domains (N 
terminus, El, E2, and E3). However, we should bear in mind 
that the transmembrane and intracellular domains can play a 
role in the overall conformation of the receptor, in the inter- 
action with gpl20 and/or CD4, or in the fusion step per se. 
There are seven amino acid differences (and a 2-aa insertion in 
mCCR-5) between hCCR-5 and mCCR-5 amino-terminal ex- 
tracellular domains (30 to 32 aa long), five differences in El (13 
aa long), seven differences in E2 (32 aa long), and no differ- 
ences in E3 (17 aa long). We generated six chimeras by ex- 
changing fragments encompassing each of the first three ex- 
tracellular domains. The resulting constructs are depicted in 
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FIG. 3. Formation of syncytia between HlV-l^^a^^ envelope-expressing cells and HeLa-P4 cells expressing diflFerent hCCR-5-mCCR-5 chimeras. HeLa-P4 cells 
transiently transfected with an empty vector (A), hCCR-5 (B), mCCR-5 (C), or the different chimeras HMMM (D), MHMM (E), and HHMM (F) were cocultivated 
for 24 h with HeLa-Env^DA cells, iixed, and stained with the X-Gal substrate as described in Materials and Methods. Photographs were taken at a x 160 magnification. 
Results to similar those seen in panels B and F were obtained for the MHHH, HMHH, and MMHH chimeric receptors. 



Fig. 2. Each of the four extracellular domains is represented by 
the letter H or M depending on the human or murine origin of 
the extracellular domains in the chimera, and the constructs 
are named accordingly. 

Levels of cell surface expression of the six chimeras and the 
two parental clones were estimated by transfecting the C-ter- 
minus-tagged constructs into HeLa-P4 or CCC-CD4 cells and 
staining them with the 9E10 (c-myc epitope) monoclonal an- 
tibody by indirect immunofluorescence after pemieabilization 
of the cells. Cells were scored as positive if the 9E10 staining 
was associated with the cell plasma membrane. As previously 
mentioned, mCCR-5 was expressed at a lower level than its 
human counterpart. All other hCCR-5-mCCR-5 constructs 
were expressed at levels similar to that of the hCCR-5 parental 
clone except for the HMMM chimera, which showed about 
50% of the level of hCCR-5 expression (Fig. 2). 

These different chimeras were tested for coreceptor activity 
by transient transfection into HeLa-P4 cells and coculture with 
HeLa-Env^DA' Blue-stained syncytia were scored after an. in 
situ X-Gal assay, or p-galactosidase activity was measured by a 
CPRG colorimetric assay (Fig. 3 and 4). To map the domain(s) 
necessary and sufficient to confer coreceptor activity to 



mCCR-5, we first introduced segments of hCCR-5 containing 
each extracellular domain into nonfunctional mCCR-5. The 
HMMM chimera inefficiently mediated HIV-I^d^ Env-in- 
duced fusion (Fig. 3 and 4), indicating that a fragment encom- 
passing the N-terminal, TMl, and II domains was not sufficient 
to confer complete coreceptor activity or that the lower level of 
expression of this construct resulted in reduced coreceptor 
activity. However, when we introduced a fragment containing 
the TMl. El, and TM2 domains in mCCR-5 (MHMM), it was 
possible to observe syncytium formation with Env-expressing 
cells. This fusion activity was reproductively about 30% of the 
level of the hCCR-5 control when we assayed by the CPRG 
colorimetric assay (Fig. 4). This reduced activity was due to a 
reduction in the size of syncytia (Fig. 3). Tlius, the first extra- 
cellular loop when it is introduced into the mCCR-5 receptor 
is sufficient to mediate ADA Env-mediated fusion. To deter- 
mine the role of the second extracellular loop, we used the 
MMHH chimera, which is identical in extracellular sequence 
to the theoretical MMHM chimera. This chimera had core- 
ceptor activity similar to that of parental hCCR-5, indicating 
that E2 is also important in mediating HIV-1 Env interaction. 
We then placed two domains of hCCR-5 into the nonfunc- 
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FIG. 4. Fusion assay between HIV•ly|^;,y^ envelope-expressing cells and HeLa-F4 cells expressing different hCCR-5-mCCR-S chimeras. HeLa-P4 cells transiently 
transfcctcd with the different constructs were cocultivatcd for 24 h with HcLa-Envy^Q;^ cells in 96-wcli plates, lyscd, and tested for ^-galactos^dasc activity by a 
colorimctric CPRG assay as described in Materials and Methods. The histogram presented here is representative of three independent experiments done in duplicate. 
The results are expressed as the means of ^1540 values ± standard deviations. 



tional murine background. The chimera HHMM (for MMHH, 
see above) worked as well as the hCCR-5 parental molecule 
for ADA-induced fusion (Fig. 3 and 4) and better than the first 
two individual extracellular domains alone (see HMMM and 
MHMM). This construct confirms that the first two extracel- 
lular domains are important for coreceptor activity. Further- 
more, the functionality of these two complementary chimeras 
(HHMM and MMHH) for HIV-1 fusion (Fig. 3 and 4) and 
entry (Table 1; see below) indicates that none of the intracel- 
lular and transmembrane domains of mCCR-5 are detrimental 
for coreceptor activity when they are placed in a functional 
background. This analysis shows that a combination of any 
single extracellular domain of hCCR-5 placed in the nonfunc- 
tional mCCR-5 background is able to mediate HIV-l^j,^ Env- 
induced fusion. 

We then determined whether any domain of hCCR-5 was 
indispensable for the fusion process to occur. For that purpose. 



we sequentially replaced segments containing each extracellu- 
lar domain of hCCR-5 by an mCCR-5 segment. The resulting 
chimeras, MHHH, HMHH, and HHMM (for the last, see 
above), behaved like hCCR-5 in ADA Env-induced fusion 
(Fig. 4), demonstrating that none of hCCR-5 sequences were 
essential for coreceptor function. The ailtemative explanation 
is that mCCR-5 sequences placed in a functional h CCR-5 
background adopt a different conformation and can mediate 
some interaction with the fusion complex. We therefore con- 
clude that multiple domains of hCCR-5 are involved in HIV- 
Iada Env-induced fusion. It was not possible to identify a 
single domain necessary and sufficient for fusion and likely to 
contain a binding site for the HlV-1 envelope. 

Different HIV-1 isolates have distinct requirements for uti- 
lization of the CCR-5 coreceptor. Since different HIV-1 iso- 
lates can use the same coreceptor in distinct ways (66, 71), we 
extended our analysis to different primary SI and NSI isolates 



TABLE 1. Infectivity of and syncytium formation by primary and TCLA HIV-1 isolates in cells transfected with different CCR-5 derivatives" 



No. of bluc-stained syncytia per well* with transfcctcd coreceptor: 

Viral isolate — — — 

None' CXCR-4 hCCR-5 mCCR-5 HMMM MHMM HHMM MHHH HMHH MMHH 



NSI M-tropiC^ 






















ADA 


1 


6 


965 


0 


146 


62 


643 


985 


1,104 


976 


SF-162 


0 


0 


848 


0 


161 


302 


420 


904 


991 


867 


M23 


0 


0 


H8 


0 


0 


8 


25 


124 


101 


4 


E80 


0 


0 


92 


0 


0 


3 


30 


128 


123 


0 


SI dual-tropic 






















89.6 


0 


662 


158 


0 


5 


1 


1 


6 


21 


2 


GUN-r 


0 


426 


122 


1 


0 


0 


14 


0 


122 


1 



^ Parental and chimeric receptors were transfcctcd into CCC-CDA cells, infected, and immunostaincd 3 days postinfection as described in Materials and Methods. 
Results of one of three independent experiments giving essentially the same results are presented. 
" In 4S-WCII plates. 

Transfection was with the empty pcDNAB vector. 
''si status and tropism were previously determined (78). 
' TCLA strain. 
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that use CCR-5 as a coreceptor and tested them in ceil-free 
infection. 

The panel of primary isolates used here have been previ- 
ously described for their tropism, SI status, and coreceptor 
usage (78). The different hCCR-5-mCCR-5 chimeras were 
transiently transfected into nonpermissive feline CCC-CD4 
cells before being challenged with cell-free viral supernatant. 
The cells were immunostained for intracellular p24 production 
3 days postinfection. The results are summarized in Table 1, 
The ADA and SF-162 isolates were able to infect CCC-CD4 
cells expressing each of the hCCR-5-mCCR-5 chimeric recep- 
tors tested, although at lower levels for HMMM and MHMM. 
These results parallel those obtained in the syncytium forma- 
tion assay. Two other primary NSI M-tropic strains (M23 and 
E80) able to use the MHHH, HHMM, and HMHH chimeras 
were, however, unable to use HMMM, MHMM, and MMHH 
for efficient entry into transiently transfected CCC-CD4 cells. 

Surprisingly, infection by the SI dual-tropic strains 89.6 and 
GUN-1 was more severely affected by modification of hCCR-5. 
Infection by 89.6 was efficient only in cells expressing hCCR-5 
and CXCR-4, while GUN-1 was able to use only the HMHH 
chimeric receptor. These results were unexpected, since these 
isolates are able to interact with at least two divergently related 
chemokine receptors (CXCR-4 and CCR-5). The reason for 
this dependence on CCR-5 structure and sequence integrity is 
not yet clear but may relate to a particular envelope confor- 
mation and/or sequence of these dual-tropic isolates. The 
TCLA strain LAI is able to use only CXCR-4 as a coreceptor 
and did not use any of the chimeric CCR-5 receptors (data not 
shown). Since the ADA and SF-162 envelope glycoproteins 
were able to use all of the chimeras transfected into CCC-CD4, 
we assume that each receptor is expressed, folded, and trans- 
ported correctly in this cell type as well and that the results 
observed are not cell type specific. However, variation in levels 
of surface expression of chimeras (particularly for HMMM) 
may account for some of the lower efficiency of entry observed 
for some isolates. 

Our results show that coreceptor utilization is probably con- 
formationally complex, involving multiple extracellular do- 
mains of CCR-5. Furthermore, different isolates are more sen- 
sitive than others to change in the sequence and/or 
conformation of the coreceptor. It is noteworthy that these 
requirements correlate with tropism, with SI dual-tropic iso- 
lates being more dependent on hCCR-5 integrity than NSI 
M-tropic isolates. 

DISCUSSION 

In this study, we were interested in trying to determine the 
extracellular domains of CCR-5 important for HIV-1 entry and 
fusion. For this purpose we used the murine homolog of 
CCR-5 that has recently been cloned (10, 57) and shown to be 
nonfunctional as a coreceptor for HIV-1 (6, 8). This molecule 
has 83% identity at the amino acid level with its human coun- 
terparts (Fig. 1) but is still not able to be used as a coreceptor 
by a range of primary M-tropic and dual-tropic HIV-1 isolates 
(Fig. 3 and 4 and Table 1). Despite their high level of similarity 
(91%), hCCR-5 and mCCR-5 have different activities in HIV-1 
coreceptor function. A rhesus macaque homolog of CCR-5, 
which differs from hCCR-5 at eight amino acid positions, is 
able to function as an HIV-1 coreceptor (16, 54). Thus, the 
coreceptor activities of these nonhuman CCR-5 homologs in 
vitro correlates with their activities in vivo: CD4'*" mouse lym- 
phocytes and CD4 transgenic mice are not infected by HIV-1, 
whereas macaque PBMCs and macrophages are infected in 
vitro and in vivo by a simian-human immunodeficiency virus 
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recombinant that expresses the HlV-lgp.igj M-tropic envelope 
glycoproteins (52). 

Some of the changes in mCCR-5 are located in extracellular 
domains, suggesting that these differences may influence its 
function as an HIV-1 coreceptor. Subtle changes can have a 
dramatic effect on ligand interaction as exemplified by the 
chemokine binding profile of mCCR-5, which is unable to bind 
efficiently human MlP-la but binds human MIP-lp and hu- 
man RANTES at levels of affinity similar to those of the cor- 
responding murine chemokines (57). Using six hCCR-5 and 
mCCR-5 chimeric receptors and testing them for HIV-ly^^A 
Env-induced cell fusion and cell-free infection with six differ- 
ent isolates, we showed that (i) several if not all extracellular 
regions of CCR-5 are implicated in the interaction with HIV-1 
and (ii) there are different CCR-5 requirements for different 
isolates. 

Using the ADA envelope glycoproteins in a syncytium for- 
mation assay, we showed that any combination of hCCR-5 
segments placed in nonfunctional mCCR-5 was able to restore 
coreceptor function. Partial fusion was observed when we in- 
troduced a fragment spanning the human amino-terminal or 
El domain into mCCR-5. Unfortunately, reduced surface ex- 
pression of HMMM may explain inefficient fusion and entry 
for this construct. Replacing fragments containing each extra- 
cellular domain of hCCR-5 with fragments containing the cor- 
responding murine domain showed that none of the human 
domains was indispensable for the function. No single extra- 
cellular domain proved to be necessary or sufficient for CCR-5 
to be used as an HIV-1 coreceptor. It was, however, not pos- 
sible by this strategy to test the role of the E3 region, since this 
loop is identical between hCCR-5 and mCCR-5. Moreover, no 
important roles were noted for transmembrane and intracel- 
lular regions in HIV-1 coreceptor activity with the chimeras 
used here. The implication from these data is that the inter- 
action between Env and CCR-5 involves more than one region 
of the coreceptor. Clearly, mCCR-5 is incapable of making 
some key interactions resulting in lack of fusion activity, but 
this deficiency can be repaired in several ways. 

Using a panel of primary NSI M-tropic and SI dual-tropic 
isolates, we found that the requirements were not the same 
between these phenotypically different strains. Cell-free virus 
infection by the ADA and SF-162 isolates gave similar results 
in the syncytium formation assay. However, two other primary 
NSI M-tropic strains (M23 and E80) were able to fuse only 
with some of the chimeric coreceptors. For these two isolates, 
we were unable to challenge transfected cells with as high an 
input of virus as that used for ADA and SF-162. Thus, the 
lower levels of or lack of infection by these two viruses noted 
for some of the constructs may partly be due to the low viral 
input. In the case of MMHH, however, it is clear that while full 
infectivity by ADA and SF-162 was observed, M23 and E80 
plating was at background levels. Further studies will be 
needed to assess the extent of variation in the use of CCR-5 
between distinct NSI M-tropic strains. Bieniasz et al. recently 
showed a differential usage of CCR-5 between the two NSI 
M-tropic isolates ADA and Ba-L in cell-cell fusion that corre- 
lated to the V3 loop sequence (8). 

In contrast, the SI dual-tropic strains generally could use few 
of the chimeric receptors. The strain specificity reported here 
was not totally unexpected, as it has been shown that different 
HIV-1 strains have different requirements for interaction with 
the N-terminal extracellular domain of CXCR-4 (66) or 
CCR-5 (71). A recent study using CCR-5'CCR-2b chimeras 
also showed that HIV-1 must interact with multiple domains, 
the N-terminal and El domains being the most important (71). 
Using the 89.6 dual-tropic isolates, Rucker et al. showed that 
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this isolate has requirements different from those of JR-FL for 
CCR-5 use. However, the interpretation of the results was 
complicated by the fact that the partner used for the chimeras 
was CCR-2b, which is also used as a coreceptor by 89.6. We 
showed here that the dual-tropic 89.6 and GUN-1 isolates were 
inefficient in using most of the hCCR-5-mCCR-5 chimeras for 
entry, despite the fact that the level of infection of CCR-5 - 
expressing cells was the same as for M-tropic isolates. 

The differences observed between NSI M-tropic and SI dual- 
tropic isolates for CCR-5 usage are surprising, since the latter 
isolates can use multiple coreceptors to gain entry into cells: 
89.6 and GUN-1 can use CCR-5 and CXCR-4 (33, 78), and in 
addition, 89.6 can use CCR-3 and CCR-2b (34). Thus, they are 
able to use different divergent members of the chemokine 
receptor family for entry into cells and yet subtle changes in 
CCR-5 abolish their entry. In contrast, M-tropic isolates 
proved less sensitive to changes in CCR-5 despite the fact that 
their coreceptor usage is more restricted. This apparent para- 
dox suggests that multiple ways of interacting with CCR-5 exist 
among HIV-1 strains, which may influence their ability to gain 
the use of other coreceptors. It is possible that evolution of a 
CCR-5 -specific HIV-1 envelope to exploit more than one co- 
receptor may lower its affinity for CCR-5, making it more 
sensitive to changes in the sequence. The changes that take 
place in NSI isolate envelope glycoprotein sequences in vivo 
during the NSI-to-SI phenotypic switch (14, 18, 39, 49) must 
reflect the ability of their envelope glycoproteins to interact 
with CXCR-4 (46). These changes may not be compatible with 
interaction with CCR-5 at the same sites or with the same 
affinity. Alternatively, the ability of gpl20 to interact with the 
coreceptor is necessary but not sufficient for the entry process 
to be completed. 

It has recently been shown that sequences inside the V3 loop 
known to influence tropism (17, 21, 44, 56, 61, 75, 80, 85, 87) 
and post-CD4 binding events (7, 40, 45, 64) also influence 
coreceptor usage (22, 33, 67), formation of the CD4-CCR-5- 
gpl20 ternary complex (81, 88), and sensitivity to p-chemo- 
kine-mediated inhibition of entry (27, 46). The V3 loop is 
therefore a candidate region for interaction with CCR-5 but is 
probably not the only Env region involved. In fact, it has been 
shown that a recombinant NSI M-tropic isolate containing a 
TCLA strain V3 loop was still able to infect PBMCs and 
macrophages. This recombinant probably still interacted with 
CCR-5 despite the TCLA strain-derived V3 loop sequence (15, 
62). Other variable loops like VI and V2 that influence tropism 
and the SI ability of HlV-1 (11, 47, 79) may also be invoked. 
Other factors, such as a possible CD4-CCR-5 interaction, 
must also be taken into account to explain the precise mech- 
anism of interaction of the HIV-1 envelope glycoproteins with 
the coreceptor. Results from a study by Wu et al. (88) showing 
that a soluble form of CD4 consisting of the two amino-termi- 
nal domains (D1D2 sCD4) can inhibit MlP-la and MIP-ip 
binding to CCR-5 in the absence of gpl20 support this possi- 
bility. Furthermore, differences in affinities for the binding 
receptor CD4 may also influence the way different HIV-1 
strains interact with the receptor complex (48). 

During the course of this work, two studies also reported by 
a similar approach that several CCR-5 extracellular domains 
were involved in HIV-1 entry (6, 8). In contrast to our results, 
Atchinson et al. (6) showed that an MMHH construct was only 
minimally active for Ba-L, while our equivalent construct fully 
supported entry of two NSI M-tropic isolates: ADA and SF- 
162. Bieniasz et al. using an envelope-mediated cell fusion 
assay reached conclusions similar to those of our study: they 
observed differential usage of hCCR-5 and mCCR-5 chimeras 
between distinct NSI M-tropic isolates (ADA and Ba-L) and 
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restrictive utilization of the chimeric receptors by the 89.6 
isolate. Our analysis further shows that this is also true in 
cell-free virus infection and that it is likely to be a general 
feature of dual-tropic strains to interact less efficiently or dif- 
ferently with CCR-5. 

In summary, we have shown that multiple domains of CCR-5 
are involved in HIV-1 entry and syncytium formation. Six dif- 
ferent CCR-5-using isolates were tested and shown to vary in 
coreceptor requirements for cell entry. These distinct corecep- 
tor needs correlated with tropism, with dual-tropic strains be- 
ing more sensitive to changes in their CCR-5 sequences. It will 
be interesting to see how these dual-tropic isolates interact 
with the other major coreceptor, CXCR-4. Other chimeras and 
mutants of CCR-5 and CXCR-4 need to be tested to precisely 
understand the HIV-1 entry mechanism. 

ACKNOWLEDGMENTS 

We thank Ron Collman (University of Pennsylvania, Philadelphia), 
Pat Gray (ICOS Corp.), and Anne Brelot and Marc Alizon (Institut 
Cochin de G6n6tique Mol^culaire) for providing reagents; Paul 
Kellam for help with sequencing; and Matthias Dittmar and Tim Wells 
for helpful discussions and critical readings of the manuscript. 

This work was supported by the Medical Research Council. L.P. was 
supported by an MRC-INSERM exchange fellowship. 

REFERENCES 

1. Adachi, A, H. E. Gendelmann, S. Koenig, T. Folks, R. Wiliey, A Rabson, 
and M. A Martin. 1986. Production of acquired immunodc6cicncy syn- 
drome-associated retrovirus tn human and non-human cells transfected with 
an infectious molecular clone. J. Virol. 59:284-291. 

2. Alkhatibt G., C. C. Broder, and E. A. Berger. 1996. Cell type-specific fusion 
cofactors determine human immunodeficiency virus type I tropism for T-ccU 
lines versus primary macrophages. J. Virol. 70:5487-5494. 

3. Alkhatib, G., C. Combadlere, C. C. Broder, Y. Feng, P. E. Kennedy, P. M. 
Murphy, and A E. Berger. 1996. CC CKR5: a RANTES, MlP-la, MlP-lp 
receptor as fusion cofactor for macrophage-tropic HIV-1. Science 272:1955- 
1958. 

4. Arenzana-Seisdedos, F., J.-L. Virelizier, D. Rousset, I. Clark-Lewis, P. 
Loetscher, B. Moser, and M. Baggiolini. 1996. HIV blocked by chemokine 
antagonist. Nature 383:400. 

5. Ashom, P. A, E. A. Berger, and B. Moss. 1990. Human immunodeficiency 
virus envelope glycoproicin/CD4-mediatcd fusion of nonprimatc cells with 
human cells. J. Virol. 64:2149-2156. 

6. Atchinson, R. E., J. Gosling, F. S. Monteclaro, C. Franci, L. DIgilio, I. F. 
Charo, and M. A. Goldsmith. 1996. Multiple extracellular elements of CCRS 
and HIV-1 entry: dissociation from response to chcmokincs. Science 274: 
1924-1926. 

7. Bergeron, L., N. Sullivan, and J. Sodroski. 1992. Target cell-specific deter- 
minants of membrane fusion within the human immunodeficiency virus type 
1 gpl20 third variable domain and gp41 amino terminus. J. Virol. 66:2389- 
2397. 

8. Bieniasz, P. D., R. A Fridell, I. Araraori, S. S. G. Ferguson, M. G. Caron, 
and B. R. Cullen. HIV-1 induced cell fusion is mediated by multiple regions 
within both the viral envelope and the CCR-5 co-receptor. EMBO J., in 
press. 

9. Bleul, C. C, M. Farzan, 11. Choe, C. Parolin, I. Clark-Lewis, J. Sodroski, and 
T. A. Springer. 1996. The lymphocyte chcmoattractant SDF-1 is a ligand for 
LESTR/fusin and blocks HlV-1 entry. Nature 382:829-833. 

10. Boring, U, J. Gosling, F. S. Monteclaro, A J. Lusis, C.-L. Tsou, and L F. 
Charo. 1996. Molecular cloning and functional expression of murine JE 
(monocyte chemoattractant protein 1) and murine macrophage inflamma- 
tory protein la receptors. J. Biol. Chcm. 271:7551-7558. 

11. Boyd, M. T., G. R. Simpson, A J. Cann, M. A Johnson, and R. A Weiss. 
1993. A sin^c amino acid substitution in the VI loop of human immunode- 
ficiency virus type 1 gpl20 alters cellular tropism. J. Virol. 67:3649-3652, 

12. Broder, C, and E. A Berger. 1995. Fusogenic selectivity of the envelope 
glycoprotein is a major determinant of human immunodeficiency virus type 
1 tropism for CD4"*" T-celi lines vs. primary macrophages. Proc. Natl. Acad. 
Sci. USA 92:9004-9008. 

13. Broder, C. C, D. S. DImitrov, R. Blumenthal, and E. A Berger. 1993. The 
block to HIV-1 envelope glycoprotein-mediatcd membrane fusion in animal 
cells expressing human CD4 can be overcome by a human ceil component(s). 
Virology 193:483-491. 

14. Callahan, L. 1994. HIV-1 virion-cell interactions: an electrostatic model of 
pathogenicity and syncytium formation. AIDS Res. Hum. Retroviruses 10: 
231-233. 



5010 PICARD ET AL. 



15. Carrillo, and L. Ratner. 1996. Human immunodeficiency viruij type I 
tropism for T- lymphoid cell lines: role of the V3 loop and C4 envelope 
determinants. J. Virol. 70:1301-1309. 

16. Chen, Z., P. Zhou, O. D. Ho, N. R. Landau, and P. A. Marx. 1997. Genetically 
divergent strains of simian immunodeficien(^ virus use CCR5 as a corecep- 
tor for entry. J. Virol. 71:2705-2714. 

17. Cheng-Mayer, C, M. Oulroga, J. W. Tung, D. Dina, and J. A. Levy. 1990. 
Virol determinants of human immunodeficiency virus type 1 T-cell or mac- 
rophage tropism, cytopathogcnicity, and CD4 antigen modulation. J. Virol. 
64:4390-4398. 

18. Cheng-Mayer, C, T. Shioda, and J. A. Levy. 1991. Host range, replicativc, 
and cytopathic properties of human immunodeficiency virus type 1 are de- 
termined by very few amino acid changes in tat and gpl20. J. Virol. 65:6931- 
6941. 

19. Cheng-Mayer, C, C. Weiss, D. Seto, and J* A. Levy. 1989. Isolates of human 
immunodeficiency virus type 1 from the brain may constitute a special group 
of the AIDS virus. Proc. Natl. Acad. Sci. USA 86:8575-8579. 

20. Chesebro, B., R. Buller, J. Portis, and K. Wehrly. 1990. Failure of human 
immunodeficiency virus entry and infection in CD4-positive human brain 
and skin cells. J. Virol. 64:215-221, 

21. Chesebro, B., IC Wehrly, J. Nishio, and S. Perryman. 1992. Macrophage- 
tropic human immunodeficiency virus isolates from different patients exhibit 
unusual V3 envelope sequence homogeneity in comparison with T-cell tropic 
isolates: definition of critical amino acids involved in cell tropism. J. Virol. 
66:6547-6554. 

22. Choe, H., M. Farzan, Y. Sun, N. Sullivan, B. Rollins, P. D. Ponath, L. Wu, 
C. R. Mackay, G. LaRosa, W. Newman, N. Gerard, C. Gerard, and J. 
Sodroskl. 1996. The beta-chemokine receptors CCR3 and CCR5 facilitate 
infection by primary HIV-1 isolates. Cell 85:1135-1148. 

23. Clapham, P. R., D. Blanc, and R. A. Weiss. 1991. Specific cell surface 
requirements for the infection of CD4-positive cells by human immunode- 
ficiency virus ^pes 1 and 2 and by simian immunodcficicnQr virus. Virolo^ 
181:703-715. 

24. Ctapham, P. R^ A. McKnight, and IL A. Weiss. 1992. Human immunodefi- 
ciency virus type 2 infection and fusion of CD4-ncgativc human cell lines: 
induction and enhancement by soluble CD4. J. Virol. 66:3531-3537. 

25. Clavel, F., and P. Charneau. 1994. Fusion from without directed human 
immunodeficiency virus particles. J. Virol. 68:1179-1185. 

26. Cocchi, F., A. L. DeVico, A. Garzlno-Demo, S. K. Arya, R. C. Gallo, and P. 
Lusso. 1995. Identification of RANTES, MlP-la, and MIP-ip as the major 
HIV-suppressive factors produced by CD8^ T cells. Science 270:1811-1815. 

27. Cocchi, F., A. L. DeVIco, A. Garzino-Denio, A. Cara, R. C. Gallo, and P. 
Lusso. 1996. The V3 domain of the gpl20 envelope glycoprotein is critical 
for chemokine-mcdiatcd blockade of infection. Nat. Med. 2:1244-1247. 

28. CoUman, R., J. W. Balliet, S. A. Gregory, JL Friedman, D. L. Kolson, N. 
Nathanson, and A. Srinivasav. 1992. An infectious molecular clone of an 
unusual macrophagc-tropic and highly cytopathic strain of human immuno- 
deficiency virus type 1. J. Virol. 66:7517-7521. 

29. Combadiere, C, S. K. Ahuja, H. Lee Tiffany, and P. M. Murphy. 1996. 
Cloning and functional expression of CC CK.R5, a human monocyte CC 
chemokine receptor selective for MlP-la, MIP-ip, and RANTES. J. Leu- 
kocyte Biol. 60:147-152. 

30. Connor, R. I., K. E. Sheridan, D. Ceradini, S. Choe, and N. R. Landau. 1997. 
Change in coreccptor use correlates with disease progression in HIV-1- 
infected individuals. J. Exp. Med. 185:621-628. 

31. Dean, M., M. Carrlngton, C^ Winkler, G. A. lluttley, M. W. Smith, R. 
Allikmets, J. J. Goedert, S. P. Buchbinder, E. Vittinghoff; E. Gomperts, S. 
Donfleld, D. Vlahov, R. Kaslow, A. Saah, C. Rinaldo, R. Detels, Hemophilia 
Growth and Development Study, Multicenter AIDS Cohort Study, -Multi- 
center Hemophilia Cohort Study, San Francisco City Cohort, ALIVE Study, 
and S. J. O'Brien. 1996. Genetic restriction of HIV-1 infection and progres- 
sion to AIDS by a deletion allele of the CKR5 structural gene. Science 
273:1856-1862. 

32. Deng, R* Uu, W. Elmeier, S. Choc, D. Unutmaz, M. Burkhart, P. Dl 
Marzio, S. Marmon, R. E. Sutton, C. M. Hill, C. B. Davis, S. C. Peiper, T. J. 
Schall, D. R. Littman, and N. R. Landau. 1996. Identification of a major 
co-rcccptor for primary isolates of HIV-1. Nature 381:661-666. 

33. Dittmar, M. T., A. McKnIght, G. Simmons, P. R. Clapham, R. A. Weiss, and 
P. Simmonds. 1997. HIV-1 tropism and co-receptor usage. Nature 385:495- 
496. 

34. Ooranz, B. J., J. Rucker, Y. Yi, R. J. Smyth, M. Samson, S. C. Peiper, M. 
Parmcntier, R. G. Collman, and R. W. Doms. 1996. A dual-tropic primary 
HIV-1 isolate that uses fusin and the beta-chemokine receptors CKR-5, 
CKR-3, and CKR-2b as fusion cofactors. Cell 85:1149-1158. 

35. Dragic, T., P. Charneau, F. Clavel, and M. Alizon. 1992. Complementation 
of murine cells for human immunodeficiency virus cnvclopc/CD4-mcdiatcd 
fusion in human-murine heterokaryons. J. Virol. 66:4794-4802. 

36. Dragic, T, V* Litwin, G. Aliaway, S. R. Martin, Y. Huang, K. A. Nagashima, 
C. Cayanan, P. J. Maddon, R. A. Koup, J. P. Moore, and W. A. Paxton. 1996. 
HIV-1 entry into CD4"*" cells is mediated by the chemokine receptor CC- 
CKR5. Nature 381:667-673. 

37. Evan, G. i., G. K. Lewis, G. Ramsay, and J. M. Bishop. 1985. Isolation of 



J. Virol. 



monoclonal antibodies specific for human c-myc proto-oncogcnc product. 
Mol. Cell. Biol. 5:3610-3616. 

38. Feng, Y., C. C. Broder, P. E. Kennedy, and E. A- Berger. 1996. HIV-1 entry 
cofacton functional cDNA cloning of a seven-transmembrane G-protein 
coupled receptor. Science 272:872-877. 

39. Fouchier, R. A. M,, M. Groenik, N. A. Kootstra, M. Tersmette. H. G. Huis- 
man, F. Miedema, and 11. Schuitemaker. 1992. Phenotype-associated se- 
quence variation in the third variable domain of the human immunodefi- 
ciency virus type 1 gpl20 molecule. J. Virol. 66:3183-3187. 

40. Freed, £. O., D. J. Myers, and R. Risser. 1991. Identification of the principal 
neutralization determinant of human immunodcficicnQr virus type 1 as a 
fusion domain. J. Virol. 65:190-194. 

41. Harrington, R., and A. P. Geballe. 1993. Cofactor requirements for human 
immunodeficiency virus type 1 entry into a CD4-cxprcssing human cell line. 
J. Virol. 67:5939-5947. 

42. He, J., Y. Chen, M. Farzan, H. Choe, A. Ohagen, S. Gartner, J, Busciglio, X. 
Yang, W. Hofmann, W. Newman, C. R. Mackay, J. Sodroski, and D. 
Gabuzda. 1997. CCR3 and CCR5 are co-receptors for HIV-1 infection of 
microglia. Nature 385:645-649. 

43. Huang, Y., W. A. Paxton, S. M. Wolinsky, A. U. Neumann, L. Zhang, T. He, 
S. Kang, D. Ceradini, Z. Jin, K. Yazdanbakhsh, K. Kunstman, D. Erickson, 
£. Dragon, N. R. Landau, J. Phair, D. D. Ho, and R. A. Koup. 1996. The role 
of a mutant CCR5 allele in HIV-1 transmission and disease progression. Nat. 
Med. 2:1240-1243. 

44. Hwang, S. R., T. J. Boyle, K. Lyerly, and B. R. Cullen. 1992. Identification of 
the envelope V3 loop as the primary determinant of cell tropism in HIV-1. 
Science 253:71-74. 

45. Ivanoff, L. A., J. W. Dubay, J. F. Morris, S. J. Roberts, U Gutshall, E. J. 

Sternberg, E. Hunter, T. J. Matthews, and S, R. Petteway, Jr. 1992. V3 loop 
of the HIV-1 envelope protein is essential for virus infcctivity. Virology 
187:423^32. 

46. Jansson, M., M. Popovic, A. Karlsson, F. Cocdii, P. Rossi, J. Albert, and H. 
Wtgzell. 1996. Sensitivity to inhibition by p-chcmokincs correlates with bio- 
logical phenotypes of primary HIV-1 isolates. Proc Natl. Acad. Sci. USA 

93:15382-15387. 

47. Koito, A., G, Harrowe, J. A. Levy, and C. Cheng-Mayer. 1994. Functional 
role of the V W2 region of human immunodeficiency virus type 1 envelope 
glycoprotein gpl20 in infection of primary macrophages and soluble CCM 
neutralization. J. Virol. 68:2253-2259. 

48. Kozak, S. L., E. J. Piatt, N. Madani, F. E. Ferro, Jr., K. Peden, and D. Kabat. 
1997. CD4, CXCR-4, and CCR-5 dependencies for infections by primary 
patient and laboratory-adapted isolates of human immunodeficiency vims 
type 1. J. Virol. 71:873-882. 

49. Kuiken, C. L., J. J. de Jong, E. Baan; W. Keulen, M. Tersmette, and J. 
Goudsmith. 1992. Evolution of V3 envelope domain in proviral sequences 
and isolates of human immunodeficiency virus type 1 during the transition of 
the viral biological phenotype. J. Virol. 66:4622-4627. 

50. Lapham, C. K., J. Ouyang, B. Chandrasekhar, N. Y. Nguyen, D. S. Dimitrov, 
and H. Golding. 1996. Evidence for cell-surface association between fusin 
and the CD4-gpl20 complex in human cell lines. Science 274:602-605. 

51. Liu, R., W. A. Paxton, S. Choe, D. Ceradini, S. R. Martin, R. Honik, M. E, 
MacDonald, H. Stuhlmann, R. A. Koup, and N. R. Landau. 1996. Homozy- 
gous defect in HIV- 1 coreccptor accounts for resistance of some multiply- 
exposed individuals to HIV-1 infection. Cell 86:367-377. 

52. Luciw, P. A^ £. Pratt-Lowe, K. £. S. Shaw, J. A. Levy, and C. Cheng-Mayer. 

1995. Persistent infection of rhesus macaques with T-ccll-line-tropic and 
macrophage-tropic clones of simian/human immunodeficiency viruses 
(SHIV). Proc. Natl. Acad. Sci. USA 92:7490-7494. 

53. Maddon, P. J., A. G. Dalgleish, J. S. McDougal, P. R. Clapham, R. A. Weiss, 
and R. Axel. 1986. The T4 gene encodes the AIDS virus receptor and is 
expressed in the immune ^stem and the brain. Cell 47:333-348. 

54. Marcon, L., H. Choe, K. A. Martin, M. Farzan, P. D. Ponath, L. Wu, W. 
Newman, N. Gerard, C. Gerard, and J. Sodroski. 1997. Utilization of C-C 
chemokine receptor 5 by the envelope glycoproteins of a pathogenic simian 
immunodeficiency virus, S\Vm^^9. J. Virol. 71:2522-2527. 

55. McKnight, A., P. R. Clapham, and R. A. Weiss. 1994. HIV-2 and SIV 
infection of nonprimatc cell lines expressing human CD4: restrictions to 
replication at distinct stages. Virology 201:8-18. 

56. McKnight, A., R. A. Weiss, C. Shotton, Y. Takeuchi, H. Hoshino, and P. IL 
Clapham. 1995. Change in tropism upon immune escape by human immu- 
nodeficiency virus. J. Virol. 69:3167-3170. 

57. Meyer, A., A. J, Coyle, A- E, 1. Proudfoot, T. N. C. Wells, and C. A. Power. 

1996. Cloning and characterization of a novel murine macrophage inflam- 
matory protein- la receptor. J. Biol. Chem. 271:14445-14451. 

58. Moore, J. P., B. A. Jameson, R. A. Weiss, and Q. J. Sattentau. 1993. The 
HIV-ccll fusion reaction, p. 233-289. In J. Bcntz (cd.). Viral fusion mecha- 
nisms. CRC Press, Boca Raton, Fla. 

59. Nussbaum. O., C. C. Broder, and E. A. Berger. 1994. Fusogenic mechanisms 
of enveloped-virus glycoproteins analyzed by a novel recombinant vaccinia 
virus-based assay quantitating cell fusion-dependent reporter gene activa- 
tion. J. Virol. 68:5411-5422. 

60. Oberlin, E., A. Amara, F. Bachelerie, C. Bessia, J.-L. Virelizier, F. Arenzana- 



0 



Vol, 71, 1997 

Seisdedos, O. Schwartz, J.-M. Heard, I. Clark-Lewis, D. F. Legler, M. 
Loetscher, M. BaggioUni, and B. Moser. 1996. The CXC chemokine SDF-1 
is the ligand for LESTR/fusin and prevents infection by T-ccU-linc-adapted 
HIV-1. Nature 382:833-«35. 

61. O'Brien, W. Y. Koyanagi, A. Namazie, J.-Q. Zbao, A. Diagne, K. Idler, 
J. A. Zacic, and I. S. V. Chen. 1990. HIV-1 tropism for mononuclear phago- 
cytes can be determined by regions of gpl20 outside the CD4>binding do- 
main. Nature 348:69-73. 

62. O'Brien, W. A^ M. Sumner-Smith, S.-H. Mao, S. Sadeghf, J.-Q. Zhao, and 
I. S. Y. Chen. 1996. Anti-human immunodeficiency virus type 1 activity of an 
oligocationic compound mediated via gpl20 V3 interactions. J. Virol. 70: 
2825-2831. 

63. Oravecz, T., M. Pall, and M. A. Norcross. 1996. p- Chemokine inhibition of 
monocytotropic HIV- 1 infection. Interference with a postbinding fusion 
step. J. Immunol. 157:1329-1332. 

64. Page, K. A., S. M. Stearns, and D. R. Littman. 1992. Analysis of mutations 
in the V3 domain of gpl60 that affect fusion and infectivity. J. Virol. 66:524- 
533. 

65. Paxton, W. A., S. R. Martin, D. Tse, T. R. O'Brien, J. Skurnick, N. L. 
VanDevanter, N. Padian, J. F. Braun, D. P. Kotlcr, S. M. Wolinsi^, and R. A. 
Koup. 1996. Relative resistance to HIV-1 infection of CD4 lymphocytes from 
persons who remain uninfected despite multiple high-risk sexual exposure. 
Nat. Med. 2:412-417. 

66. Picard, L., D. A. Wilkinson, A. McKnIght, P. W. Gray, J. A. Hoxie, P. fL 
Clapham, and R. A. Weiss. Role of the amino-terminal extracellular domain 
of CXCR-4 in human immunodeficiency virus type 1 entry. Virology, in 
press. 

67. Pleskol^ O., N. Sol, B. Labrosse, and M. Alizon. 1997. Human immunode- 
ficiency virus strains differ in their ability to infect CD4'^ cells expressing the 
rat homolog of CXCR-4 (fusin). J. Virol. 71:3259-3262. 

68. Pleskoff, O., C. Treboute, A. Brclot, N. Hevekcr, M. Seman, and M. Alizon. 
A chemokine receptor encoded by the human cytomegalovirus is a cofactor 
for HIV-1 entry. Science, in press. 

69. Premack, B. A., and T. J. SchalK 1996. Chemokine receptors: gateways to 
inflammation and infection. Nat. Med. 2:1174-1178. 

70. Raport, C. J., J. Gosling, V. L. Schweickart, P. W. Gray, and 1. F. Charo. 
1996. Molecular cloning and functional characterization of a novel human 
CC chemokine receptor (CCR5) for RANTES, MIP-p, and MlP-la. J. Biol. 
Chcm. 271:17161-17166. 

71. Ruckcr, J., M. Samson, B. J. Doranz, F. Libert, J. F. Bereson, Y. Yi, R. G. 
Coliman, C. C. Broder, G. Vassart, R. W. Doms, and M. Parmentier. 1996, 
Regions In p -chemokine receptors CCR5 and CCR2b that determine HIV-1 
cofactor specificity. Cell 87:437-446. 

72. Samson, M., O. Labbe, C. MoUereau, G. Vassart, and M. Parmentier. 1996. 
Molecular cloning and functional expression of a new human CC-chemokine 
receptor gene. Biochemistry 11:3362-3367. 

73. Samson, M., F. Libert, B. J. Doranz, J. Rucker, C. Liesnard, C.-M. Farber, 
S. SaragostI, C. Lapoum^roulie, J. Cognaux, C. Forcellle, G. Muyldermans, 
C. Verhofstede, G. Burtonboy, M. Georges, T. Imai, S. Rana, Y. Yi, R. J. 
Smyth, R. G. Coliman, R. W. Doms, G. Vassart, and M. Parmentier. 1996. 
Resistance to HIV-1 infection in Caucasian individuals bearing mutant alleles 
of the CCR-5 chemokine receptor gene. Nature 382:722-725. 

74. Schwartz, O., M. Alizon, J. M. Heard, and O. Danos. 1994. Impairment of T 
cell receptor-dependent stimulation in CD4'^ lymphocytes after contact with 
membrane-bound HlV-1 envelope, glycoprotein. Virology 198:360-365. 

75. Shioda, T., J. A. Levy, and C. Cheng-Mayer. 1991. Macrophage and T-cell 



DOMAINS OF CCR-5 INVOLVED IN HIV-1 ENTRY 5011 



line tropisms of HIV-1 arc determined by specific regions of the envelope 
gpl20 gene. Nature 349:167-169. 

76. Simmons, G., P. R. Clapham, L. Picard, R. E. Offord, M. M. Rosenkilde, 
T. W. Schwartz, R. Baser, T. N. C. Wells, and A. £. 1. Proudfoot. 1997. Potent 
inhibition of HIV-1 infectivity in macrophages and lympho(^cs a novel 
CCR5 antagonist. Science 276:276-279. 

77. Simmons, G., A. McKnight, Y. Takcuchi, H. Hoshino, and P. R. Clapham. 
1995. Cell-to-cell fusion, but not virus entry in macrophages by T-cell line 
tropic HIV-1 strains: a V3 loop-determined restriction. Virology 209:696- 
700. 

78. Simmons, G., D. Wilkinson, J. D. Reeves, M. T. Dittmar, S. Beddows, J. 
Weber, G. Carnegie, U. Desselberger, P. W. Gray, R. A. Weiss, and P. R. 
Clapham. 1996. Primary, syncytium-inducing human immunodeficiency virus 
type 1 isolates are dual-tropic and most can use either Lestr or CCR5 as 
corcccptors for virus entry. J. Virol. 70:8355-8360. 

79. Sullivan, N., M. Thali, C. Furman, D. D. Ho, and J. Sodroski. 1993. Effects 
of amino acid changes in the V2 region of the human immunodeficiency 
virus type 1 gpl20 glycoprotein on subunit association, syncytium formation, 
and recognition by neutralizing antibody. J. Virol. 67:3674-3679. 

80. TakeuchI, Y., M. Akutsu, K. Murayama, N. Shlmozu, and H. Hoshino. 1991. 
Host-range mutant of human immunodeficiency virus type^l: modification of 
cell tropism by a single point mutation at the neutralization epitope in the 
env gene. J. Virol. 65:1710-1718. 

81. Trkola, A., T. Draglc, J. Arthos, J. M. Binley, W. C Olson, G. P. Allaway, C. 
Cheng-Mayer, J. Robinson, P. J. Maddon, and J. P. Moore. 1996. CD4- 
dependent, antibody-sensitive interactions between HIV-1 and its co-recep- 
tor CCR.5. Nature 384:184-187, 

82. Waln-Hobson, S., J.-P. Vartanian, M. Henry, N. Chenciner, R. Cheynier, S. 
Delassus, L. P. Martins, M. Sala, M. T. Nug^re, D. Guetard, D. Klatzmann, 
J.-C. Gluckmann, W. Rosenbauro, F. Barr^-Sinoussi, and L. Montagnier. 
1991. LAV revisited: origins of the early HIV-1 isolates from Institut Pas- 
teur. Science 252:961-965. 

83. Weiss, R. A. 1993. Cellular receptors and viral glycoproteins involved in 
retrovinis entry, p. 1-108, In J. A. Levy (ed.). The retroviridae, vol. 2. Plenum 
Press, New York, N.Y. 

84. Wells, T. N., C. A. Power, M. Lustl-Narasimhan, A. J. Hoogewerf, R. M. 
Cooke, C. W. Chung, M. C. Peitsch, and A. E. Proudfoot. 1996. Selectivity 
and antagonism of chemokine receptors. J. Leukocyte Biol. 59:53-60. 

85. Westervelt, P., H. E. Gendelman, and L. Ratner. 1991. Identification of a 
determinant within the human immunodeficiency virus type 1 surface enve- 
lope glycoprotein critical for productive infection of primary monoi^tes. 
Proc. Natl. Acad. Sci. USA 88:3097-3101. 

86. Westervelt, P., D. Trowbridge, L. Epstein, B. Blumberg, Y. Li, B. Hahn, G. 
Shaw, R. Price, and L. Rattner. 1992. Macrophage tropism determinants of < 
human immunodeficiency vims type 1 in vivo. J. Virol. 66:2577-2582. 

87. Willey, R. L., T. S. Theodore, and M. A. Martin. 1994. Amino acid substi- 
tutions in the human immunodeficiency virus type 1 gpl20 V3 loop that 
change tropism also alter physical properties of the virion envelope. J. Virol. 
68:4409-4419, 

88. Wu, L., N. P. Gerard, R. Wyatt, H. Choc, C. Parolin, N. Ruffing, A. Borsetti, 
A. A. Cardoso, E. Desjardin, W. Newman, C. Gerard, and J. Sodroski. 1996. 
CD4-induced interaction of primary HIV-1 gpl20 glycoproteins with the 
chemokine receptor CCR-5. Nature 384:179-183- 

89. Zhang, L., Y. Huang, T. He, Y. Cao, and D. D. Ho. 1996. HIV-1 subtype and 
second-receptor use. Nature 383:768. 



Journal of Virology, June 1997, p. 4744-4751 Vol. 71, No. 6 

0022-538X/97/$04.00 + 0 

Copyright © 1997, American Society for Microbiology 

Role of the First and Third Extracellular Domains of CXCR-4 
in Human Immunodeficiency Virus Coreceptor Activity 

ANNE BRELOT, NIKOLAUS HEVEKER, OLIVIER PLESKOFF, NATHALIE SOL, and MARC ALIZON* 

INSERM U332, InstUute Cochin de Genetique Moleculaire, 75014 Paris, France 

Received 17 January 1997/Accepted 4 March 1997 

The CXCR-4 chemokine receptor and CD4 behave as coreceptors for cell line-adapted human immunode- 
ficiency virus types 1 and 2 (HTV-l and HIV-2) and for dual-tropic HIV strains, which also use the CCR-5 
coreceptor. The cell line-adapted HIV-l strains LAI and NDK and the dual-tropic HrV-2 strain ROD were able 
to infect €04*^ cells expressing human CXCR-4, while only LAI was able to infect cells expressing the rat 
homolog of CXCR-4. This strain selectivity was addressed by using human-rat CXCR-4 chimeras. All chimeras 
tested mediated LAI infection, but only those containing the third extracellular domain (e3) of human CXCR>4 
mediated NDK and ROD infection. The e3 domain might be required for the functional interaction of NDK and 
ROD, but not LAI, with CXCR-4. Alternatively, LAI might also interact with e3 but In a different way. 
Monoclonal antibody 12G5, raised against human CXCR-4, did not stain cells expressing rat CXCR-4. 
Chimeric human-rat CXCR-4 allowed us to map the 12G5 epitope in the e3 domain. The ability of 12G5 to 
neutralize infection by certain HIV-l and HlV-2 strains is also consistent with the role of e3 in the coreceptor 
activity of CXCR-4. The deletion of most of the amino-terminal extracellular domain (el) abolished the 
coreceptor activity of human CXCR-4 for ROD and NDK but not for LAI. These results indicate that HIV 
strains have different requirements for their interaction with CXCR-4. They also suggest differences in the 
interaction of dual-tropic HIV with CCR-5 and CXCR-4. 



Human immunodeficiency virus types 1 and 2 (HIV-l and 
HIV-2) infect cells by a membrane fusion process mediated by 
their envelope glycoproteins (gpl20/gp41, or Env) that re- 
quires the interaction of gpl20 with the cellular receptor CD4 
(reviewed in references 27 and 41). The complete resistance of 
several nonhuman and human CD4'*' cell lines to HIV-l in- 
fection and to the formation of syncytia with Env"^ cells sug- 
gests that other cellular factors are required for Env-mediated 
fusion (3, 6, 10, 13, 19, 25). The existence of such factors could 
also explain cell tropism differences between clinical isolates 
(primary strains) and cell line-adapted strains (23). Primary 
strains generally have a non-syncytium-inducing (NSI) pheno- 
type and replicate in peripheral blood mononuclear cells and 
macrophages but not in CD4'*' cell lines. Certain primary 
strains, usually isolated at late stages of HIV infection, have a 
syncytium-inducing (SI) phenotype and can replicate in CD4"*^ 
cell lines and peripheral blood mononuclear cells, but most 
HIV strains with these properties have been obtained by ex 
vivo adaptation. Finally, certain HIV strains, termed dual 
tropic, display features of both primary and cell line-adapted 
strains. These phenotypic differences among HIV strains are 
principally supported by genetic differences in their gpl20 sur- 
face envelope proteins, in particular in the third hypervariable 
(V3) domain (reviewed in references 18, 27, and 41). 

The cellular factors responsible for the permissiveness of 
CD4'*" cells to HIV-l entry were found to be members of the 
chemokine receptor family. Chemokines are small (molecular 
weight, 8,000 to 10,000) soluble proteins that mediate leuko- 
cyte chemotaxis by interacting with a family of G -protein- 
coupled receptors with seven meibbrane-spanning (TM) do- 
mains. Chemokines are classified into two groups, a (or CXC) 
and p (or CC), depending upon the position of two conserved 
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cysteines in their amino-terminal domains (reviewed in refer- 
ence 28). A factor allowing entry of cell line-adapted HIV-l 
strains was first identified by genetic complementation of mu- 
rine CD4'*" cells and was named fusin (16). It was later shown 
to be a receptor for stromal cell-derived factor 1 (SDF-1), a 
CXC chemokine, and renamed CXCR-4 (5, 30). This chemo- 
kine receptor can also be used for cell entry by HIV-2 strains, 
in both CD4-dependent (33) and CD4-independent (15) 
modes. It does not allow infection by NSI (or macrophage- 
tropic) HIV-l strains, which were found to use CCR-5, a re- 
ceptor for the CC chemokines RANTES, MlP-la, and MIP-lp 
(2, 9, 11, 12, 14). Dual-tropic HIV-l (9, 12), primary HIV-l 
with an SI phenotype (37, 44), and HIV-2 strains such as ROD 
(38) can use both CXCR-4 and CCR-5 for cell entry. In addi- 
tion to these chemokine receptors, certain HIV-l strains seem 
to be able to use CCR-3 and CCR-2b (9, 12). 

The mechanism by which chemokine receptors participate in 
the process of HIV-l entry is not entirely known, but different 
experiments suggest that gpl20 interacts with CXCR-4 or 
CCR-5 (22, 40, 43). This interaction is markedly stronger in the 
presence of CD4, which suggests that chemokine receptors and 
CD4 behave as coreceptors for HIV. The identification of the 
domains of the chemokine receptors that interact with gpl20 is 
important for understanding the process of HIV-l entry. We 
have previously shown that human CXCR-4 was a coreceptor 
for the genetically divergent HIV-l strains LAI (clade B) and 
NDK (clade D) and for HIV-2Rop- I" contrast, the rat ho- 
molog of CXCR-4 was able to efficiently mediate infection by 
LAI but not by NDK and ROD (33). This strain selectivity was 
used to identify domains of CXCR-4 supporting HIV corecep- 
tor activity. 

MATERULS AND METHODS 

Cell lines and viral strains. The" U373MG-CD4 cell line (19) and HcLa cell 
lines stably expressing wild-type or chimeric LAI Env (33, 36) have been de- 
scribed previously. HIV-l (sj^k (39) and HIV-l (1) were propagated in the 
T-cell line CEM. HIV-l la, was produced by transfection of HeLa cells with a 
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recombinant provirus (32). HIV-2roi> produced by a chronicalty infected 
cell line (7). 

Construction of chimeric CXCR-4. The Rc/CMV vectors allowing expression 
of human and rat CXCR-4 from the cytomegalovirus immediate-early promoter 
have been described previously (33). The coding sequence of the rat CXCR-4 
cDNA (kindly provided by R. S. Duman) was determined by direct automated 
sequencing of the expression vector. Several differences from the previously 
published sequence (42) (GenBank accession no. U54791) were noted, ail re- 
sulting in Identity of rat with human and/or mouse CXCR-4 (see Fig. 2). Con- 
structs A through G were obtained by substitution of rat and human CXCR-4 at 
one or two of the conserved restriction sites BstEU, Hindi, and ApalA (see Fig. 
2 and 3). Constructs H, 1, and J, were obtained by ligation of PCR fragments 
amplified from rat and human CXCR-4 into Rc/CMV between thei/f/idlll and 
JVo/I sites. The rat CXCR-4 fragments were amplified with T7 (5'-AATACGA 
CTCACTATAGG) as the 5' primer and either Nru-1 (5'-CGAAGATGATGT 
CAGGG), Mfc-l (5'-TGGAACACCACCATCC), or Cla-1 (5'-TCGATGCTG 
ATCCCC) as the 3' primer. The human CXCR-4 fragments were amplified with 
cither Nru-2 (S'-CGAACGTCAGTGAGGCAG), Mfe-2 (5'-ATTGCAGCACA 
TCATGGTTGG), or Cla-Z (5'-TTCCTTCATCCTCCTGG) as the 5' primer 
and SP6 (5'-ATTTAGGTACACTATAG) as the 3' primer. Constructs H to J 
were obtained by blunt-end ligation of PCR fragments T7/Nru-1 and Nru-2/SP6, 
T7/Mfe-1 and Mfe-2/SP6, and T7/Cla-1 and Cla-2/SP6, respectively, creating the 
restriction sites Nml at the TM4/e3 junction (construct H), Mfel at the e3/TM5 
junction (construct I), and Clal at the TM6/e4 junction (construct J). Construct 
K was derived by substitution of the ApalA-Notl fragment from rat CXCR-4 into 
construct J. Construct L was obtained by blunt-end ligation of PCR fragments 
T7/Qa-1, amplified from construct A, and CIa-2/SP6, amplified from construct C. 
Construct M was obtained by blunt-end ligation of PCR firagments T7/Mfe-U 
amplified from construct H, and Mfe-2/SP6, amplified from rat CXCR-4. Amino 
acid changes in the resulting protein, introduced by primers Mfe-1 and Mfc-2 
(V194M and P198L), were shovm to be detrimental to the coreceptor function 
and were corrected by site-directed mutagenesis with the oligonucleotide 5'-CA 
TGATGTGCTGAAACTGGAACACAACCACCCACAAGTCATr on a sin- 
gle-stranded DNA template. Construct N was obtained by blunt-end ligation of 
PCR fragments amplified from human CXCR-4 with primers T7 and Nru-4 
(5'-CGAAGATGAAGTCGGGAA) and from construct I with primers Nru-3 
(5'-GGAATGTCAGCCAGGGGG) and SP6. In this chimera, the first amino 
acid of e3 corresponds to the sequence of human CXCR-4 (N176) and not to rat 
CXCR-4 (D173). 

Mutations In human CXCR-4. The Ai4 mutant was obtained by blunt-end 
ligation between the Oral site (position 925 relative to the initial ATG codon) 
and a Smal site of the pUClS polylinker, creating a frameshift in the carbo;^- 
terminal domain after F309, with the addition of 10 irrelevant amino acids. The 
NllQ and A4-41 mutants were obtained by site-directed mutagenesis on a 
single-stranded human CXCR-4 template with the oligonucleotides 5'-GGTGT 
A TTGATCA GAAGTGT (flc/l site underlined) and 5'-GTAGATGGTGGGC 
CQCTCCATGGT {BspllOl site underlined), respectively. The A4-36 mutant 
was obtained by ligation of the partly complementary oligonucleotides 5'-CAT 
GGAGGGGAATAAGATCTTCC and S'-CCGAGGAAGATCTTATTCCCCT 
into the A4-41 mutant digested with Ncol (position —2 relative to the initial 
ATG codon) and Bspl20l. The A2-9 mutant was obtained by digestion of the 
NllQ mutant with Ncol and Bctl, filling in with T4 RNA polymerase, and 
blunt-end ligation. 

Functional assays for CXCR-4. The U373MG-CD4 cells were transfected with 
wild-type or mutant CXCR-4 expression vectors in 24-wcil plates by calcium 
phosphate precipitation (overnight contact). Infections or cocultures were per- 
formed 24 h after transfection. The virus inoculum (10 to 30 ng of p24 antigen 
per well) was left in contact with cells for 36 to 40 h. Cocultures with Env* cells 
(1:1 ratio) were grown for 24 h. Cells were then fixed and stained for p-galac- 
tosidase activity with the X-Gal (5-bromo-4-chloro-3•indolyI•^-D-galactopyrano- 
side) substrate, as described previously (13). Blue-stained cells or foci were 
scored under a magnification of X20. 

Flow cytometry. COS cells were transfected with expression vectors for 
CXCR-4 and green fluorescent protein (QFP) (EGFP-Nl vector; Clontech, Palo 
Alto, Calif.) by calcium phosphate precipitation. Cells were detached with phos- 
phate-buffered saline containing 1 mM EDTA and pelleted 36 to 48 h after 
transfection. About lO** cells were incubated for 1 h at 4*C with monoclonal 
antibody 12G5 (kindly provided by J. Hoxic) at a concentration of 7.5 pig/ml in 
phosphate-buffered saline containing 2% bovine serum. Cells were washed three 
times and incubated for 1 h with phycociythrin-conjugated rabbit anti-mouse 
immunoglobulins (Dako, Glostrub, Denmark). Stained cells were washed, fixed 
in 2% paraformaldehyde, and analyzed on an Epics Elite flow cytometer (Coul- 
tronics). 



RESULTS 

Strain specificity of rat CXCR-4. The ability of wild-type or 
mutant chemokine receptors to behave as HIV coreceptors 
was tested by their expression in the 004"^ human glioma cell 
line U373MG-CD4, which is naturally resistant to HI V-1 entry 
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and to fusion with Env"*" cells (19). This cell line was stably trans- 
fected with a Tat-inducible lacZ reporter gene (LTR/acZ), allow- 
ing US to detect complementation for HIV infection and for fu- 
sion with Taf^ Env^ cells by a simple in situ p-galactosidase 
assay, as described previously (19, 33). 

Upon expression of human CXCR-4 or its rat homolog, 
U373MG-CD4 cells were susceptible to infection with HIV- 
Ilai and fusion with cells stably expressing the LAI Env pro- 
teins (Fig. 1). In contrast, only human CXCR-4 allowed infec- 
tion of U373MG-CD4 cells with HIV-1 

NDK or H.IV-2j^oj5 and 
fusion with cells stably expressing chimeric LAI Env with the 
V3 loop of ROD or NDK (Fig. 1). In all subsequent experi- 
ments, parallel results were obtained in infection assays with 
NDK or ROD and in syncytium formation assays with HeLa 
cells stably expressing chimeric LAI Env with the V3 loop of 
NDK or ROD. 

Human-rat CXCR-4 chimeras. Human and rat CXCR-4 
have more than 90% amino acid identity. The differences are 
found essentially in the extracellular domains, in particular el 
and e3, while the TM and intracellular (i) domains are highly 
conserved or identical (Fig. 2). A first set of chimeras (con- 
structs A to G) was constructed by using restriction sites con- 
served between the rat and human CXCR-4 cDNAs. The 
transfection of these chimeras into U373MG-CD4 cells al- 
lowed infection with HIV-I^l^ 3 (a recombinant HIV-1 strain 
with LAI env) (Table 1) and fusion with HeLa cells stably 
expressing LAI Env (Fig. 3b). Infection mediated by chimeras 
A and E was less efficient while infection mediated by chimeras 
D and F, both with the TM7 and i4 domains from rat CXCR-4, 
was more efficient than infection mediated by human CXCR-4. 
These diflferences were not strain specific and did not seem to 
be supported by differences in cell surface expression (Fig. 3c). 
This first series of constructs allowed us to rule out a role for 
genetic differences in the el to TM3 and i4 domains in the 
selectivity of rat CXCR-4 for LAI and in the ability of human 
CXCR-4 to mediate infection by NDK or ROD. Accordingly, 
chimeric rat CXCR-4 with the 12 to e4 domains of human 
CXCR-4 (construct F) had the same phenotype as human 
CXCR-4. Since the i2, i3, and TM5 domains of human and rat 
CXCR-4 are identical, the phenotypic differences between hu- 
man and rat CXCR-4 can be supported only by genetic varia- 
tion in the TM4, e3, TM6, and e4 domains. 

The role of these domains was addressed with another series 
of chimeric receptors (constructs H to J) with replacements 
at the TM4/e3, e3/TM5, and TM6/e4 junctions (Fig. 3). Only 
construct H was able to mediate the infection of U373MG- 
CD4 cells by NDK or ROD (Table 1) and their fusion with 
HeLa cells expressing chimeric LAI Env (Fig. 3b). However, 
its efficiency was markedly reduced relative to that of infec- 
tion mediated by human CXCR-4 or construct F. The com- 
parison of results obtained with chimeras H and I indicates 
that the e3 domain of human CXCR-4 was necessary for 
NDK and ROD infection, while experiments with constructs 
K and L showed that differences in the e4 domain had no 
apparent role in mediating infection (Table 1). The substi- 
tution of the e3 domain from rat CXCR-4 into human 
CXCR-4 (construct N) abolished the coreceptor activity of 
CXCR-4 for NDK but not for LAI (Table 2). The reciprocal 
chimera (rat CXCR-4 with human e3 [construct M]) was 
able to mediate infection of U373MG-CD4 cells with LAI 
(Table 2), NDK (Table 2), and ROD (data not shown). It 
also allowed fusion with cells expressing LAI Env with NDK 
V3 (Fig. 3b). The efficiency of NDK infection was lower 
(about 50%) when U373MG-CD4 cells expressed chimera 
M than when they expressed human CXCR-4. However, 
chimera M was able to mediate NDK infection much more 
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FIG. 1. Coreccptor activity of human and rat CXCR-4 for HlV-1 and HlV-2 strains. Target ccWs (U373MG-CD4 [LTRIacZ]) transfected with the indicated 
CXCR-4 expression vector were infected with HIV-1lai, HIV-In^kj or HIV-Zrqd (a) or cocultured with HeLa cell lines stably expressing LAI Env or chimeric LAI 
Env with V3 from NDK or ROD (b). Infection with HIV or fusion with Env* cells results in transactivation of the LTR/ocZ reporter gene and high ^-galactosidasc 
activity. Cells were stained with the X-Gal substrate 36 to 40 h after infection or 24 h after coculture. Bars represent the average numbers of blue-stained foci (log scale) 
in duplicate wells (24-well plate). Asterisks indicate that fewer than 10 foci were obtained. 



efficiently than was chimera H. Also, its surface expression 
was apparently lower than that of chimera H or human 
CXCR-4 (Fig. 3c). The functional differences between chi- 
meras H and M could be due to conformational effects 



induced by chimerization. Overall, these experiments indi- 
cated that differences in the e3 domain were sufficient to 
account for the lack of co receptor activity of rat CXCR-4 for 
NDK and ROD. 
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FIG. 2. Alignment of the amino acid sequences of human, rat, and mouse CXCR-4. The human and rat CXCR-4 sequences were determined directly from the 
expression vectors used in this study. Asterisks indicate discrepancies with the previously established sequence of rat CXCR-4 (42). The mouse CXCR-4 sequence is 
from the work of Nagasawa et al. (29). The predicted TM domains are underlined. The positions of the restriction sites used for construction of rat-human chimeras 
arc indicated. Natural restriction sites arc underlined. 
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TABLE 1. Infection of U373MG-CD4 (LTR/ocZ) cells expressing 
WT or chimeric CXCR-4 by selected HIV strains 



Construct used for 
transfection 


No. of bluc-siaincd ccllsAvelf infected by: 


HIV-1ni>3 


HIV-Indk 


HIV.2roi> 


None (mock) 


It ^ 


A 




Human CXCR-4 


450, 495 


330, 400 


410. 360 


Rat CXCR-4 


400, 460 


1.2 


1.0 


Chimeras 








A 


105, 90 


75,65 


40,50 


B 


280, 310 


2.1 


2,3 


C 


610, 650 


2,4 


1.0 


D 


1,250, 1.040 


590, 530 


410, 470 


E 


115, 105 


0,0 


1,0 


F 


810, 980 


490, 550 


320, 280 


G 


440, 530 


2,0 


4,1 


H 


340, 300 


31,36 


30, 38 


1 


490, 570 


2,0 


3,2 


J 


430,510 


2,0 


2,1 


K 


520, 550 


2,4 


7.4 


L 


650, 730 


410, 480 


520,400 



" Values for duplicate wells (24-well plates). Cells were stained with X-Gal for 
^-galactosidase activity 40 h after infection. AJi constructs were tested in the 
same experiment. Cell counts of >150 were obtained by extrapolation from 
randomly selected fields. 



Deletion mutants. A series of deletions was engineered in 
human CXCR-4 cDNA by using natural restriction sites or 
site-directed mutagenesis. The deletion of the extracellular (e3 
and e4) or intracellular (i2 and i3) loops abolished HIV core- 
ceptor activity, but epitope-tagged forms of these mutants were 
not detected at the cell surface (data not shown), suggesting 
that these deletions were not compatible with correct folding 
of CXCR-4. The deletion of the carboxy-terminai intracyto- 
plasmic domain (Ai4) was fiilly compatible with coreceptor 
activity for LAI, NDK, and ROD (Table 3). Higher numbers of 
infected cells, or syncytia with Env*^ cells, were observed when 
U373MG-CD4 cells were transfected with this mutant than 
with wild-type (WT) human CXCR-4. 

The other mutations were introduced in the amino-terminal 
domain (el) of human CXCR-4. The NllQ mutation, which 
suppressed one of the two potential N gtycosylation sites in 
CXCR-4, increased its coreceptor activity for LAI, NDK, and 
ROD (Fig. 4; Table 3). The other potential N glycosylation 
site, at the TM4/e3 boundary (N176), is not conserved in rat 
CXCR-4. It seems, therefore, that N glycosylation is not nec- 
essary for HIV coreceptor activity. The deletion of the first 
nine amino acids of el (yielding the A2-9 mutant) had no 
apparent effect on infection or cell fusion mediated by LAI 
Env and slightly reduced the efficiency of NDK and ROD 
infection (Table 3). Surprisingly, CXCR-4 with an almost com- 
plete deletion of el (A4-36 mutant) was able to mediate LAI 
infection and the fusion of U373MG-CD4 cells with HeLa cells 
expressing LAI Env (Fig. 4; Table 3). This mutant was less 
efficient than WT CXCR-4, but flow cytometry experiments 
also showed reduced cell surface expression (see below). The 
coreceptor activity of the A4-36 mutant was extremely weak 
for NDK and null for ROD (Table 3), which is more evidence 
that these strains and LAI have different requirements for their 
interaction with CXCR-4. The A4-41 mutant mediated neither 
infection by LAI, NDK, and ROD nor fusion with Env"*^ cells 
(Fig. 4; Table 3). However, an epitope-tagged form was not 
detected at the cell surface (data not shown), suggesting that 
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the deletion, which extends beyond the predicted elA'Ml 
boundary, was not compatible with correct folding of CXCR-4. 

Mapping of the 12G5 epitope. Monoclonal antibody 12G5 
was selected for its ability to block cell fusion mediated by 
HIV-2 Env and was later found to detect a conformational 
epitope of human CXCR-4 (15). It was also recently shown to 
neutralize infection by HIV-1 and HIV-2 strains (26). Flow 
cytometry experiments were performed on simian COS cells 
cotransfected with expression vectors for human or rat 
CXCR-4 and for OFF. More than 80% of cells transfected with 
human CXCR-4 (GFP positive) were stained with 12G5, while 
about 2% of cells transfected with rat CXCR-4 were stained 
(Fig. 3c). Therefore, the 12G5 epitope is not conserved in rat 
CXCR-4. The rat-human CXCR-4 chimeras were tested for 
their reactivity with 12G5; it stained only cells expressing chi- 
meras bearing the e3 domain of human CXCR-4, including 
construct M, in which e3 was the only domain from human 
CXCR-4, This indicated that the 12G5 epitope is located 
within the e3 domain of CXCR-4. However, it cannot be ruled 
out that the I2G5 epitope is formed by residues in e3 and in 
another domain conserved between human and rat CXCR-4. 
The percentage of stained cells and the mean fluorescence 
intensity were lower for cells transfected with chimera M than 
for cells expressing WT CXCR-4 (Fig. 3c and 5d). This could 
be due to reduced surface expression of the chimera or to less 
efficient presentation of the epitope in this context. Cells ex- 
pressing the A4-36 mutant were stained by monoclonal anti- 
body 12G5, although less efficiently than cells expressing WT 
CXCR-4 (Fig. 50- 

DISCUSSION 

The CXCR-4 chemokine receptor behaves as a coreceptor 
for HIV-1 and HIV-2 strains adapted to replicate in immor- 
talized cell lines (15, 16, 33). It is also a coreceptor for dual- 
tropic HIV-1 strains (9. 12), for primary HIV-1 strains with an 
SI phenotype (37, 44), and for certain HlV-2 strains (38), all of 
which can also use CCR-5 as their coreceptor. The rat ho- 
molog of CXCR-4, differing from human CXCR-4 by a num- 
ber of residues in the extracellular domains, is a coreceptor for 
the cell line-adapted strain LAI but not for the cell line- 
adapted strain NDK or for ROD (which also uses CCR-5). A 
series of chimeric molecules was constructed from human and 
rat CXCR-4 and functionally tested by its expression in a 
CD4-*- cell line (U373MG-CD4) naturally resistant to infection 
by HIV-1 and HIV-2 and to Env-mediated cell-cell fusion. All 
chimeras tested allowed infection by LAI, indicating that they 
were folded and expressed at the cell suriEiace, which was con- 
firmed by flow cytometry experiments. In the context of chi- 
meras, the third extracellular domain (e3) of human CXCR-4 
was both necessary and sufficient to allow observation of co- 
receptor activity for NDK and ROD. The simpler interpreta- 
tion of these results is that e3 participates in the interaction of 
human CXCR-4 with NDK and ROD but not with LAI. Al- 
ternatively, these strains might interact differently with e3, with 
only LAI being able to tolerate the amino acid differences 
between human and rat CXCR-4. It seems less likely that e3 
substitutions can have indirect effects and modulate the inter- 
action of HIV with other domains of CXCR-4. although this 
possibility cannot be formally ruled out. 

Monoclonal antibody 12G5 (15) stained cells expressing hu- 
man, but not rat, CXCR-4. It reacted only with chimeras con- 
taining the e3 domain of human CXCR-4, indicating that e3 
bears the 12G5 epitope or part of it. Antibody 12G5 can 
neutralize infection by HIV-1 and HIV-2 strains, but important 
variations depending upon the viral strains and cell types used 
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FIG. 3. (a) Diagrams of CXCR-4 constructs; (b) functionality of chimeric constructs as HIV-1 corcceptors; (c) construct reactivity with antibody 12GS. Chimeric 
constructs were transfected into U373MG-CD4 cells, and ^ncytium formation assays with HeLa cells expressing WT or chimeric LAI Env were performed as described 
in the legend to Fig. 1. Results are shown semiquantitatively, relative to the number of syncytia observed in parallel transfections with WT human CXCR-4. Symbols: 
+ + , >60%; +, 20 to 60%; -, <10%. To assess 12G5 reactivity, COS cells were stained 36 h after cotransfection with WT or chimeric CXCR-4 and OFF vectors and 
analyzed by flow c^ometry as described in the legend to Fig. S. The percentage of 12GS-stained cells among GFP-positive cells was determined. 
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TABLE 2. Infection of U373MG-CD4 cells expressing WT or 
chimeric CXCR-4 by two HlV-1 strains 



Construct used for 
transfection 


No. of bluc-stained cellsAvcll" infected by: 


HIV-1lai 


HIV-Indk 


Human CXCR-4 


2,400. 2,400 


2,000. 2.000 


Rat CXCR-4 


2.400, NA* 


29.28 


Chimera M 


1,300. 1,300 


640,600 


Chimera N 


1,300, 1.200 


26,20 



" Values for duplicate wells. This experiment was performed as described in 
Tabic 1, footnote a. The larger numbers of infected cells relative to those in 
Tables 1 and 3 are due to a higher transfection efficiency. 
NA, not available. 



seem to exist (26). Overall, it seems that cell line-adapted 
HlV-1 strains, such as LAI, are relatively resistant to neu- 
tralization by 12G5 compared to dual-tropic HIV-1 strains 
or to HIV-2 (26). If this result is confirmed, it could also 
suggest that the e3 domain is not required for the interac- 
tion of LAI with CXCR-4, However, the antiviral activity of 
12G5 could have other mechanisms. The binding of 12G5 
might exert steric hindrance effects on domains other than 
e3, or it might even induce CXCR-4 internalization. In that 
case, strains such as LAI might be less affected than others 
by the reduced expression of their coreceptor at the cell 
surface. At this time, it is difficult to sort out these possible 
mechanisms supporting the antiviral activity of antibody 
12G5. 

The mutation of a potential N glycosylation site in the ami- 
no-terminal domain (el) increased the coreceptor activity of 
CXCR-4 for all strains tested. It can be envisioned that the 
absence of N-linked sugars facilitates coreceptor access. N 
glycosylation was also found to be dispensable for the corecep- 
tor activity of CCR-5 (34). The truncation of the carboxy- 
terminal intracytoplasmic domain (i4) increased the corecep- 
tor activity of CXCR-4. The i4 domain contains a number of 



FUNCTIONAL DOMAINS OF CXCR-4 4749 



TABLE 3. Infection of U373MG-CD4 cells expressing WT or 
mutant human CXCR-4 by selected HIV strains 



Construct used for 


No. of blue-stained ccllsAvcll*' infected by: 


transfection 


HIV-1ni^3 


HIV-Indk 


HIV.2rod 


None (mock) 


• 2.2 


U3 


3,5 


WT CXCR-4 


480, 530 


460,480 


320. 270 


CXCR-4 mutants 
Ai4 
NllQ 
A2-9 
A4-36 
A4-41 


750, 660 
. 1.050,1,250 
480, 560 
120. 150 
■5,2 


820, 950 
850, 900 
370,440 
5,20 
7,3 


500, 620 
750, 810 
250, 220 
2.3 
0.0 



* Values for duplicate wells. This experiment was performed as described in 
Table 1, footnote a. 



serine and threonine residues representing targets for phos- 
phorylation events modulating the activity of G-protein-cou- 
pied receptors (28). The coupling of CXCR-4 via its i4 domain 
is therefore dispensable for HIV coreceptor activity. The in- 
creased activity of the Ai4 mutant was probably an indirect 
eifect of the i4 deletion on, for example, the lateral mobility or 
the turnover of CXCR-4 at the cell surface. The intracellular 
loops i2 and i3 are also involved in coupling to the signal 
transduction machinery, but their deletion was not compatible 
with the surface expression of CXCR-4, due probably to severe 
conformational effects. A more detailed mutagenesis study of 
these domains will be required to address their possible role in 
HIV coreceptor activity. 

The deletion of the first nine amino acids of el had no 
apparent effect on the coreceptor activity of CXCR-4 for LAI 
and slightly reduced its activity for NDK and ROD. The almost 
complete deletion of el (yielding the A4-36 mutant) was com- 
patible with the cell surface expression and coreceptor activity 
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FIG. 4. Coreceptor activity of CXCR-4 with mutations or deletions in the amino-tcrminal cl domain. Transfection of U373MG-CD4 cells and syncytium formation 
assays with HeLa cells stably expressing LAI Env were performed as described in the legend to Fig. 1. 
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FIG. 5. Flow cytometry analysis of CXCR-4 expression. COS cells were 
transfected with a GFP expression vector and with WT or mutant CXCR-4, as 
indicated (1:6 ratio), and stained cither with the anti-CXCR-4 monoclonal an- 
tibody 12G5 and a phycoerythrin-conjugated secondary antibody or with the 
secondary antibody only (shaded curves in panels c to f)> (^ ^) Two-color 
analysis of 12GS (x axis, red fluorescence) and GFP (y axis, green fluorescence) 
expression (arbitrary units); (c to f) distribution of 12GS expression among GFP- 
positivc cells (>10^ arbitraiy units) (x axis, red fluorescence; y axis, cell numbers). 



of CXCR-4 for LAI. although both were reduced relative to 
those of WT CXCR-4. Notably, structure-function studies on 
the interleukin-8 receptor (CXCR-1) have suggested the exis- 
tence of a disulfide bridge between cysteines of el and e4 (20). 
These cysteines are conserved in other chemokine receptors, 
including CXCR-4. Our results with the A4-36 mutant indicate 
that the formation of a disulfide bridge between el and e4 is 
not absolutely necessary for the processing of CXCR-4 to the 
cell surface. The A4-36 mutant had very low coreceptor activ- 
ity for NDK and did not mediate infection with ROD. The el 
domain might therefore be dispensable for the interaction of 
CXCR-4 with gpl20 of LAI but required for the other strains. 
It can also be envisioned that LAI is affected differently by the 
reduced expression of the A4-36 mutant at the cell surface 
than is NDK. or ROD. Similar studies with CCR-5 have shown 
that the el and e2 domains were necessary for coreceptor 
activity for primary and dual-tropic HIV-1 strains, while the e3 
and e4 domains had no apparent role (4, 34). Tlierefore, dif- 
ferences in the interaction of dual-tropic HIV strains with 
CXCR-4 and CCR-5, as well as in the interaction of HIV 
strains with CXCR-4, seem to exist. 
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The HIV surface envelope protein gpl20 seems to interact 
directly with the chemokine receptors CXCR-4 and CCR-5, 
this interaction being markedly reinforced by the presence of 
CD4 (22, 40, 43). The third hypervariable domain (V3) of 
gpl20 was found to be necessary for the interaction of gpl20 
with CCR-5, although other domains of gpl20 were apparently 
also involved in this interaction (40, 43). The role of V3 in the 
interaction of gpl20 with CXCR-4 has not been directly ob- 
served but has been strongly suggested by different experi- 
ments. In cell line-adapted HlV-1, V3 substitutions resulted in 
a coreceptor switch from CXCR-4 to CCR-5 (9). We have 
observed that the V3 domain of lAl gpl20 was required for 
utilization of the rat coreceptor CXCR-4 and conferred this 
property to chimeric NDK gpl20 (33). A direct role for V3 in 
gpl20~CXCR-4 interaction would be consistent with the 
greater sensitivity of cell line-adapted HIV-1 strains to neu- 
tralization by anti-V3 antibodies (reviewed in reference 35) or 
drugs targeting V3 (24, 31) compared with that of primary 
HlV-1 strains using the CCR-5 coreceptor. The V3 mutations 
associated with the switch of HIV-1 strains from the NSI to the 
SI phenotype usually increase the net positive charge of V3 (8, 
17, 21). The e3 domain of CXCR-4 is negatively charged (net 
charge, -3), while the homologous domain of CCR-5 is posi- 
tively charged (+2). It is tempting to speculate that electro- 
static interactions take place between the V3 loop of SI strains 
and the e3 domain of CXCR-4. A detailed mutagenesis study 
of the charged residues in e3 could make it possible to address 
this hypothesis. 

Even if a role for V3 in the interaction of gpl20 with core- 
ceptors is probable, the very slight similarity between HIV-1 
and HI V-2 in this domain suggests a role for a more conserved 
domain(s) of gpl20. It seems unlikely to us that gpl20 can 
accommodate totally distinct sites, conserved across strains 
and primate immunodeficiency viruses, which mediate the in- 
teraction with CCR-5, CXCR-4, and possibly other chemokine 
receptors. A simpler view might be that the same domain, or 
conformation, of gpl26 has the ability to interact with different 
chemokine receptors, possibly via their amino-terminal do- 
mains. This interaction might be sufficient for an HIV strain to 
use CCR-5, but not CXCR-4, as its coreceptor, thus explaining 
the preferential use of CCR-5 by primaiy NSI strains. Muta- 
tions in other domains of gpl20 (in particular, V3) might be 
required for a functional interaction with CXCR-4, possibly 
involving the e3 domain. The HIV strains bearing these mu- 
tations (primary SI or dual tropic) could therefore use either 
CCR-5 or CXCR-4, In cell line-adapted strains, mutations in 
gpl20 resulting in a tighter interaction with CXCR-4 might be 
detrimental to interaction with CCR-5, thus preventing the use 
of this coreceptor. This model is consistent with the observa- 
tion that V3 mutations or substitutions conferring the NSI 
phenotype to a cell line-adapted HIV-1 strain were associated 
to a coreceptor switch from CXCR-4 to CCR-5 (9). 

If a conserved domain of gpl20 allows interaction with che- 
mokine receptors as different as CCR-5 and CXCR-4, it can be 
wondered why other chemokine receptors do not behave as 
HIV coreceptors (CCR-1, CCR-4, or CCR-2a) or can be used 
only by certain viral strains (CCR-3 and CCR-2b). The ability 
to interact with gpl20 might be necessary but not sufficient for 
HIV coreceptor activity, which may require other properties, 
such as the ability to colocalize with CD4 either spontaneously 
or in the presence of gpl20. 
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of the Human Immunodeficiency Virus That Is Critical 

for Infectivity 
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Site-specific mutagenesis was used to introduce amino add sulistitutions at tlie asparagine codons of four 
conserved potential N-linlsed glycosylatlon sites witliin tlie gpl20 envelope protein of human immunodeficiency 
virus (HTV). One of these alterations resulted in the production of noninfectious vfaiis particles. The amino acid 
substitution did not interfere with the synthesis, processing, and stabUity of the env gene polypeptides gpl20 
and gp41 or the bindfaig of gpl20 to its cellular receptor, the CD4 (T4) molecule. Vaccinia virus recombinants 
containhig wUd-type or mutant HIV env genes readily induced syncytia in CIM'^ HeLa cells. These results 
suggest that alterations involving the second conserved domain of the HIV gpl20 may interfere with an essential 
early step in the virus replication cycle other than binding to the CD4 receptor. In long-term cocultures of a 
T4'*' lymphocyte cell line and colon carcinoma cells producing the mutant virus, revertant infectious virions 
were detected. Molecular crharacterization of two revertant proviral clones revealed the presence of the original 
mutation as well as a compensatory amino acid change in another region of HIV gpl20. 



The acquired immunodeficiency syndrome (AIDS) is 
caused by a novel human retrovirus (4, 11, 21) Icnown as the 
human immunodeficiency virus (HIV). In tissue culture 
systems. HIV infects and kills human CD4^ (T4'^) lympho- 
cytes and lymphocyte lines (17). In infected individuals, the 
number of circulating T4'*^ lymphocytes may ultimately 
decline; this reduction is usually correlated with immuno- 
logic dysfunction (19). 

Previous studies have shown that HIV and the T4A 
monoclonal antibody can reciprocally compete in binding 
to T4'*' lymphocytes (25). Analysis of this reaction subse- 
quently revealed that it involved the direct interaction of the 
HIV gpl20 with a specific portion of the CD4 (T4) molecule 
(24, 26). The functional significance of CD4 as a receptor for 
HIV has recently been demonstrated by introducing CD4 
cDNA into nonlymphocyte human cell lines (23). The sub- 
sequent expression of the CD4 antigen on the surface of such 
cells renders them susceptible to HIV infection. 

Viral envelopes: have also been shown to participate in 
early events in the replicative cycle that follow the binding of 
particles to their receptor(s) (8). For example, a fusion step 
is required for passage through the two lipid-bilayer mem- 
branes (in the virus particle and the cell surface) which 
separate the viral genome from the cytoplasm of the infected 
cell. Sendai virus encodes an inactive precursor protein (FO) 
which must be cleaved to the mature Fl and F2 products that 
mediate, via a fusion mechanism, the direct passage of 
particles into cells (15, 33). Virus fusion with the host 
membrane also appears to be the mode of entry for HIV 
(36). 

Analyses of the env genes present in different HIV isolates 
have indicated that they contain several highly variable and 
conserved domains (3, 35, 41). In some African isolates, 
greater than 50% variability was noted at the deduced amino 
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acid level involving stretches of more than 100 residues of 
gpl20 (3, 41), Despite this degree of env heterogeneity, all 
HIV isolates tested have retained their tropism for T4* 
lymphocytes and their ability to induce cytopathic effects 
during productive infection in tissue culture. This suggests 
that one or more of the conserved domains within gpl20 are 
involved in the adsorption of HIV to the CD4 molecule and 
subsequent steps in viral replication. 

In this work, we have focused on the second conserved 
HIV env domain (41) and used site-specific mutagenesis 
techniques to target potential N-linked (Asn-X-Ser/Thr) gly- 
cosylatlon sites, where X is any amino acid. One of four HIV 
mutants examined was rendered noninfectious by this pro- 
cedure. In vitro assays indicated that the gpl20 produced by 
this mutant bound to the CD4 molecule. This result implies 
that portions of the HIV envelope not involved in binding to 
CD4 may be necessary for the initiation of productive viral 
infection. Interaction with other cellular surface proteins or 
processing of the bound virions may be required subsequent 
to their interaction with the CD4 receptor. 

MATERIALS AND METHODS 

Oligonucleotide synthesis and purification. Oligodeoxyribo- 
nucleotides were synthesized on an Applied Biosystems 
model 380B DNA synthesizer by using phosphoramidite 
chemistry. Oligonucleotides were purified on 20% polyacryi- 
amide gels and then with a Sep-Pak C18 column (Waters 
Associates) as described previously (31). 

Oligonucleotide-directed mutagenesis. Mutants were de- 
rived from an infectious plasmid clone of HIV, pNL4-3 (1). 
A 2.7-kilobase (kb) EcoRl-BamHl restriction fragment con- 
taining the amino-terminal 2,267 nucleotides of the envelope 
{env) gene was cloned into M13 vectors mpl8 and nipl9, and 
specific nucleotide changes were introduced by oligonucle- 
otide-directed mutagenesis (42). The mutagenic oiigonucle- 
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otides (designation) were 5'-ACCATGTACAcAaGTCAGC 
ACAG-3' (6992). 5'-ACTGCrrGTTAcAaGGCAGTCrrAG-3' 
(7055). 5'-TAGATCTGCCAAaTTCACAGACA-3' (7099). 5'- 
AGTACAGCTGcAaCAATCTGTAG-3' (7136), 5 -CAACT 
GCTGTTA(aat)GGCAGTCTAGCA-3' (7055D). 5'-CAACT 
GCTGgacAATGGCAGTC-3' (7052). and 5'-CTGTTAAAT 
GaCAGTCTAGCA-3' (7058). The lowercase letters identify 
nucleotide changes that were introduced. In 7055D. letters in 
parentheses indicate nucleotides that were deleted. The dif- 
ferent mutant 2.7-kb EcoRl-BamHl fragments were each 
isolated from the replicative form of M13 DNA and cloned 
back into EcoRI-BamHI-restricted pNL4-3 DNA to generate 
mutant HIV plasmid clones p6992, p7055, p7099, p7136, 
p7055D. p7052. and p7058. 

Cell cultures and DNA transfection. A3. 01, a human T4^ 
lymphocytic leukemia cell line (10). was maintained in RPMI 
1640 medium (GIBCO Laboratories) supplemented with 10% 
heat-inactivated fetal calf serum (FCS). Cells (approximately 
4 X 10*/ml) were transfected by electroporating (29) 20 jig of 
uncleaved plasmid DNA. The transfected A3. 01 cells were 
maintained in 25-cm^ tissue culture flasks (Costar) contain- 
ing 10 ml of RPMI-FCS. An adherent colon carcinoma cell 
line, SW480 (ATCC CCL228). was maintained in Dulbecco 
modified Eagle medium supplemented with 10% FCS. Cells 
were trypsinized, diluted, and transferred to 25-cm^ flasks 14 
to 16 h prior to transfection. Individual 60 to 80% confluent 
cultures were transfected with 10 M.g of uncleaved plasmid 
DNA by the calcium phosphate precipitation method (40). 
Cocultures were established by scraping transfected SW480 
cells from individual 25-cm^ flasks 24 to 72 h following 
transfection and adding them to 2 x 10^ A3.01 cells in 
25-cm^ flasks containing 10 ml of RPMI-FCS. 

Infections. The infectivity of progeny virions generated 
from transfection experiments was assayed in the A3 .01 
lymphocyte cell line. Supematants were Altered (0.22|jim 
pore size; Millex) and 300 yA was added to 15-ml conical 
tubes (Falcon) containing 2 x 10* A3. 01 cells. 700 \l\ of 
RPMI-FCS, and 2 |jug of Polybrene (Sigma Chemical Co.). 
After 4 h at 37**C, the cells were washed twice with phos- 
phate-buffered saline, suspended in 10 ml of RPMI-FCS, and 
maintained in 25-cm^ culture flasks. 

RT assay. Virion-associated reverse transcriptase (RT) 
activity was measured as described previously (13) with 
modifications as follows: 10 yA of culture supernatant was 
mixed with 50 ftl of an RT reaction mixture which contained 
a template primer of (A)rt (5 pig/ml) and (dT)i2_i8 

(1.57 jig/ml) 

in 50 mM Tris, pH 7.8, 7.5 mM KCl, 2 mM dithiothreitol, 5 
mM MgClz, 0.05% Nonidet P-40, and 0.5 \LCi of ['^pjdTTP 
(400 Ci/mmol). Following a 90-min incubation at 37'C, 10 ^1 
of the reaction mixture was spotted onto DEAE ion-ex- 
change paper (Whatman) and washed four to six times in 2 x 
SSC (ix SSC is 0.15 M NaCl plus 0.015 M sodium chloride) 
to remove unincorporated [^^PjdTTP. Spots were visualized 
by autoradiography or counted in a scintillation counter. 

The association of RT activity with a particulate fraction 
in the supematants of infected and transfected cells was 
verified by subjecting a 50-|U sample of an HIV stock to 
centrifugation in an air-driven ultracentrifuge (28 Ib/tn', 37 
min) (Beckman). An RT assay on 10 |tl of the uncentrifuged 
HIV sample indicated 6,210 cpm of activity. A 10-pi sample 
of the centrifuged preparation contained only 57 cpm of 
activity. 

Analysis of viral proteins. Immunoblotting was used to 
identify HIV proteins in lysates prepared from cells trans- 
fected with the mutant p7055 and infectious pNL4-3 plas- 
mids. Approximately 10^ SW480 cells were lysed 48 h 
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posttransfection in 100 of buffer containing 50 mM Tris 
hydrochloride, pH 8.0. 0.1 M NaCI. 5 mM EDTA. 1 mM 
phenylmethylsulfonyl fluoride, 0.5% 3-(3-cholamidopropyl- 
dimethylammonio)-l-propane sulfonate, 0.2.% deoxycho- 
late, and 0.2% aprotinin. Lysates (15 yA) were boiled in an 
equal volume of sample buffer (18), and proteins were 
resolved on 3 to 27% gradient sodium dodecyl sulfate 
(SDS)-polyacrylamide gels, followed by electrophoretic 
transfer to nitrocellulose membranes. The membranes were 
incubated at room temperature with sera pooled from AIDS 
patients for 1.5 h and with ^"l-protein A for 1 h, washed, and 
visualized by autoradiography. 

The secretion of newly synthesized HIV gpl20 out of 
transfected cells and into the medium was examined by 
immunoprecipitating (28) supematants from SW480 cells 
previously exposed to pNL4-3 and p7055 DNAs. Cells 
transfected with mutant or infectious HIV plasmids as 
described earlier were washed and incubated for 30 min with 
serum-free Dulbecco modified Eagle medium lacking both 
methionine and cysteine at 40 h following transfection. The 
transfected cells were labeled by adding [^^S]methioninc 
(125 M-Ci/ml) and cysteine (125 jtCi/ml) for 6 h and then 
propagated for an additional 12 h following the addition of 
10% FCS. Tissue culture supematants were filtered through 
0.22-fim filters and lysed in an equal volume of RIPA buffer 
(50 mM Tris hydrochloride, pH 7.5, 0.15 M NaCl, 1% Triton 
X-100, 1% deoxycholate, and 0.1% SDS) containing 0.5% 
aprotinin. The extract was preincubated with 10 pA of control 
human semm at 4**C, cleared twice with 50 yA of Pansorbin 
(Calbiochem), and incubated overnight with 2 yA of the 
indicated sera (control human or AIDS patient RJ1370) at 
4**C. Precipitates were incubated for 30 min with 10 yA of 
Pansorbin, collected by centrifugation, and washed twice 
with RIPA buffer and once with water. The precipitated 
proteins were solubilized by boiling in sample buffer (18), 
resolved on 10% SDS-polyacrylamide gels, and visualized 
by autoradiography. 

CD4-binding assay. The ability of the 7055 mutant gpl20 
env polypeptide to bind to the cellular CD4 gene product was 
analyzed as described previously (19a). Immunoprecipitates 
of gpl20-CD4 complexes were electrophoresed on 10% 
SDS-polyacrylamide gels and visualized by autoradiogra- 
phy. 

Revertant cloning and analysis. Molecular clones of the 
revertant virus p7055RI were obtained from unintegrated 
pro viral DNA. Cell culture supernatant from day 45 of the 
p7055RI coculture experiment (see Fig. 5A) was filtered and 
used to infect A3. 01 cells as described above. Approxi- 
mately 20 X 10^ cells were harvested on day 13, when 
numerous syncytia were observed and unintegrated proviral 
DNA was isolated (14). The unintegrated DNA preparation 
was restricted with EcoRl, ligated to EcoRI-restricted 
XWES B arms (20) with T4 DNA ligase, packaged in vitro 
(37), and plated on Escherichia coli LE392 cells. Approxi- 
mately 5 X 10^ recombinant phage plaques were screened 
(5) with the ^^P-labeled pBenn6 (10) DNA clone of the 
lymphadenopathy-associated virus (LAV) provirus. Phage 
DNAs were restricted with EcoRl and ^amHI, and the 
resultant 2.7-kb fragments, containing the amino- terminal 
2,267 nucleotides of the env gene, were inserted into the 
p7055 mutant plasmid clone. Progeny virions resulting from 
transfections with these constructions were checked for 
infectivity in A3.01 cells. The 2.7-kb EcoKl-BamHl frag- 
ment from revertant HIV proviruses was also transferred 
into M13 vectors mpl8 and mpl9, and DNA sequencing was 
performed as described below. 
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DNA sequendng. DNA was sequenced by using M13 
vectors mpl8 and mpl9, synthetic oligonucleotide primers 
(38), and the dideoxy chain termination procedure (32). 

Syncytium formation. The recombinant plasmid pP£env7h 
was used for insertion of the 7055 env mutant sequence into 
vaccinia virus. pPEenv? was constructed as follows. The 
3.5-kb Sad fragment from HIV strain HTLV-IIIB, which 
contained the entire envelope gene, was cloned into 
M13mpl8. By site-directed oligonucleotide mutagenesis, an 
EcoRV site followed by the eucaryotic translational consen- 
sus sequence CCACC was inserted upstream of the initiating 
ATG. Another EcoKV site was inserted just after the stop 
codon, allowing easy subcloning of the env gene. This 2.5-kb 
EcoRV fragment was subcloned into the Smal site of pSCll, 
i.e., downstream of the vaccinia virus P7.5 promoter (7). To 
create a plasmid containing a unique Hindlll site within 
the envelope gene, partial Hindlll digestion, Klenow fill-in, 
and religation were performed. The resulting plasmid, 
pPEenv7h, contained unique Stul and Hindlll sites within 
the envelope gene. The 1.3-kb Stul-Hindlll region was 
removed from pP£env7h and replaced with the analogous 
fragments from the wild-type pNL4-3 or p7055 mutant clone. 
The mutagenized area in the two resulting plasmids, 
pPEenv4-3 and pPEenv7055, respectively, was sequenced 
for verification. Recombinant vaccinia viruses VPEenv4-3 
and VPEenv7055 were constructed by standard methods (7, 
22). 

Syncytia assays were performed with the HeLa cell line 
T4^5, which has been transformed with the T4 receptor 
gene (23). Cells were infected with vaccinia virus recombi- 
nants at a multiplicity of infection of 5, 1, or 0.1. The 
presence or absence of syncytia was determined 16 to 24 h 
after infection. 

RESULTS 

The nucleotide and deduced amino acid sequences of the 
env gene from several different HIV isolates have been 
determined (3, 9, 27, 30, 35, 39, 41). Computer-assisted 
analyses of these data have indicated that of eight potential 
N-linked glycosylation sites present in the gpl20 of every 
isolate, four are clustered within the second highly con- 
served domain (41), which spans residues and is located 
about 174 codons from the amino terminus of the processed 
external envelope protein. We therefore decided to examine 
the functional significance of this domain by initially target- 
ing the four potential glycosylation sites by in vitro muta- 
genesis techniques. 

Construction and characterization of HIV env mutants. We 
have previously described an infectious molecular clone of 
Hi V (pNL4-3) that was constructed by joining two different 
integrated proviral DNAs at a common EcoRl restriction site 
located at nucleotide position 5779 (1). A 2.7-kb EcoRl- 
BamHl fragment from pNL4-3 DNA (Fig. 1), containing the 
amino-terminal 2,267 nucleotides of the HIV env gene, was 
subcloned into M13 vectors mpl8 and mpl9, and specific 
nucleotide changes were introduced by oligdnucleotide-di- 
reeled mutagenesis (42). Potential N-linked glycosylation 
sites (Asn-X-Ser/Thr, where X is any amino acid) 6992, 7055, 
7099, and 7136 (Fig. 1) (numbers refer to the first nucleotide 
of the asparagine codon, based on the GenBank proviral 
nucleotide sequence of the LAV isolate [39]) were altered by 
changing the asparagine codon. At positions 6992, 7055, and 
7136, the asparagine was converted to a glutamine by 
simultaneously introducing first and third base changes of A 
to C and T to A, respectively. At site 7099, a single base 
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change of T to A at the third position resulted in the 
substitution of lysine for asparagine. The 2.7-kb fcoRI- 
BamHl fragments containing individual amino acid codon 
changes were then inserted back into the infectious molec- 
ular clone, generating four different mutant HIV plasmid 
clones, p6992, p7055, p7099, and p7136. 

Two different biological assays were used for the charac- 
terization of these HIV env mutants. Virus infectivity was 
monitored by electroporating the mutant HIV proviral 
clones into the T4* lymphocyte cell line A3,0l, which has 
been shown to be sensitive to viral infection and transfection 
with HIV proviral DNA (1, 10). The appearance of virion- 
associated RT activity is indicative of a spreading HIV 
infection in this T4'^ lymphocyte line. The second assay 
examined the production of virus particles following calcium 
phosphate-mediated transfection of the colon carcinoma cell 
line SW480 (ATCC CCL228), which has been shown to 
produce virus 24 h after exposure to cloned HIV proviral 
DNA (1). Although SW480 cells can be infected with HIV. 
this is an extremely inefficient system compared with T4'*" 
lymphocytes, since a spreading viral infection cannot be 
detected by RT assays (2). Consequently, transfection of 
SW480 cells was done to monitor virus production rather 
than virus infectivity. In some experiments, the transfected 
SW480 cells were cocultivated with the T4'*' A3.01 cells 3 
days after transfection, permitting assessment of both pro- 
duction and infectivity of progeny virions. 

When HIV env mutant clones p7099 and p7136 were 
evaluated for infectivity by electrbporation into A3.01 cells, 
RT activity was detected in the culture supematants 7 days 
later and persisted through day 16 (Fig. 2A). The synthesis of 
RT paralleled that in cultures exposed to the parental infec- 
tious pNL4-3 plasmid (Fig. 2A) and was accompanied by 
syncytium formation and cell death. The infectivity of env 
mutant 6992 was evaluated by transfection of SW480 cells, 
followed by cocultivation with the T4^ A3 .01 cells as 
described in the preceding paragraph. The 6922 mutant virus 
generated a spreading infection in the A3 .01 cells that was 
indistinguishable from that of the infectious pNL4-3 plasmid 
(data not shown). Thus, elimination of the asparagine codons 
at sites 6992, 7099, and 7136 in the env gene of HIV did not 
alter infectivity or cytopathic properties. 

Unlike the other env mutants, 7055 failed to elicit RT 
activity (Fig. 2A) or generate syncytia in the electroporated 
A3. 01 cells. In other electroporation experiments, this mu- 
tant did not produce a spreading infection in the T4'*' 
lymphocyte line even after more than 60 days (data not 
shown). 

The 7055 env mutation does not affect virion assembly or the 
synthesis of env proteins. The failure of the 7055 HIV env 
mutant to successfully infect A3. 01 cells could reflect a 
defect in any one of a number of steps (e.g., virion assembly, 
protein synthesis) culminating in the generation of defective 
particles. Virion production was monitored by transfecting 
the p7055 mutant clone into the SW480 colon carcinoma 
cells. As shown in Fig. 2B, virion-associated RT activity was 
detected 24, 48, and 72 h following transfection with both 
p7055 and the infectious pNL4-3 plasmids; however, sec- 
ondary passage of HIV to the A3;01 lymphocyte line follow- 
ing cocultivation was only observed in cells transfected with 
the piNL4-3 proviral DNA (Fig. 2B). This again confirmed 
the inability of the 7055 env mutant to initiate a spreading 
infection in T4* cells. A similar result was obtained when 
RT-positive, cell-free filtrates from p7055- and pNL4-3- 
transfected colon cells were used for direct infections of 
A3. 01 cells. A spreading viral infection was only detected in 
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FIG. 1. Locations of potential N-linked glycosylation sites targeted for in vitro mutagenesis. The positions of HIV genes and open reading 
frames are shown at the top. LTR, Long terminal repeat. Diagrammed in the center is a blow-up of an HIV segment extending from the £coRI 
site at 5779 to 8^2. the end of the en\ reading frame. Conserved (open) and variable (solid) regions within the env gene are indicated. 
Triangles denote the positions of pfxx;essing sites. The deduced amino acid sequences of 10 different isolates of HIV (LAV [39], H9PV22 [27], 
CDC42 [9], Z3 and NY5 [41], and Eli and Mai [3]) from within a portion of the second conserved domain of gpl20 are shown at the bottom. 
The numbers indicate amino acid positions. Boxes delineate conserved potential N-linked glycosylation sites (Asn-X-Ser/Thr) in which the 
asparagine codon was changed. Numbers at the bottom refer to the first nucleotide of each asparagine, as described in the text. 



cultures tiifected with pNL4-3 filtrates during a 37-day 
observation period (data not shown). 

Although the results of the virus production assays indi- 
cated that the 7055 env mutant directed the synthesis of 
noilinfectious particles possessing RT activity, they did not 
rule out the possibility that partially assembled virions or 
only nucleocapsid structures were generated due to an 
intrinsic defect in env protein production. The intracellular 
synthesis of 7055 viral proteins was examined by immuno- 
blotting and comparing the lysates of SW480 cells transfec- 
ted with the p7055 mutant and the infectious pNL4-3 plas- 
mid. The protein banding patterns in the two lysates were 
indistinguishable (Fig. 3A). The presence of gpl20 and gp41 
env proteins in the p7055'transfected cells indicated that the 
mutation did not effect the synthesis, processing, or stability 
of these HIV gene products. 

Secretion of 7055 env gpl20 into the culture medium was 
evaluated by labeling transfected SW480 cells with [^^S] 
methionine aiid cysteine, preparing cell-free filtrates of tis- 
sue culture supematants, and immunoprecipitating such 
preparations with sera from AIDS patients known to have 
high antibody titers against gpl20. The two antisera immu- 
noprecipitated gpl20 from the filtrates of p7055- and pNL4- 
3-transfected cells (Fig. 3B). In addition, antiserum from 
AIDS patient RJ1370 precipitated the viral core protein p24. 
This result, in conjunction with the presence of virion- 



associated RT activity in the supernatant (Fig. 2B), sug- 
gested that the 7055 mutation did not affect the assembly and 
release of HIV particles. 

To ensure that no additional nucleotide changes had 
occurred during the generation of the 7055 mutant, recipro- 
cal constructions were made between the p7055 and the 
pNL4-3 clones by exchanging the 2.7-kb Ec6^-Bam)\\ 
fragment previously used to generate the 7055 mutant (Fig. 
1). Both constructs were assayed for infectivity by colon cell 
transfection, cocultivation with A3. 01 cells, and the RT 
assay. No evidence of a spreading infection was detected 
with the pNL4-3 clone containing the 2.7-kb EcoRI-BflmHI 
fragment of p7055. while the infectivity of p7055 containing 
the analogous fragment from pNL4-3 was restored (data not 
shown). Final verification that the asparagine-to-glutamine 
substitution at position 7055 was solely responsible for the 
mutant phenotype was obtained by sequencing the entire 
2.7-kb Eco¥X-BamW. frajgment of the p7055 plasmid. These 
results showed that the two introduced nucleotide changes in 
p7055 were the only differences between it and the parental 
pNL4-3 clone. 

The 7055 mutant gpl26 polypeptide binds to the CD4 
receptor. To determine whether the lack of infectivity asso- 
ciated with the 7055 env mutant was due to the inability of its 
encoded gpl20 env glycoprotein to bind to the CD4 gene 
product, binding activity was evaluated in vitro with a 
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FIG. 2. Infectivity of mutant HIV proviral DNAs. (A) RT activ- 
ity was measured in IO-m^ supernatant samples from A3.01 cells 
electroporated with the infectious (pNL4-3) or env mutant (p70S5, 
p7099, and p7136) plasmids at the indicated times as described in the 
text. (B) RT activity detected in IO-m-I supernatant samples following 
calcium phosphate-mediated transfection of SW480 cells with 
pNL4-3 and p7055 plasmid DNAs. A3. 01 cells were cocultured with 
the transfected SW480 cells beginning on day 3 (as indicated by the 
arrow). 



coimmunoprecipitation assay to measure the formation of 
the gpl20-CD4 molecular complex (24). The 7055 env muta- 
tion was initially transferred into a gpl20 expression vector 
(19a), which directs the synthesis of a truncated gpl20 
(gpl20-trunc) polypeptide. After transfection of mammalian 
cells, gpl20-trunc is secreted into the medium becaiise it 
lacks gp41 sequences which anchor the gpl20-a>41 envelope 
protein complex into the plasma membrane. FoUowiiig 
transfection of cells with the 7055 gpl20-trunc vector, sig- 
nificant levels of [^^S]methionine-labeled polypeptide were 
detected by immunoprecipitation with gpl20-reactive serum 
from an AIDS patient (Fig. 4). The electrophoretic mobility 
and heterogeneity were indistinguishable from those of the 
wild- type gpl20-trunc protein. 

The binding of 7055 gpl20>trunc to CD4 was assessed by 
incubating transfected-cell supematants containing the 
[^^SJmethionine-labeled mutant or wild-type protein with 
detergent-solubilized CD4 receptor prepared from a Chinese 
hamster ovary (CHO) cell line stably transfected with a 
human CD4 cDNA expression vector (D. Smith and D. 
Capon, unpublished results)^ followed by immunoprecipita- 
tion with monoclonal anti-CD4 antibodies to detect the 
gpl20-CD4 complex. Incubation with the OKT4 monoclonal 
antibody resulted in precipitation of comparable amounts of 
both the wild-type and mutant gpl20-trunc polypeptides 
similar to that observed with AIDS serum (Fig. 4). Neither 
gpl20-trunc protein was efficiently precipitated with the 
OKT4A monoclonal antibody, consistent with the behavior 
of HIV virioh-associated gpl20 (24), suggesting that the 
interaction of CD4 with the 7055 env mutant gpl20, like the 
wild- type protein, hinders binding of the OKT4A antibody to 
the CD4 receptor. These results thus demonstrate that the 
ability to bind CD4 was not grossly impaired by the 7055 env 
mutation. 

The 7055 mutant gpl20 induces syncytia. The induction of 
syncytia by HIV has been correlated with the expression of 
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FIG. 3. Production of vims-encoded proteins following transfec- 
tion of the infectious pNL4-3 and p7055 env mutant HIV clones. (A) 
Immunoblots of lysates from LAV-infectcd A3.01 cells (lane 1) and 
pNL4-3 (lane 2) and p7055 (lane 3) lysates from transfected SW480 
cells. (B) Immunoprecipitation of '^S-labeled filtrates prepared from 
SW480 cells transfected with pNL4-3 (lanes 2 to 4) or p7055 (laiies 
5 to 7) cloned proviral DNAs. Serum from AIDS patient RJ1370 was 
used in the immunoprecipitatlons shown in lanes 3 and 6; serum 
from AIDS patient RJ97 was used in lanes 4 and 7. Serum from an 
HIV antibody-negative individual was used in lanes 2 and 5; lane 1 
contains molecular weight (in thousands) standards. The positions 
of the gpl20 env and p24 gag proteins are indicated. 
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FIG. 4. Coimmunoprecipitation analysis of wild-type eriv and 
7055 env mutant polypeptide binding to CD4. Wild-type (WT» left) 
and 7055 mutant (right) gpl20-tnjnc polypeptides were labeled with 
P'S]methionine at 60 h following transfection of cells with gpl20- 
trunc expression plasniids. Cell supematants containing the se- 
creted gpl20 proteins were incubated with detergent lysates of CHO 
CD4"^ cells contaim'ng solubilized CE>4 receptor for 1 h at 4^ to 
allow the association of gpl20 and CD4. Eqiial portions of the 
reaction mixture were then analyzed by immunoprecipitation with 
the following antisera: control human serum L (N). AIDS patient 
RJ1370 (AIDS) (kindly provided by J. Groopman). OKT4 monoclo- 
nal antibody (Ortho), or OKT4A monoclonal antibody (Ortho). The 
position of gpl20-trunc is indicated at the right. 
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TABLE 1. InfccUvity of 7055 region point mutants 



Designation' 


Amino acid 
sequence* 


Syncytium 
formation'' 


Virion- 
associated RT*' 


pNL4<3 (wUd type) 


LLNGSL 


+ 


+ 


p7055 


LLQGSL 






p7055D 


LL-GSL 






p7052 


LDNGSL 






p7058 


LLNCSL 







" Numbers in mutant designations refer to the first nucleotide of the 
mutated codon based on the GenBank proviral nucleotide sequence of the 
LAV isolate (38). 

* Amino acid changes are underlined; deletion is shown with a dash. 

^ Data arc from infection of A3 .01 cells during a 37-day period for pNL4-3. 
p703S. p70S2t and p7058: p7055D results were obtained from cocultures over 
a 34-day period. 



CD4 on the surface of lymphocytes (34) and HeLa cells (23). 
Since the functional defect resulting from the 7055 mutation 
did not appear to affect CD4 binding, the ability of the 
mutant env gpl20 to form syncytia was assayed with recom- 
binant vaccinia virus (22) as described in Materials and 
Methods. The HeLa cell line T4^5 was infected with 
VPE«nv4-3, VPEtf/iv7055, VPEenv?, and wild-type vaccinia 
virus. Similar numbers of giant syncytia were observed in 
cultures infected with recombinant vaccinia constructions 16 
to 24 h postinfection (data not shown). Cells infected with 
wild-type vaccinia virus did not form syncytia. 

Adititional point mutations in the 7055 region of the HIV env 
gene. The defective phenotype resulting from the position 
7055 substitiition suggested Uiat the asparagine codon was 
critical for HIV infectivity. To explore this possibility fur- 
ther» oligonucleotide-directed mutagenesis was used to de- 
lete this codon. No evidence of a spreading viral infection 
was detected with progeny virions, containing this deletion 
(Table 1). Since this codon lies within a highly conserved 
region of the HIV env gene, we next wanted to determine 
whether a larger domain which encompassed the 7055 site 
might be involved in viral infectivity. Nucleotide changes 
were therefore introduced into the two codons adjacent to 
Asn-7055 (Table 1). The resulting two mutant proviral DNAs 
(7052 and 7058) also failed to elicit a spreading infection 
(Table 1). Immunoblotting of transfected SW480 cells indi- 
cated that these mutations did not affect the synthesis and 
processing of HIV proteins (data not shown). 

Reversion of the 7055 HTV env mutant in tissue culture. 
During the course of several long-term cocultures, set up to 
monitor niarker rescue of the 7055 mutant, the appearance of 
infectious virions was noted in some of the p7055 DNA 
controls. Of nine extended SW480 transfections-A3.01 co- 
cultivations with p7055 DNA, three resulted in particle- 
associated RT activity 26 to 35 days following the introduc- 
tion of the mutant proviral DNA. Siiice RT activity was 
consistently present within 5 days in cocultures of the 
pNL4-3 infectious clone, the delayed appearance of RT 
suggested that 7055 revertants had emerged. RT activity was 
not detected during a 45-day observation period in the six 
negative cocultures. Selected samples and time points from 
the nine experiments are shown in Fig. 5A. 

To assess directly the infectivity of oriie of the putative 
revertants, supematants from p7055Rl (day 45, Fig. 5 A) and 
pNL4-3 (day 12, Fig. 5A) trarisfection-cocultiues were used 
to establish infections of the A3.01 lymphocytes as described 
in Materials and Methods. Each inoculum was normalized 
for particle number by using comparable amounts of RT 
activity. The p7055Rl virus exhibited RT activity (Fig. 5B), 
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syncytium formation, and killing of the A3.01 cells, although 
the onset of these activities was delayed relative to the 
pNL4-3 infection. 

The emergence of infectious virus from the long-term 
cocultures of the p7055 env mutant suggested that a rever- 
sion in the env gene had occurred. To verify the molecular 
basis for such a reversion, A3. 01 cells were infected with 
filtrates from 7055Rl-producing cells, and cloned uninte- 
grated proviral DNA was obtained by using a XWES vector, 
as described in Materials and Methods. Analytic restriction 
enzyme digestion of two iambdaphage clones revealed the 
presence of the 2.7-kb EcoRl-BamHl fragment containing 
HIV env gene sequences. These DNA fragments were 
introduced into the p7055 mutant clone, and the infectivity of 
the reconstructed proviral DNAs was examined. In both 
instances, replacement of the mutant EcoRl-BamHJ frag- 
ment with the analog from the revertant restored infectivity, 
indicating that the change(s) responsible for the revertant 
phenotype was present in this region of the viral genome. 

The complete nucleotide sequence of the 2.7-kb EcoRl- 
BamHl fragments from both revertant clones was deter- 
mined following their subcloning into M13 vectors mpl8 and 
mpl9. Sequencing of the revertant env segments demon- 
strated, the preservation of the original asparagine-to-glu- 
tamine mutation at position 7055 in both cases; however, 
new nucleotide changes were identified (Fig. 6) in each 
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FIG. 5. Detection and characterization of revertant virus from 
long-term cocultures of the 7055 mutant. (A) SW480 cells were 
transfected with pNL4-3 or p7055 plasmid DNA; A3 .01 cells (2 x 
10^) were added 24 h later, and RT assays were carried out on the 
culture supematants at the indicated times. The infectious virions 
detected in p7055Rl and p7Q55R2 represent the results of two 
experiments among nine conducted in which i'evertants appeared in. 
cocultures of the p7055 mutant. (B) RT activities in supematants of 
A3. 01 cells infected with HIV prepared from pNL4-3 (day 12, panel 
A) or p7055Rl (day 45, panel A) as described in the text. 
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FIG. 6. Nucleotide and amino acid substitutions identified in the revertant proviral clones. Conserved and variable portions of the HIV 
envelope are diagrammed as indicated in the legend to Fig. 1. The nucleotide and amino acid substitutions identified in the revertant proviral 
clones p7055RlA and p7655RlB are shown. The asterisk indicates the original asparagine-to-g!utamine substitution at position 7055 which 
was also present in both revertant clones. 



clone. One clone <p7055RlA) contained a single nucleotide 
substitution (a G to A change) in codon 131, leading to the 
substitution of an asparagine for a serine. The second clone 
(p7055RlB) contained three nucleotide changes, all of which 
mapped to the gpl20 coding region. The first, located within 
the leader sequence at codon 25, involved substitution of an 
A for a C in the first position and resulted in a codon change 
from leucine to isoleucine. The second was identical to the 
single change in the first revertant clone at codon 131 and 
caused the substitution of an asparagine for a serine. The 
final nucleotide change occurred at codon 391 and was a 
third-position base change of T to C, which did not alter the 
codon specifying isoleucine. 

DISCUSSION 

Although the env genes of different HIV isolates exhibit 
profound heterogeneity at both the nucleotide and deduced 
amino acid levels* several domains within both gpl20 and 
gp41 are highly conserved, suggesting that they mediate 
properties or functions common to all isolates. For example, 
a 102-amino-acid polypeptide, between residues 28 and 128 
of gp41, specifically reacts with sera from HIV antibody- 
positive individuals in enzyme-linked and immunoblotting 
assays (6). A more recent study (12) done with a series of 
synthetic oligopeptides localized an immunodominant do- 
main within a conserved portion of gp41 (codons 81 to 92) 
that is recognized by 100% of the sera from HIV-infected 
persons. Another property of all HIV isolates is the interac- 
tion between gpl20 and its cellular receptor, CD4. Using 
murine monoclonal antibodies and oligonucleotide-directed 
mutagenesis, Laskey et al. (19a) identified an epitope within 
the fourth conserved domain of gpl20 (41) that was involved 
in the binding to CD4. Deletions and a point mutation within 
this region (amino acids 397 to 439) resulted in significantly 
reduced binding. A feature of HIV infection of CD4* cells, 
thought to involve env proteins, is the formation of syncytia 
that commonly precedes the peak of RT activity. A mutant 
env gene containing three hydrophilic amino acid substitu- 
tions within the conserved hydrophobic N-terminus of gp41 
failed to induce syncytia following its introduction into 
CD4* HeLa cells (R. Willey, J. Felser, and J. Silver, 
unpublished observation), defining still another functionally 
important region of the HIV envelope. 



At present, the nature of both the structural and functional 
defect in the 7055 mutant is unknown. The results described 
in this paper indicate that the binding of the 7055 mutant 
gpl20 to the CD4 protein was indistinguishable from that of 
wild-type gpl20. Although the interaction of the mutant env 
protein with CD4 was not analyzed in competition experi- 
ments with a large excess of unlabeled mutant gpl20, the 
validity of the binding results is supported by two additional 
pieces of data. First, vaccinia virus recombinants expressing 
wild-type or 7055 mutant gpl20 elicited similar amounts of 
giant syncytia following infection of CD4 ^ HeLa cells. Since 
syncytium formation depends on the interaction of HIV 
envelope-expressing cells with cells containing CD4 protein 
on their surface (34), it must be concluded that the 7055 
mutant env protein binds to CD4 in a functionally relevant 
manner. Furthermore, two murine monoclonal antibodies to 
the HIV envelope have been obtained which react with 
epitopes mapping in the second conserved domain of gpl20 
(G. Nakamura and L. Lasky, manuscript in preparation). 
One of these binds to the very region of gpl20 encompassing 
the 7055 mutation. Significantly, neither of the monoclonal 
antibodies blocks the binding of wild-type gpl20 to CD4. 

The inability to initiate a productive infection by eliminat- 
ing the highly conserved potential N-linked glycosylation 
site at 7055 points to a critical role for an attached sugar 
group at this position of gpl20; however, no data presently 
exist demonstrating that the 7055 site is actually glycosylated 
in the mature gpl20. The results presented in Table 1, 
showing that mutations in codons adjacent to 7055 also 
rendered the virus noninfectious, suggest the existence of a 
domain (rather than a critical glycosylation site) within gpl20 
which is involved in an early step of viral infection. In this 
regard, the 11 residues surrounding the 7055 site are invari- 
ant among the 10 HIV isolates shown in Fig. 1. Our results 
suggest that the initiation of an HIV infection of cells 
may involve more than binding of the virus to its receptor. 
Subsequent modification(s) of the viral envelope, fusion of 
the viral and cellular membranes (36), penetration, endocy- 
tosis, and partial uncoating represent only some of the steps 
that must occur following adsorption. One or more of these 
steps may require an intact (or fiinctionally equivalent) 7055 
domain. 

An unexpected result from these studies was the identifi- 
cation of second-site revertants of the 7055 env mutant 



146 WILLEY ET AL. 

which occurred in tissue culture. Sequencing analyses of the 
two revertant env genes point to the codon 131 substitution 
as the functionally significant change. It is not obvious how 
the codon 131 alteration might compensate for the original 
7055 mutation. One possibility is that the introduction of an 
asparagine codon at this position provides a novel site for 
N-linked glycosylation, but this seems very unlikely since 
this change did not generate a canonical site. All reported 
sequences of HIV env genes contain either a serine or 
threonine at codon 131. The elimination of this residue might 
prevent O-linked glycosylation. Since serine and threonine 
can also serve as targets of phosphorylation, the substitution 
of asparagine at this position would preclude such a modifi- 
cation and its potential effect on gpl20. The effect of other 
changes at codon 131 of gpl20 are currently being evaluated. 

It is not presently clear in which type of cell (colon or 
lymphocyte) the reversions occurred. Several facts relevant 
to detection of the revertants should be noted. First, SW480 
colon carcinoma cells are capable of producing infectious 
HIV for longer than 60 days following transfection with 
pNL4-3 DNA when cultured in the absence of T4'*" lympho- 
cytes (K. Strebel and M. Martin, unpublished observation). 
This constitutive production of progeny particles most likely 
reflects the intrinsic sensitivity of SW480 cells to transfec- 
tion (1) coupled with the establishment of a low level^of 
chronic HIV infection of this cell line (2). Second, direct 
introduction of the 7055 proviral DNA into A3.01 celts by 
electroporation consistently failed to generate infectious 
virions even after 60 days. The cocultivation of virus- 
producing SW480 cells with susceptible A3.01 cells, how- 
ever, may have allowed the "selection" and amplification of 
the revertant infectious particles. Since the original 7055 
mutation involved nucleotide substitutions rather than mul- 
tiple codon deletions within the env gene, it is conceivable 
that the defect is conditional rather than absolute. In a milieu 
containing continuously produced 7055 mutant virus parti- 
cles, A3.01 cells may be inefificiently yet definitively in- 
fected, thereby establishing a replicative cycle which would 
include an obligate reverse transcription step that gave rise 
to revertant particles in cocultures. 

The results from this study have provided additional 
information about the role of the HIV envelope during viral 
replication. The interaction of envelope with T lymphocytes 
involves more than binding of virions to the CD4 receptor. 
Subsequent events associated with internalization of virus 
particles, including penetration and endocytosis, may be 
modulated by the portion of gpl20 defined by the 7055 
mutant. These findings raise the possibility that interfering 
HIV mutants or env gene products encompassing this seg- 
ment might compete with replication-competent virus, as 
described for the FV-4 resistance determinant (16) in the 
murine leukemia virus system. In addition, antibodies di- 
rected against this region of the HIV envelope might be 
useful in blocking virus infection. 

Although the region encompassing the 7055 site appears to 
be critical for HIV infectivity, specific residues within this 
area may not be absolutely required. The emergence of the 
second-site revertant virus indicates that a change in a 
different portion of gpl20 can partially compensate for the 
original 7055 mutation. The appearance of the revertant may 
involve a mechanism(s) that allows the survival and persis- 
tence of HIV. This could be especially important in the 
presence of an active immune response if viral progeny arose 
that were able to escape neutralization. Further studies into 
the potential significance of variability during HIV-induced 
disease are presently in progress. 



J. ViROL. 



ACKNOWLEIXSMENTS 

We thank Klaus Strebel and Kathleen Clouse for performing the 
immunoblotting. We also thank Gilbert Ashwell, Peter McPhie, and 
Eugenio Santos for helpful discussions concerning protein structure 
and function. 

LITERATURE CITED 

1. Adachi, A., H. E. Gendelman, S. Koenig. T. Folks, R. WlUey, A. 
Rabson, and M. A. Martin. 1986. Production of acquired inunu- 
nodeflctency syndrome-associated retrovirus in human and non- 
human cells transfected with an infectious molecular clone. J. 
Virol. 59:284-291. 

2. Adachi, A., S. Koenig, H. E. Gendelman, D. Daugherty, S. 
GattonJ-Celli, A. S. Fauci, and M. A. Martin. 1987. Productive 
persistent infection of human colorectal cell lines with human 
immunodeficiency vims. J. Virol. 61:209-213. 

3. Alizon, M., S. Wain-Hobson, L. Montagnkr, and P. Sonlgo. 
1986. Genetic variability of the AIDS virus: nucleotide sequence 
analysis of two isolates from African patients. Cell 46:6^74. 

4. Barr£-Sinoiissi, F., J. C. Cherman, R. Rcy, M. T. Nugeryre, S. 
Chamaret, J. Gniest, C. Dauguet, C. Axler-Blin, F. Brun- 
Vezinet, C. Rouzioux, W. Rosenbaum, and L. Montagnier. 1983. 
Isolation of a T-lymphocytropic retrovirus from a patient at risk 
for acquired immune deficiency syndrome (AIDS). Science 
220:868-871. 

5. Benton, D., and R. W. Davis. 1977. Screening Xgt recombinant 
clones by hybridization to single plaques in situ. Science 
196:180-182. 

6. Cabradilla, C. D., J. E. Groopman, J. Lanigan, M. Renz, L. A. 
Lasky, and D. J. Capon. 1986. Serodiagnosis of antibodies to the 
human AIDS retrovirus with a bacterially synthesized env 
polypeptide. Bio/Technology 4:128-133. 

7. Chaicrabarti, S., K. Brcchling, and B. Moss. 1985. Vaccinia virus 
expression vector: coexpression of ^galactosidase provides 
visual screening of recombinant virus plaques. Mol. Cell. Biol. 
5:3403-3409. 

8. Cboppin, P. W., and A. Scheid. 1980. The role of viral glyco- 
proteins in adsorption, penetration, and pathogenicity of vi- 
ruses. Rev. Infect. Dis. 2:40-61. 

9. Desai, S. M., V. S. Kalyanaraman, J. M. Casey, A. Srinivasan, 
P. R. Andersen, and S. G. Dcvare. 1986. Molecular cloning and 
primary nucleotide sequence analysis of a distinct human im- 
munodeficiency virus isolate reveal significant divergence in its 
genomic sequences. Proc. Natl. Acad. Sci. USA 83:8380-8384. 

10. Folks, T., S. Benn, A. Rabson, T. Theodore, M. D. Hoggan, M. 
Martin, M. Liglitfoote, and K. Sell. 1985. Characterization of a 
continuous T-cell line susceptible to the cytopathic effects of the 
acquired immunodeficiency syndrome (AIDS)-associated retro- 
virus. Proc. Natl. Acad. Sci. USA 82:4539-4543. 

11. GaUo, R. C, S. Z. Saiahuddin, M, Popovic, G. M. Schearer, M. 
Kaplan, B. F. Haynes, T. J. Palker, R. Redfield, J. Oleske, B. 
Safai, G. White, P. Foster, and P. D. Markham. 1984. Frequent 
detection and isolation of cytopathic retroviruses (HTLV-III) 
from patients with AIDS and at risk for AIDS. Science 
224:500-503. 

12. Gnann, J. W., Jr., J. A. Nelson, and M. B. A. OidstiMie, 1987. 
Fine mapping of an immunodominant domain in the transmem- 
brane glycoprotein of human immunodeficiency virus. J. Virol. 
61:2639-2641. 

13. Goff, S., P. Traktman, and D. Baltimore. 1981. Isolation and 
properties of Moloney murine leukemia virus mutants: use of a 
rapid assay for release of virion reverse transcriptase. J. Virol. 
38:239^248. 

14. HIrt, B. 1967. Selective extraction of polyoma DNA from 
infected mouse cell cultures. J. Mol. Biol. 26:364-369. 

15. Homma, M., and M. Onuchi. 1973. Trypsin action on the growth 
of Sendai virus in tissue culture cells. J. Virol. 12:1457-1465. 

16. Ikeda, H., and T. Odaka. 1983. Cellular expression of murine 
leukemia virus gp-70 related antigen on thymocytes of uninfec- 
ted mice correlated with Fv-4 gene-controlled resistance to 
Friend leukemia virus infection. Virology 133:65-76. 

17. Klatzmann, D., F. Barr^-Sinoussi, M. T. Nugeryre, C. Dauguet, 
E. Vilmer, C. Grisceili, F. Brun^Vezinet, C. Rouzioiix, J. C. 



Vol. 62, 1988 

Gluckman, J. C. Chermann, and L. Montagnier. 1984. Selective 
tropism of lymphadenopathy associated virus (LAV) to helper- 
inducer T lymphocytes. Science 225:59-62. 

18. Laemmli, U. K. 1970. Cleavage of structural proteins during Che 
assembly of the head of bacteriophage T4. Nature (London) 
227:680^5. 

19. Lane, H. C, and A. S. Fand. 1985. Immunologic abnormalities 
in the acquired immunodeficiency syndrome. Annu. Rev. Im- 
munol. 3:477-500. 

19a.Lasky, L. A., G. Nakamura, J. H. Smith, C. Fennie, L. Shimo- 
saki, E. Patzer, P. Berman, T. Gregory, and D. J. Capon. 1987. 
Delineation of a region of the human immunodeficiency virus 
type 1 (HIV-1) gpl20 glycoprotein critical for interaction with 
the CD4 receptor. Cell 50:975-985. 

20. Leder» P., D. TIerocier, and L. Enquist. 1977. Ek2 derivatives of 
bacteriophage lambda useful in the cloning of DNA from higher 
organisms: the Xgt WES system. Science 196:175-177. 

21. Levy, J. A., A. D. Hoflhian, S. M. Kramer, J. A. Lanois, J. M. 
Shimabukuro, and L. S. Oskiro. 1984. Isolation of lymphocyto- 
pathic retroviruses from San Francisco patients with AIDS. 
Science 225:840-842. 

22. Mackett, M., G. L. Smith, and B. Moss. 1984. General method 
for production and selection of infectious vaccinia virus recom- 
binants expressing foreign genes. J. Virol. 49:857-864. 

23. Maddon, P. J., A. G. Dalgkish, J. S. McDougal, P. R. Clapham, 
R. A. Weiss, and R. Axel. 1986. The T4 gene encodes the AIDS 
virus receptor and is expressed in the immune system and the 
brain. Cell 47:333-348. 

24. McDougal, J. S., M. S. Kennedy, J. N. SUgh, S. P. Cort, A. 
Mawte, and J. K. A. Nkholson. 1986. Binding of HTLV-III/LA V 
to T4'^ cells by a complex of the llOK viral protein and the T4 
molecule. Science 231:382-385. 

25. McDougal, J. S., A. Mawie, S. P. Cort, J. K. A. Nicholson, D. G. 
Cross, J. A. Scheppler-CampbeU, D. Hkks, and J. SUgh. 1985. 
Cellular tropism of the human retrovirus HTLV-III/LAV. 1. 
Role of T cell activation and expression of the T4 antigen. J. 
Immunol. 135:3151-3162. 

26. McDougal, J. S., J. K. A. Nicholson, D. G. Cross, S. P. Cort, 
S. M. Kennedy, and A. C. Mawlc. 1986. Binding of the human 
retrovirus HTLV-III/LAV/ARV/HIV to the CD4 (T4) molecule: 
conformaUon dependence, epitope mapping, antibody inhibi- 
tion, and potential for idiotypic mimicry. J. Immunol. 
137:2937-2944. 

27. Muesing, M. A., D. H. Smith, C. D. Cabradilla, C. V. Benton, 
L. A. Lasky, and D. J. Capon. 1985. Nucleic acid structure and 
expression of the human AIDSAymphadenopathy retrovirus. 
Nature (London) 313:430-458. 

28. Muesing, M. A., D. H. Smith, and D. J. Capon. 1987. Regulation 
of mRNA accumulation by a human immunodeficiency virus 
trans-activator protein. Cell 48:691-701. 

.29. Potter, H., W. Lawrence, and P. Leder. 1984. Enhancer-depen- 
dent expression of human K immunoglobulin genes introduced 



REGION OF HIV env CRITICAL FOR INFECTIVITY 147 



into mouse pre-B lymphocytes by electroporation. Proc. Natl. 
Acad. Sci. USA 81:7161-7165. 

30. Sanchez-Pescador, R., M. D. Power, P. J. Barr, K. S. Steimer, 
M. M. Stempien, S. L. Brown-Shimer, W. W. Gee, A. Ranard, 
A. Randolph, J. A. Levy, D. Dina, and P. A. Luciw. 1985. 
Nucleotide sequence and expression of the AIDS-associated 
retrovirus (ARV-2). Science 227:484-492. 

31. Sanchez-Pescador, R., and M. S. Urdea. 1984. Use of purified 
synthetic deoxynucleotide primers for rapid dideoxynucleotide 
chain termination sequencing. DNA 3:339-343. 

32. Sanger, F., S. Nicklen, and A. R. Coulson. 1977. DNA sequenc- 
ing with chain-terminating inhibitors. Proc. Natl. Acad. Sci. 
USA 74:5463-5467. 

33. Scheid, A., and P. W. Choppln. 1974. Identification of biological 
activities of paramyxovirus glycoprotiens. Activation of cell 
fusion, hemolysis, and infectivity by proteolytic cleavage of an 
inactive precursor protein by Sendai virus. Virology 57:47^ 
490. 

34. Sodroski, J., W. C. Goh, C. Rosen, K. Campbell, and W. A. 
Haseltine. 1986. Role of the HTLV-III/LAV envelope in syncy- 
tium formation and cytopathicity. Nature (London) 322:470- 
474. 

35. Stardch, B. R., B. H. Hahn, G. M. Shaw, P. D. McNeely, S. 
Modrow, H. Wolf, E. S. Parks, W. P. Parks, S. F. Josephs, R. C. 
Gallo, and F. Wong-Staal. 1986. Identification and characteriza- 
tion of conserved and variable regions in the envelope gene of 
HTLV-Ili/LAV, the retrovirus of AIDS. Cell 45:637-648. 

36. Stein, B. S., S. D. Gowda, J. D. Lifson, R. C. Penhallow, K. G. 
Bensch, and £. G. Engleman. 1987. pH-independent HIV entry 
into CD4-positive T cells via virus envelope fiision to the plasma 
membrane. Cell 49:659^-668. 

37. Sternberg, N., D. Tleroder, and L. Enquist. 1977. in vitro 
packaging of a X Dam vector containing EcoRl DNA fragments 
of Escherichia coli and phage PI. Gene 1:255-280. 

38. Strauss, E. C, J. A. Kobori, G. Siu, and L. E. Hood. 1986. 
Specific-primer-directed DNA sequencing. Anal. Biochem. 
154:353-360. 

39. Wain-Hobson, S., P. Sonlgo, O. Danos, S. Cole, and M. AUion. 
1985. Nucleotide sequence of the AIDS virus, LAV. Cell 40:9- 
17. 

40. Wigler, M., A. Pelllcer, S. Silverstein, R. Axel, G. Urisaub, and 
L. Chasin. 1979. DNA-mediated transfer of the adenine phos- 
phoribosyl transferase locus into mammalian cells. Proc. Natl. 
Acad. Sci. USA 76:1373-1376. 

41. Willey, R. L., R. A. Rutledge, S. DIas, T. Folkes, T. Theodore, 
C. £. Buckler, and M. A. Martin. 1986. Identification of con- 
served and divergent domains within the envelope gene of the 
acquired immunodeficiency syndrome retrovirus. Proc. Natl. 
Acad. Sci. USA 83:5038-5042. 

42. Zoller, M. J., and M. Smith. 1984. Oligonucleotide-directed 
mutagenesis: a simple method using two oligonucleotide prim- 
ers and a single-stranded DNA template. DNA 3:479-488. 



Journal of Viroix>gy, June 1992, p. 3306-331S 
0022-538X/92A)63306-10$02.00/0 

Copyright & 1992, American Society for Microbiology 



Vol. 66, No. 6 



Effects of Deletions in the Cytoplasmic Domain on Biological 
Functions of Human Immunodeficiency Virus Type 1 

Envelope Glycoproteins 

DANA H. GABUZDA,^ ANDREW LEVER.^f ERNEST TERWILLIGER.^ and JOSEPH SODROSKI^-^* 

Division of Human Retrovirology, Dana-Farber Cancer Institute, 44 Binney Street, ^ and Departments of Pathology^ and 

Neurology,^ Harvard Medical School, Boston, Massachusetts 02115 

Received 31 December 199iyAccepted 25 February 1992 

The role of the cytophismic domaiii of the human Immunodeficiency virus type 1 (HIV-1) envelope 
glycoproteins in virus replication was investigated. Deletion of residues 840 to 856 at the carboxyl terminus of 
gp41 reduced the efficiency of virus entiy during an early step in the virus life cycle between CD4 binding and 
formation of the DNA provinis without aifecting envelope glycoprotein ^thesis, processing, or syncytium- 
forming ability. Deletion of residues amino terminal to residue 846 vras associated with decreased stability of 
envelope glycoproteins made in COS-1 cells, but this phenotype was cell type dependent. The cytoplasmic 
domain of gp41 was not required for the incorporation of the HIV-1 envelope glycoproteins into virions. These 
results suggest that the carl>o3Eyl terminus of the gp41 cytoplasmic domain plays a role In HIV-l entiy other 
than receptor binding or membrane fkision. The cytoplasmic domain of gp41 also affects the stability of the 
envelope gtycoprotein In some cell types. 



The presence of a long intracytoplasmic domain at the 
carboxyl terminus of the gp41 transmembrane protein of the 
human immunodeficiency virus type 1 (HIV-1) distinguishes 
the HIV-1 envelope glycoprotein from that of most retrovi- 
ruses (4, 12, 34). The transmembrane proteins of the type C 
and type D retroviruses terminate within 50 amino acids 
carboxy terminal to the transmembrane domain (34). The 
presence of a cytoplasmic domain extending more than 100 
amino acids beyond the transmembrane region in HIV-1, the 
simian immunodeficiency virus (SFV), visna virus, and the 
equine infectious anemia virus suggests a specific function 
for this region in the life cycle of the lentivirus subfamily of 
retroviruses (12, 34). HIV-2 and SIV are frequently trun- 
cated just after the membrane anchor domain, but these 
truncated forms appear to result from selection during tissue 
culture propagation in human cells (4, 19, 20, 22). The 
natural form of the SIV transmembrane protein is the 
full-length 41-kDa protein (20, 22). 

The HIV-1 envelope glycoprotein is initially made as a 
160-kDa glycosylated precursor that is cleaved to form the 
gpl20 exterior and gp4I transmembrane subunits (7, 8, 40). 
The gpl20 glycoprotein determines the tropism of HIV-1 for 
specific target cells by binding to the CD4 molecule (6, 21, 
27). The gp41 transmembrane glycoprotein contains an 
extracellular domain that is required for membrane fusion 
and noncovalent association with the gpl20 glycoprotein, a 
hydrophobic transmembrane region that anchors the gpl20- 
gp41 complex in the cell or virion lipid bilayer, and an 
intracytoplasmic domain of unknown function (2, 11, 14, 24, 
35). Tlie C3^oplasmic domain contains two highly conserved 
segments with the potential to form amphipathic a helices 
and may form a secondary association with the lipid bilayer 
(1, 14, 15, 39). 

The cytoplasmic domain of gp41 is required for efficient 
mV-l replication, but the function of this domain in the 
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virus life cycle is unknown (9, 26, 37). The cytoplasmic 
domain is not required for syncytium formation or binding to 
CD4 (7, 8, 24) but may modulate intracellular transport and 
processing of gpl60 in some cell types (16). Whether the 
cytoplasmic domain is involved in virion assembly, for 
example, by facilitating the incorporation of envelope glyco- 
proteins into virions, is unknown. 

To investigate the role of the gp41 cytoplasmic domain in 
virus replication, the effect of deletions in this region on the 
synthesis, processing, cell surface expression, incorporation 
into virions, and biological function of the HIV-1 envelope 
glycoproteins was investigated. 

MATERIALS AND METHODS 

Plasmids. The pSVIIIenv plasmid expresses the HIV-1 
env and rev genes of the HXB2 strain (31) under the control 
of the HIV-1 long terminal repeat (LTR) (17, 24). The 
pHXBAenvCAT plasmid contains an HIV-1 provinis with an 
in-frame deletion from the BglVl to BgUl sites (nucleotides 
6620 and 7200) of the sequence of Ratner et al. (32) in the env 
gene and a chloramphenicol acetyltransferase (CAT) gene 
replacing the nef gene (17, 36). The pHXBAenvAEcoCAT 
plasmid, derived from pHXBAenvCAT, contains an addi- 
tional deletion from £c£>RI to £coRI (nucleotides 4231 to 
5325) overlapping the pol gene. These plasmids contain a 
simian virus 40 origin of replication. Envelope glycoprotein 
deletion mutants were made by Bal 31 exonuclease digestion 
at either the ^amHI site or the ATioI site (nucleotide 8053 or 
8476, respectively) (24, 32, 37). The env open reading frame 
was restored when necessary either by creating blunt ends 
prior to religation or by inserting Clal linkers. The A(767- 
856) mutant was made by inserting an EcoRl linker at an 
Mnil site to create a frameshift at amino acid 767. Oligonu- 
cleotide-directed mutagenesis was used to create the A(796- 
803), A(796-804), A(846-856), A(85 1-856), and A(854-856) 
deletions in an EcoRhio-Xhol (nucleotides 5325 to 8476) 
fragment of HXB2 subcloned into the pBluescript plasmid 
(Stratagene) (25). All plasmids were sequenced in the region 
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of the introduced mutation. The pHXB2A(840*856) provirus 
was made by subcloning the deleted BamHl-io-Xhol frag- 
ment into the pHXB2 plasmid (31). The n?v expressor 
plasmid pSVIIIenvAKS contains a fQ?nl-io-Stul (nucleotide 
5926 to 6411) out-of-frame deletion in the envelope gene and 
expresses HIV-1 rev but not the envelope glycoproteins. 

Virus replication studies in Jurkat lyinpho<7tes. Jurkat T 
lymphocytes were transfected by the D£A£-dextran method 
(30) with 10 Jig of pHXB2 or pHXB2A(840-856). Following 
transfection, cultures were maintained in RPMI 1640 plus 
10% fetal calf serum with daily medium changes. Reverse 
transcriptase activity of pelleted virions was measured as 
previously described (33). 

Radioimmunoprecipitation. For measurement of viral pro- 
tein expression in infected Jurkat cells, 5 x 10^ cells were 
metabolically labeled with 100 fiCi (each) of [-^^SJcysteine 
and [^^S]methionine per ml for 16 h. Virions were harvested 
for immunoprecipitation by centrifugation of supematants 
for 10 min at 1,500 x ^ to remove cell debris and ultracen- 
trifugation at 12,000 x g for 1 h. Cells or virions were lysed 
in RIPA lysis buffer, and HIV-1 proteins were immunopre- 
cipitated with AIDS patient serum and analyzed by sodium 
dodecyl sulfate-polyacrylamide gel electrophoresis (SDS- ' 
PAGE) (23). The human monoclonal antibody (50-69) to 
gp41 was obtained from the National Institutes of Health 
AIDS Research and Reference Reagent Program and was a 
donation of Susan ZoUa-Pazner. For measurement of enve- 
lope glycoprotein expression in transfected COS-1 cells, 3 x 
l(r cells were metabolically labeled with 100 \iCi of [^^S]me- 
thionine per ml for 16 h at 48 h posttransfectton with 5 ^g 
each of an envelope expressor plasmid and the rev expressor 
plasmid by the DEAE-dextran method (5). The labeled cells 
were lysed in 10 mM Tris-HQ (pH 7.4)-150 mM Naa-5% 
Triton X-114 at 0"C, and the detergent phase containing 
integral membrane proteins was isolated by phase separation 
at 30°Q dissolved in RIPA lysis buffer, clarified, and immu- 
noprecipitated as previously described (3, 11). Virions pro- 
duced in COS-1 cells were pelleted as described above 
following labeling of transfected COS-1 cells for 16 h and 
chasing for 3 h with medium containing excess unlabeled 
methionine and cysteine. 

PCR assay for determining the efficiency of provirus forma- 
tion following acute infection. Fresh virus stocks were pre- 
pared from supematants of infected Jurkat cell cultures by 
centrifugation at 1,500 x g for 10 min and filtration (0.45- 
|xm-pore-size filter) to remove cell debris. Virus stocks were 
treated with DNase (Worthington) (2 \Ji$/inl) for 20 min at 
room temperature to eliminate plasmid DNA contamination 
(41). Heat-inactivated virus control supematants were incu- 
bated for 90 min at 60°C. Jurkat cells (10^) were incubated 
with HXB2 or HXB2A(840-856) for 7 h at 3r*C. After 7 h, 2 
X 10^ cells were harvested by centrifugation, washed twice, 
lysed in 50 jjlI of 0.2% Nonidet P-40, and boiled for 15 min. 
The cell lysate was clarified by centrifugation at 12,000 x g 
for 2 min and stored at — 20**C. For polymerase chain 
reactions (PCR)» the DNA in 2.5 p,l of the cell lysate (100,000 
cell equivalents) was amplified by using the HIV-1 LTR 
RAJ5 primers 5' GGCTAACTAGGGAACCCACTG 3' and 
5' CTGCTAGAGATnTCCACACTGAC 3', which are sim- 
ilar to the AA55 and M667 primer pair previously described 
(41). PCR reactions were performed according to the manu- 
facturer's instmctions (Perkin-Elmer Cetus Corp.) for 33 
cycles of 94*C for 1 min, 56'*C for 1 min, and 72*C for 1 min. 
The PCR products were analyzed by electrophoresis on 2% 
agarose gels. 

Replication complementation assay for measuring the rep- 
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licative potential of mutant envelope glycoproteins. A tran- 
sient M/if-complementation assay was used to assess the 
replicative potential of mutant envelope glycoproteins in a 
single round of vims replication (17). Cell-free vims trans- 
mission was assessed by cotransfecting COS-1 cells by the 
DEAE-dextran method (5) with 5 pig of an envelope expres- 
sor plasmid and 5 jxg of pHXBAenvCAT. At 48 to 72 h after 
transfection, the COS-1 cell supematants were filtered (0.45- 
jj.m-pore-size filter) and the reverse transcriptase activity 
was measured (33). Equivalent reverse transcriptase units of 
cell-free supematants were added to 5 x 10^ Jurkat T 
lymphocytes. The Jurkat cells were incubated for 48 to 72 h 
and then assayed for CAT activity. In a similar trans- 
complementation assay, both cell-free and cell-to-cell vims 
transmission were measured by direct transfection of Jurkat 
T lymphocytes as previously described (17). 

Syncytium formation assays. The ability of the mutant 
glycoproteins to mediate the formation of syncytia was 
assessed in COS-1 cells by cotransfection of 5 ^.g of the 
envelope expressor plasmid and 5 M.g of the rev expressor 
plasmid followed by cocultivation with CD4-positive SupTl 
lymphocytes for 6 h at 60 h posttransfection (38). The 
syncytium-forming abilities of the mutant envelope glyco- 
proteins were assessed in Jurkat-to/ cells by cotransfecting 8 
\Lg of the envelope-expressing plasmid and 8 p-g of the rev 
expressor plasmid into Jurkat tat cells and scoring syncytia 
at 60 h posttransfection (11). 

Expression of envelope glycoproteins on the cell surface, 
COS-1 cells cotransfected with an envelope expressor plas- 
mid and the rev expressor plasmid were metabolically la- 
beled with [^S]methionine for 16 h at 48 h posttransfection. 
The intact labeled cells were washed twice with ice-cold 
phosphate-buffered saline (PBS) containing 2% heat-inacti- 
vated fetal calf serum, incubated with a 1:100 dilution of 
AIDS patient semm reactive with the envelope glycopro- 
teins at 4'*C for 30 min, rinsed twice with PBS containing 2% 
fetal calf semm, and lysed in RIPA lysis buffer containing 2.5 
ixg of unlabeled gpl20 (American BioTechnologies, Inc.) at 
4'*C. The cell lysates were clarified by ultracentrifugation, 
and bound envelope glycoproteins were immunoprecipitated 
by incubation with protein A-Sepharose as described above. 

Soluble-CD4 inhibition of syncytium formation and virus 
replication. The effect of soluble CD4 on vims entry was 
measured by producing the recombinant HXBAenvCAT pro- 
vimses in COS-1 cells as described above and preincubating 
equivalent amounts of virus as determined by measuring 
reverse transcriptase activity with different concentrations of 
full-length soluble CD4 (American BioTechnologies) for 1 h at 
37*C prior to infection of Jurkat lymphoc^es (38). 

RESULTS 

Effects of a deletion at the cart>oxyl terminus of gp41 on 
HTV-l replication in Jurkat lymphocytes. A mutation which 
deletes residues 840 to 856 (28) at the carfooxyl terminus of 
gp41 was introduced into an infectious HIV-1 provirus on 
plasmid pHXB2. To compare the replication rate of this 
mutant vims with that of the wild-type vims, the pHXB2 and 
pHXB2A(840-856) plasmids were transfected into Jurkat 
cells and vims replication was monitored by measuring 
reverse transcriptase activity in the culture supematants. 
The Jurkat cultures transfected with the pHXB2A(840-856) 
plasmid demonstrated slowed vims replication, with a 5-day 
lag in the time required to reach peak reverse transcriptase 
activity compared with that of the wild-type vims (Fig. 1), 
which is consistent with previous studies (9, 26, 37). When 
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FIG. 1. Replication of HXB2 and HXB2A(840>856) viruses in 
Jurkat cells. Curves show reverse transcriptase activity in superna- 
tants of Jurkat cell cultures transfected with 10 ^.g of pHXB2 or 
pHXB2A(840-8S6) DNA. The results are typical of those seen in at 
least three independent experiments. 



similar levels of virus production were achieved, the number 
of syncytia formed in the pHXB2- and pHXB2A(840-856)- 
transfected cultures and the viability of these cultures were 
indistinguishable (data not shown). 

To examine the effects of a deletion at the carboxyl 
terminus of gp41 on gag and env protein synthesis, process- 
ing, secretion, and incorporation into virions in Jurkat cells, 
steady-state and pulse-chase labeling experiments were per- 
formed at a time when virus production in HXB2- and 
HXB2A(840-856)-infected cultures was similar. At 14 to 21 
days after transfection, equal numbers of viable cells were 
metabolically labeled with {^^S]cysteine and [^^sjmethionine 
for 16 h and viral proteins were immunoprecipitated from 
cell lysates, supematants, and pelleted virions. The steady- 



state levels of viral gag and env proteins were similar in the 
pHXB2- and pHXB2A(840-856)-transfected cell lysates, su- 
pematants, and pelleted virions (Fig. 2A, B, and C and data 
not shown), although slightly less cell-associated gpl60 was 
observed in the pHXB2A(840-8S6)-transfected culture (Fig. 
2A). Pulse-chase analysis of infected Jurkat cell cultures 
demonstrated similar synthesis, processing, and release of 
envelope glycoproteins (Fig. 2D and E). However, the level 
of the mutant gpl60 in cell lysates was approximately 50% of 
the wild-type level after a 2-h or longer chase (Fig. 2D). This 
decrease in the level of gpl60 at the longer chase times was 
not accompanied by an increase in the level of cell-associ- 
ated gpl20 glycoprotein. The gpl20 glycoprotein was re- 
leased into the cell supematants after a 1-h chase in both the 
wild-type and mutant cultures (Fig. 2E). These results sug- 
gest that deletion of residues 840 to 856 at the carboxyl 
terminus of gp41 results in decreased stability of the gpl60 
glycoprotein in infected Jurkat cells. Nonetheless, the syn- 
thesis, processing, secretion, and incorporation into virions 
of the mutant HXB2A(840-856) envelope glycoproteins ap- 
pear to be comparable to those of the wild-type glycopro- 
teins. 

Deletion of the carboxyl terminus of gp41 reduces the 
efficiency of virus entry. To determine whether the cytoplas- 
mic domain of gp41 has an effect on the efficiency of virus 
entry, HXB2 or HXB2A(840-856) virus stocks were used to 
infect Jurkat cells and the efficiency of HIV-1 provirus 
formation at 7 h after acute infection was measured by PCR 
amplification of the proviral DNA. Uninfected Jurkat cells 
and Jurkat cells incubated with heat-inactivated virus con- 
trols were used as controls (41). The HIV-1 LTR R/U5 
primer pair used for PCR amplification detects the first 
region of the viral DNA made during reverse transcription 
and should detect virtually all HIV-1 DNA made, including 
partial DNA transcripts (41). Mapping of restriction sites 
within the amplified DNA product with HindllU Hinflh 
XhoW^ and Mbol restriction enzymes, which cut at internal 
sites, confirmed that the 140-bp fragment amplified by these 
primers was the correct HIV-1 LTR fragment (data not 
shown). Following acute infection of Jurkat cells, formation 
of the HXB2A(840-856) provirus as detected by PCR ampli- 
fication of viral DNA was significantly reduced compared 




FIG. 2. HIV-1 env ^n^gag proteins in HXB2- and HXB2A(840-8S6)-infected Jurkat cells and incorporated into virions. Immunoprecip- 
itated cell lysates (A, B. D), virions (C), and supematants (E) were assayed at days 14 to 21 following transfection of Jurkat cells with no DNA 
(lanes 1), 10 \x.g of pHXB2 (lanes 2), or 10 \Lg of pHXB2A(840-856) (lanes 3). Cells were metabolically labeled with [^*S)cysteine and 
I S]methionine for 16 h (A, B, Q or by a pulse-chase protocol (D, E). The gpl60. gpl20, and gp41 envelope glycoproteins and the p55, p24, 
and pl7^a^ products are marked. (A) Immunoprecipitated cell lysates with AIDS patient serum analyzed by 4 to 12% gradient SDS-PAGE; 
(B) immunoprecipitated cell lysates with a human monoclonal antibody (50-69) that rec:ognizes gp41 analyzed by 12.5% SDS-PAGE; (C) 
immunoprecipitated pelleted virions with AIDS patient serum analyzed by 12.5% SDS-PAGE; (D and E) pulse-chase analysis of cell lysates 
(D) and supematants (E) immunoprecipitated with AIDS patient serum and analyzed by 4 to 12% gradient SDS-PAGE. Cells were labeled 
for 10 min and chased for the indicated times (in hours) with medium containing excess unlabeled cysteine and methionine. 
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FIG. 3. Detection of HIV-1 provirus formation in Jurkat cells by 
PGR amplification of viral DNA following acute infection with 
HXB2 or HXB2A(840-856) viruses. PGR amplification was per- 
formed by using the HIV-1 LTR RAJS primer pair M667-AA55 (41) 
for 33 cycles to amplify DNA from 100,000 Jurkat cell equivalents 
haivested 7 h after acute infection with 4,000 (A), 20,000 (B), or 
100,000 (G) cpm of reverse transcriptase units of no vims (lane 1), 
heat'inactivated HXB2 (lane 2), HXB2 (lane 3), heat-inactivated 
HXB2A(840-8S6) (lane 4), or HXB2A(840^56) (lane S) per ml. The 
140-bp PGR product was analyzed by electrophoresis on 2% agarose 
gels and visualized with ethidium bromide. A DNA standard {PhiX- 
HaelU digest) is shown in lane M. 



with that of wild type (Fig. 3). As shown in Fig. 3, the effect 
of the mutation on the efficiency of vims entry was most 
marked at the lower multiplicities of infection (4,000 and 
20,000 cpm/ml). These results suggest that the carboxyl 
terminus of gp41 has an effect on the efficiency of virus entry 
during an early stage in the virus life cycle prior to formation 
of the DNA provirus. 

Mutational analysis of sequences in the cytoplasmic domain 
of gp41 required for replication. The cytoplasmic domain of 
gp41 contains a strongly hydrophilic region (amino acids 724 
to 745) followed by a region of alternating hydrophobicity 
and hydrophilicily (amino acids 746 to 856) (4, 25). The latter 
region also contains two segments with the potential to form 
amphipathic a helices (amino acids 770 to 794 and 824 to 856) 
(1, 39). A series of mutants containing deletions in the 
cytoplasmic domain was constructed in the pSVIIIenv en- 
velope expressor plasmid (Fig. 4). Because many of these 
mutants overlap the tat or rev second open reading frames, 
rev or rev plus tat was supplied in trans by cotransfecting 
either the rev expressor plasmid or pHXB2AenvCAT, re- 
spectively. Of note, the truncation in the A(726-856) mutant 
is similar to the truncation found in some isolates of SIV and 
HIV-2 (4, 10). The effect of the A(814-856) mutation in the 
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FIG. 4. Diagram of the mutant HlV-1 gp41 transmembrane glycoproteins. The wild-type HIV-1 gp41 transmembrane glycoprotein is 
depicted at the top. with the hydrophobic transmembrane region designated TM. The cross-hatched regions are highly conserved among 
HIV-1 isolates (28). Regions of potential amphipathic a helices at residues 770 to 794 and 824 to 8S6 (39) are indicated by coils. The positions 
of the second open reading frames of tat and rev are shown. 
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Phenotypes of gp41 cytoplasmic domain mutants" 
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69 
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63 


A754-797 


31 


10 


29 


0.9 


20 


33 


A754-856 


6 


9 


7 


1.0 


71 


106 


A767-856 


6 


12 


32 


1.5 


30 


92 


A796-«03 


50 


14 


56 


1.1 


72 


ND 


A796-804 


4 


7 


33 


1.2 


65 


ND 


A814-856 


24 


17 


57 


1.1 


47 


126 


A840-8S6 


34 


24 


48 


1.1 


63 


135 


A846-8S6 
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1.0 
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A8S1-856 


SO 


28 
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0.9 


80 


122 


A8S4-856 
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32 
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1.0 


80 
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" Data reported represent the means of at least two or three independent experiments. ND, not done. 

^ The number of the mutant refers to the amino acid residues of the HXB2 strain of HIV-1, where 1 is the initial methionine (31). 

' Percentage of the wild-type value. Values for replication complementation represent the CAT conversion in target Jurkat cells for the mutant envelope 
glycoprotein relative to the value for the wild type. Values shown for cell-associated gpl20 were determined by gel densitometiy of radioimmunoprecipitates 
analyzed by SDS-PAGE and autoradiography. 

" A measure of the conversion off mutant gpl60 to gpl20 relative to that of the wild-type glycoprotein. The amounts of gpl60 and gpl20 were determmed 
by gel densitometiy of autoradiograms of SDS-PAGE gels. The processing index was calculated by the following formula: processing index » ((total gpl20),„uta„| 
X (gpldOK.ua (ypel/[(8Pl60)„ut».i X (total gpl20)^M 



context of an infectious HIV-1 provirus has already been 
described elsewhere (37). 

To identify sequences in the cytoplasmic domain that are 
required for efficient viral replication, the ability of the 
mutant envelope glycoproteins to support cell-free virus 
transmission was determined by using a transient comple- 
mentation assay (17). In this assay, COS-1 cells cotrans- 
fected with an envelope expressor plasmid and an envelope- 
defective provirus expressing the CAT gene produce 
recombinant virions that are used to infect Jurkat T lympho- 
cytes (17). The efiiciency of cell-free transmission of the 
recombinant viruses is determined by measuring CAT activ- 
ity in the infected Jurkat culture. Most deletions within the 
carboxy-tcrminal 160 amino acids of gp41 reduced replica- 
tion complementation of cell-free virus transmission to 4 to 
69% of the wild-type value (Table 1). To determine whether 
similar regions are required for replication in Jurkat cells 
under conditions in which most of the virus transmission 
occurs by cell-to-cell spread, the abilities of the mutant 
glycoproteins to supp>ort a single round of virus replication 
were examined by using a similar transient complementation 
assay in transfected Jurkat cell cultures (17). The hydrophilic 
region extending from amino acids 726 to 751 and the 
immediately carboxyl region (amino acids 754 to 769) were 
relatively dispensable for replication in Jurkat cells (Table 
I). Otherwise, the regions in the cytoplasmic domain re- 
quired to support the transmission of viruses produced in 
COS-1 cells and Jurkat cells were similar. These results 
suggest that the function of some regions of the cytoplasmic 
domain depends on the cell type in which the envelope 



glycoproteins are expressed or on the mode of virus trans- 
mission. 

Deletions in the cytoplasmic domain of gp41 are associated 
with decreased envelope glycoprotein stability in COS-1 cells. 
To determine whether mutations in the cytoplasmic domain 
of gp41 alter envelope glycoprotein synthesis or processing 
in COS-1 cells, mutant envelope glycoproteins expressed in 
COS-1 cells were immunoprecipitated with AIDS patient 
serum and analyzed on SDS-polyacrylamide gels. The 
steady-state levels of mutant glycoproteins with deletions 
within residues 846 to 856 at the carboxyl terminus were 
similar to those of the wild-type glycoproteins (Fig. 5 A and 
Table 1). The steady-state levels of mutant glycoproteins 
containing deletions amino terminal to residue 846 were 
reduced (Fig. 5A and Table 1). Two mutants containing 
deletions overlapping the transmembrane domain (amino 
acids 701 to 777 and 704 to 751) exhibited a processing defect 
(Fig. 5A and Table 1). The levels of wild-type and mutant 
gpl20 in the transfected COS-1 cell supematants correlated 
with the levels of cell-associated gpl20 (data not shown). 
When labeling COS-1 cells for 16 h was followed by a 3-h 
chase with excess cold methionine and cysteine, the levels of 
mutant envelope glycoproteins that were low in the steady 
state were further reduced (Fig. SB and Table 2). This result 
suggests that deletion of residues amino terminal to amino 
acid 846 results in decreased envelope glycoprotein stability 
in COS-1 cells. 

The cytoplasmic domain of gp41 is not required for incor- 
poration of envelope glycoproteins into virions. To determine 
whether the cytoplasmic domain of gp41 is required for the 
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FIG. 5, Wild-type and mutant HIV-1 envelope glycoproteins in COS-1 cell lysates (A, B) and virion pellets (C). Cell lysates and virions 
were immunoprecipitated with AIDS patient serum and analyzed by 10% (A, B) or 12.5% (C) SDS-PAGE. (A) Steady-state levels of mutant 
envelope glycoproteins expressed in COS-1 cells transfected with 5 ^.g of the rev expressor plasmid plus no DNA (lane 1), S |A,g of wild-type 
envelope expressor plasmid (lane 2), or 5 M.g of the mutant envelope expressor plasmids (lanes 3 to 12). (B) Expression of wild-type (lanes 
1 and 3 to 8) and mutant (lanes 9 to 14) envelope glycoproteins in COS-1 cells after 16 h of labeling and 3 h of chase. COS-1 cells were 
transfected with 5 |ig of the wild-type envelope expressor plasmid (lane 1), pHXBAenvAEcoCAT (lane 2), or pHXBAenvAEcoCAT plus 0.1, 
0.25, 0.5, 1.0, 2.5, or 5.0 \Lg of the wild-type envelope expressor plasmid (lanes 3 to 8, respectively) or 5 \ig of the mutant envelope expressor 
plasmids (lanes 9 to 14). (C) Virion-associated envelope glycoproteins. The lanes are the same as in panel B. The positions of the gpl20 
envelope glycoprotein and p55, p41, and pTAgag products are marked. 



TABLE 2. Effect of mutations in the gp41 cytoplasmic domain 
on virion-associated envelope glycoproteins" 



Mutant* 


Relative cell- 
associated 
gpl20 after 
3-h chase^ 


Virion-associated 
gpl20 
(gpl2CVp24r 


Virion 
association 
index' 


Wild type (0.1 jtg)" 


9 


15 


1.7 


Wild type (0.25 »ig) 


16 


22 


1.4 


Wild type (0.5 jtg) 


22 


46 


2.0 


Wild type (1.0 ^.g) 


39 


35 


0.9 


Wild type (2.5 \x.g) 


56 


57 


1.0 


Wild type (5.0 ^g) 


100 


100 


1.0 


A701-771 


4 


25 


5.6 


A704-751 


18 


25 


2.6 


A726.856 


4 


15 


4.0 


A728-745 


58 


70 


1.4 


A754-788 


12 


20 


1.7 


A754-856 


4 


4 


ND 


A767-856 


15 


32 


2.6 


A814-856 


39 


33 


LO 


A840-8S6 


62 


42 


0.8 


A846-856 


100 


76 


0.8 


A851-856 


84 


72 


0.9 


A854-856 


104 


83 


0.8 



" Data reported represent the means of at least two independent experi- 
ments. 

* The numbers in parentheses refer to the amount of wild-type envelope 
expressor plasmid DNA used for transfection. For all of the mutant glyco- 
proteins, S of plasmid DNA was used. 

' Percentage of the wild- type value at 5 iig of wild-type transfected plasmid 
DNA. The values shown were determined by gel densitometiy of radioimmu- 
noprecipitates analyzed by SDS-PAGE and autoradk^aphy. 

* calculated to normalize the level of virion-associated gpl20 for the level 
of cell-associated gpl20. The virion-association index was calculated by using 
the formula ((virion-associated gpl^)muunt ^ (p24)wiid type ^ (cell-associated 
gpl20)wiid iypel/l(p24)„,„^„, X (virion-associated gpl20)«^M typ« (cell-associ- 
ated gpl20U„„„,l. ND, not done. 



efficient incorporation of HIV-1 envelope glycoproteins into 
virions, the effect of deletions on the level of virion-associ- 
ated envelope glycoproteins was examined. COS-1 cells 
cotransfected with an envelope expressor plasmid and 
pHXBAenvAEcoCAT were used to produce recombinant 
virions. Pelleted virions were immunoprecipitated with 
AIDS patient serum and analyzed on SDS-polyaciylamide 
gels. The amount of transfected wild-type envelope expres- 
sor plasmid DNA was incrementally increased from 0.1 to 
5.0 M.g to allow comparison of the level of wild-type and 
mutant virion-associated gpl20 under conditions in which 
the levels of cell-associated envelope glycoproteins were 
similar (Table 2 and Fig. 5B and C). A virion association 
index was calculated to normalize the level of virion-associ- 
ated gpl20 for the level of cell-associated gpl20 (Table 2). 
Deletions amino terminal to residue 846 caused reductions in 
the level of virion-associated gpl20 to 70% or less of the 
wild-type level (Fig. 5C and Table 2). However, the virion 
association index of the mutant glycoproteins was similar to 
or greater than that of the wild-type envelope glycoproteins 
under conditions in which the levels of cell-associated gpl20 
were similar (Table 2). Moreover, the relative incorporation 
of gpl20 into virions was generally more efficient when the 
levels of wild-type or mutant cell-associated gpl20 were 
relatively low (Table 2). These results demonstrate that the 
cytoplasmic domain of gp41 is not required for the efficient 
incorporation of the gpl20 envelope glycoprotein into viri- 
ons. It can also be inferred that the incorporation of the gp41 
glycoprotein is not defective, since the cell or virus associ- 
ation of the gpl20 glycoprotein depends on an interaction 
with gp41 (24). 

To determine whether the observed decrease in the abso- 
lute amount of virion-associated mutant envelope glycopro- 
tein was the sole cause of the replication defect in virions 
made in COS-1 cells, replication complementation by the 
wild-type and mutant envelope glycoproteins was compared 
under conditions in which levels of virion-associated gpl20 
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FIG. 6. (A) Levels of virion-associated envelope glycoproteins and replication cximplementation. Values for virion-associated envelope 
glycoproteins were derived from Table 2. Values for replication complementation were determined by using the replication complementation 
assay in COS-1 cells (Table 1). All values are expressed as a percentage of the value for the wild-type envelope glycoprotein at 5 jjig of 
transfected plasmid DNA. Open circles represent the values, in consecutive order, for the wild-type envelope glycoproteins when 0.1, 0.25, 
0.5, 2.5, or 5.0 ^.g of envelope expressor plasmid DNA was transfected. For the mutant envelope glycoproteins, 5 iig of envelope expressor 
plasmid DNA was transfected. (B) Levels of envelope glycoproteins e3q>ressed on the cell surfacre and syncytium-forming ability. The 
expression of envelope glycoproteins on the cell surface was determined by immunoprecipitation of cell surface-accessible envelope 
glycroproteins following incubation of intact COS-1 cells with AIDS patient serum, analysis by SDS-PAGE, and quantitation by gel 
densitometry. All values are expressed as a percentage of the value for the wild-type envelope glycoproteins at 5 p-g of transfected plasmid 
DNA. Open circles represent values, in consecutive order, for the wild-type envelope glycxiproteins when 0.1, 0.25, 0.5, 1.0, 2.0, or 5.0 |JLg 
of plasmid DNA was transfected. For the mutant envelope glycoproteins, 5.0 \Lg of plasmid DNA was transfected. 



were similar. The relationship between the levels of virion- 
associated wild-type glycoprotein and degree of replication 
complementation was nonlinear, with saturation of the latter 
value when the level of virion-associated glycoprotein was 
only 35% of the maximum value (Fig. 6A). As shown in Fig. 
6A, relatively low levels of wild-type virion-associated en- 
velope glycoprotein can mediate virus entry efficiently. In 
<x>ntrast, most mutant envelope glycoproteins containing 
deletions within the cytoplasmic domain exhibited a replica- 
tion defect that could not be solely attributed to the reduced 
level of virion-associated envelope glycoproteins (Fig. 6A). 

The gp41 cytoplasmic domain is not required for syncytium 
formation. To determine whether a defect in fusion was the 
cause of the reduced replication complementation ability of 
the mutant envelope glycoproteins produced in COS-1 cells, 
COS-1 cells were cotransfected with a wild-type or mutant 
envelope expressor plasmid and cocultured with CD4-posi- 
tive SupTl cells, and syncytium-forming ability was scored. 
The syncytium-forming abilities of most of the mutant gly- 
coproteins expressed in COS-1 cells were less than those of 
the wild-type glycoproteins (Table 1). Since previous studies 
had demonstrated that syncytium formation in other cell 
types did not require the cytoplasmic domain (7, 24)» the 
syncytium-forming abilities of the mutant envelope g^co- 
proteins were also examined in Jurkat-/ar cells (11, 18). In 
Jurkat-tat cells, the syncytium-forming abilities of the mu- 
tant glycoproteins were usually equal to or greater than 



those of the wild-type glycoproteins (Table 1). These results 
suggest either that the syncytium-forming ability of the 
mutant glycoproteins is dependent on the cell type in which 
the envelope glycoproteins are expressed or that syncytium 
formation in COS-1 cells might be reduced as a consequence 
of decreased expression of the mutant envelope glycopro- 
teins on the cell surface. 

To determine whether the decrease in syncytium forma- 
tion observed in COS-1 cells could be solely attributed to 
reduced expression of envelope glycoproteins on the cell 
surface, the expression of selected mutant envelope glyco- 
proteins on the cell surface was determined by immunopre- 
cipitation, analysis on SDS-polyacrylamide gels, and quan- 
titation by gel densitometry. The syncytium-forming abilities 
of the wild-type and mutant envelope glycoproteins were 
compared under conditions in which similar levels were 
expressed on the cell surface. For the A(726-856), A(814- 
856), and A(840-856) mutant glycoproteins, the level of 
envelope glycoproteins expressed on the cell surface was not 
significantly different from the wild-type level when the 
levels of intracellular envelope glycoproteins were similar 
(Table 2 and Fig. 6B). Furthermore, the syncytium-forming 
abilities of the mutant envelope glycoproteins made in 
COS-1 cells were similar to that of wild type when similar 
levels of envelope glycoproteins were expressed on the cell 
surface (Fig. 6B). These results demonstrate that the cyto- 
plasmic domain is not required for the cell surface expres- 
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FIG. 7. Effect of mutations in the cytoplasmic domain on the 
sensitivity of replication complementation to soluble-CD4 (sCIM) 
inhibition. The effects of different soluble-CD4 concentrations on 
the CAT activity transferred to Jurkat lymphocytes by recombinant 
virions carrying the wild-type or mutant envelope glycoproteins are 
shown. Each value represents the percentage of CAT activity 
observed for a given mutant in the presence of soluble CD4 relative 
to the activity observed for the mutant in the absence of soluble 
CD4. micrpgrams. 



sion or the fusion function of envelope glycoproteins ex- 
pressed in COS-1 ceils. 

Soluble-CD4 sensitivity of tlie mutant envelope glycopro- 
teins. HIV-1 envelope glycoproteins that exhibit more than a 
twofold decrease in relative CD4-binding ability are less 
sensitive than the wild-type glycoprotein to soluble CD4 
inhibition of virus entry (38). To investigate the possibility 
that deletions in the cytoplasmic domain of gp41 affect the 
CD4-binding ability of the functional envelope glycoprotein 
multimer, the effect of mutations in the cytoplasmic domain 
on soluble CD4 inhibition of virus entry was examined. 
Equivalent amounts of recombinant virions produced in 
COS-1 cells as determined by measuring reverse tran- 
scriptase activity were incubated with different concentra- 
tions of soluble CD4 for 1 h at 3TC prior to infection of 
Jurkat target cells. The A(814-S56), A(840-856), and A(846- 
856) mutants exhibited the same sensitivity to soIuble-CD4 
inhibition of virus entiy as the wild*type envelope glycopro- 
tein (Fig. 7). From these results, it can be inferred that 
deletions in the cytoplasmic domain of gp41 do not signifi- 
cantly alter the CD4-binding ability of the native, multimeric 
envelope glycoproteins. This conclusion is consistent with 
previous Gi>servations that alterations in the gp41 cytoplas- 
mic domain do not affect the CD4-binding ability of the 
monomeric soluble form of the gpl20 glycoprotein (2, 7» 8, 
24). 

DISCUSSION 

This study demonstrates that the cytoplasmic domain of 
the HIV-1 transmembrane glycoprotein has at least two 
effects on viral replication. A highly conserved region within 
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residues 840 to 856 at the carboxyl terminus of gp41 is 
important for efficient virus entry during an early step in the 
virus life cycle between CD4 binding and formation of the 
DNA provirus. The entire cytoplasmic domain amino termi- 
nal to residue 846 affects the stability of envelope glycopro- 
teins made in COS-1 cells. The entire cytoplasmic domain 
appears to affect the efficiency of virus entry during the 
cell-free transmission of viruses made in COS-1 cells, 
whereas residues 726 to 769 are dispensable for cell-to-cell 
transmission in Jurkat cells. Whether these differences are a 
consequence of the cell type in which the envelope glyco- 
proteins are made or reflect differences in the requirement 
for this domain during cell-free versus cell-to-cell transmis- 
sion remains to be determined. 

The reduced efficiency of HIV-l provirus formation fol- 
lowing acute infection with HXB2A(840-856) demonstrates 
that the carboxyl terminus of the HIV-l transmembrane 
glycoprotein has an effect on the efficiency of virus entry 
during an early stage in the virus life cycle prior to formation 
of the provirus. The steps in the virus life cycle required for 
virus entry include CD4 binding, fusion of the virus and host 
cell membranes, and penetration and uncoating of the viral 
core (13). Deletion of the carboxyl terminus of gp41 is 
unlikely to affect CD4 binding, since the CD4-binding do- 
mains are all contained within the gpl20 subunit (24, 27). The 
lack of requirement for the cytoplasmic domain for syncy- 
tium formation and the lack of altered sensitivity to soluble 
CD4 provide further evidence that deletions in this region do 
not significantly affect the CD4-binding affinity of the multi- 
meric gtycoprotein complex. Deletions in the cytoplasmic 
domain of gp41 are also not likely to affect the fusion 
function, since this domain is not required for syncytium 
formation, as demonstrated here and in previous studies (7, 
24). The possibility that the membrane fusion function oif 
only the virion-associated envelope glycoproteins is defec- 
tive has not been excluded, but reliable methods for mea- 
suring the fusion function of the small fraction of virions 
relevant to infection are not available. The competence of 
the mutant envelope glycoproteins for CD4 binding and 
membrane fusion in conjunction with inefficient formation of 
the DNA provirus raises the possibility that these truncated 
envelope glycoproteins are defective for a step involved in 
the uncoating or penetration of the viral core. The location of 
the gp41 carboxyl terminus within the virion close to the 
viral core is consistent with the possibility that this region 
plays a role in virus uncoating or penetration. 

Independent of a role in virus replication, the integrity of 
the gp41 cytoplasmic domain amino terminal to residue 846 
is important for the stability of envelope glycoproteins 
expressed in COS-1 cells. The observation that the stability 
of gpl60 in HXB2A(840-856)-infected Jurkat cell cultures is 
decreased suggests that the cytoplasmic domain also has an 
effect on the stability of the envelope glycoproteins in other 
cell tjrpes. The effect of the cytoplasmic domain on envelope 
glycoprotein stability appears to be cell type dependent, 
judging by the observation that the effect is relatively minor 
in Jurkat cells and the finding that some mutant glycopro- 
teins that showed decreased stability in COS-1 cells were 
Uilfy functional, and thus presumably stable, in Jurkat cells. 
These results are consistent with previous studies demon- 
strating that the stability and processing of the HIV-l 
envelope glycoproteins depend on the cell type in which the 
envelope glycoproteins are expressed (8, 18). In COS-1 cells, 
approximately 75% of gpl60 is cleaved to produce the gpl20 
subunit, whereas in Jurkat cells, only a small fraction (10 to 
20%) of gpl60 is cleaved. This dependence of the efficiency 
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of gpl60 cleavage on cell type might, at least in part, account 
for some of the cell type-specific effects observed in this 
study, since a decrease in the steady-state level of gpl60 
would be less deleterious in cells which process gpl60 
inefficiently. In a previous study, Willey et al. demonstrated 
that intracellular sorting results in the transport of most 
uncleaved gpl60 to lysosomes, where it is degraded (40). 
The mechanism by which the cytoplasmic domain affects 
envelope glycoprotein stability is unknown. One possibility 
is that deletions in the cytoplasmic domain result in altered 
transport of gpl60, as observed in one study (16), which 
might affect localization to subcellular compartments where 
degradation occurs. 

The cytoplasmic domain of the transmembrane protein 
does not appear to be required for the efficient incorporation 
of the HIV-1 envelope glycoproteins into virions. These 
findings are consistent with those of previous studies dem- 
onstrating that SIV and equine infectious anemia virus 
containing truncated envelope glycoproteins are still infec- 
tious (4, 34). Similar studies of deletions of the shorter 
cytoplasmic domain of the Rous sarcoma virus envelope 
glycoprotein also demonstrated that this region was dispens- 
able for virion incorporation, although in this case, infectiv- 
ity was not affected (29). The studies reported herein, 
combined with systematic mutagenic analysis of the HIV-1 
gp41 transmembrane region (11), essentially eliminate the 
possibility that interaction of specific gp41 sequences with 
gag proteins is a necessary step for incorporation of enve- 
lope glycoproteins into virions. The incorporation of the 
HIV-1 envelope glycoproteins into virions occurred more 
efficiently when levels of wild-type or mutant cell-associated 
envelope glycoproteins were low. This result suggests that a 
mechanism for concentrating the envelope glycoproteins 
into virions is involved in HIV-1 virion assembly. Whether 
the cytoplasmic domain is required for another step in virus 
assembly is not known, but the examination of HXB2(840- 
856) virions by electron microscopy and determination of the 
rate of gag precursor processing (Fig. 2D and E) did not 
reveal any gross defects in virus assembly or maturation 
(lla). 

An HIV-l provirus containing a 4-amino-acid deletion and 
an additional 15 random residues in the gp41 cytoplasmic 
domain, XlO-1, was reported to result in a replication- 
competent HIV-1 exhibiting the ability to form syncytia but 
no ability to efficiently lyse single cells (9). A subsequent 
study also proposed a role for the cytoplasmic domain of 
gp41 in viral cytopathic effects (26). Contrary to these earlier 
studies, Kowalski et al. demonstrated that the replication 
and cytopathogenicity of the XlO-1 mutant in Jurkat T 
lymphocytes were indistinguishable from those of the wild- 
type virus (23). The results of the present study provide 
further evidence that the cytoplasmic domain of gp41 does 
not play a major role in viral cytopathic effect independent of 
the effects of changes in this region on viral replication rate. 
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abstract We have investigated the requirement of die sig- 
nal sequence for expression of influenza virus hemagghttiiun (HA). 
For this purpose we used a recombinant prepared from a late-re- 
gion deletion mutant of simian virus 40 (SV40) and doned influ- 
enza HA DNA; the influenza DNA was inserted into the late re- 
gion of SV40 previously occupied by the deleted sequences coding 
for $V40 capsid proteins. A simple in-phase deletion was made in 
the HA DNA, resulting in loss of 11 internal amino adds from the 
16 amino add signal peptide. This deletion HA recombinant was 
then used to infect African green monkey kidney celb. Mutant HA 
was not detected on the cell surface but stably accumulated in the 
cytoplasm at a level simOar to that of wild-type HA. NaDodSO^/ 
polyacrylamide gel analysis of lysates from infected cells showed 
that mutant HA was not glycosylated. Significantly, the amount 
of mutant HA synthesized was not affected by tunicamyctn. In 
contrast, wild-type HA was decreased more than 90% by timi* 
camydn. These findings suggest that mutant polypeptide is syn- 
thesized on free polyribosomes rather than on membrane-bound 
polyribosomes. Ibe mutant HA failed to agglutinate erythrocytes, 
probably due to a defect directly or indirectly assodated with the 
ladc of carbohydrate side chains. 

The hemagglutinin (HA) of influenza virus constitutes the ma- 
jor viral envelope glycoprotein and is responsible for binding 
of virus to sialic acid receptors on the cell surface (1). HA also 
mediates membrane fusion that initiates uncoating of the virion 
inside the infected cell (2-5). During infection HA is synthe- 
sized in the cytoplasm, glycosylated, and transferred to the outer 
cell membrane where the final stage of virus assembly takes 
place. Functional HA contains three HA polypeptide subunits 
and a specific protease cleavage of each subunit is required for 
viral infectivity (6, 7). 

Sequence analysis of HA from many influenza virus stains 
shows the presence of a hydrophobic sequence at the amino 
terminus and another hydrophobic sequence near the carboxyl 
terminus (8-11). The hydrophobic carboxyl terminus is re- 
sponsible for anchoring the polypeptide on the outer mem- 
brane because HA lacking these sequences is secreted extra- 
cellularly (12). On the other hand, the hydrophobic amino 
terminus is cleaved during maturation and is therefore absent 
in the membrane-integrated HA (13). These transient hydro- 
phobic sequences, commonly referred to as the signal peptide, 
exist in all known eukaryotic transmembrane and secretory pro- 
teins with several notable exceptions, such as ovalbumin and 
the influenza virus neuraminidase (14-16). Similar transient 
signal peptides also exist in the periplasmic proteins of pro- 
karyotes. It has been postulated that the signal peptide seg- 
ments play a crucial role in mediating the transfer of polypep- 
tide across the membrane by a specific mechanism known as 
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vectorial discharge of nascent polypeptide (17, 18). Synthesis 
and transfer of proteins utilize a specialized machinery of mem- 
brane-bound polyribosomes and the Golgi apparatus in eukary- 
otes. Mutational analysis of the signal sequences to further elu- 
ddate the transfer process has not been reported in eukaryotic 
systems. 

Previously we constructed a recombinant consisting of a late- 
region deletion mutant of simian virus 40 (SV40) and cloned 
influenza DNA representing the complete sequences of the gene 
coding for the HA sur&ce glycoprotein. The cloned influenza 
DNA was inserted into the late region of SV40 previously oc- 
cupied by the deleted sequences coding for SV40 capsid pro- 
teins. This HA-SV40 recombinant expressed fully functional 
HA in African green monkey kidney (ACMK) cells. The HA 
was glycosylated and inserted into the outer membrane of the 
infected cell (19). This system was used in the present study to 
examine the role of the signal peptide in HA maturation and 
function. In this study, we describe the construction of a dele- 
tion recombinant that produced HA lacking the signal peptide 
sequences. Analyses of functional defects in the mutant HA that 
affect glycosylation, maturation at the cell surface, and hem- 
agglutination activity are presented. 

MATERIALS AND METHODS 

Preparation of DNA and Construction of HA-SV40 Dele- 
tion Mutants. Previously we cloned a full-length DNA copy 
coding for die HA of influenza virus strain A/Udom/72 (H3N2). 
To achieve functional expression the cloned HA DNA was in- 
serted into the late region of a SV40 vector and a recombinant 
of HA-SV40 that produced a functionally active HA was iso- 
lated. Subsequendy the HA-SV40 DNA recombinant was cloned 
in pBIl322 by using the unique BamHl cleavage site (19). 

The recombinant pHA-SV40 DNA was prepared from a cloned 
Escherickia colt transformant as described (19). DNA frag- 
ments prepared by restriction enzyme digestion were sepa- 
rated on 3.5% polyacrylamide gels and eluted electrophoreti- 
cally. Tlie Mho II DNA fragment downstream of the cleavage 
site was digested with A exonuclease (28 units/ml) in a buffer 
containing 67 mM glycine (pH 9.4), 3 mM MgCla, 3 mM 2-mer- 
captoethanol at O^C for 30 min (20). Other enzyme digestions 
were carried out according to the prescribed conditions by their 
suppliers. Constructed plasmid recombinants carrying a signal 
peptide deletion were subjected to sequence analysis by the 
method of Maxam and Gilbert (21). Mutant HA-SV40 DNA 
was isolated after fiamHI digestion, circularized, and used for 
coinfection of ACMK cells with a tsA2B SV40 early-region mu- 
tant helper virus. 

Radiolabeling and Analysts of HA Polypeptides. ACMK cells 
infected with the mutant HA-SV40 and incubated at 40''C for 



Abbreviations: HA. hemagglutinin; SV40, simian virus 40; ACMK, Af- 
rican green monkey kidney. 



3563 



3564 Biochemistry: Seldkawa and Lai 

72 hr were labeled with [^Sjmethionine (100 /tCi/m]; 1 Ci = 
3.7 X 10'° Bq) in a methionine-free medium for 5 hr. Labeling 
of cells during infection with the wild-type HA-SV40 was car- 
ried out similarly. Cells were directly lysed in radioimmuno- 
precipitation assay buffer and the supernatant fraction was im- 
munoprecipitated for analysis on NaDodS04/polyaciylanude 
gels (19). 

Indirect Immunofluorescence Assay. Tbree days after in- 
fection with either wild-type or mutant HA-SV40, unfixed cell 
monolayers were stained direcdy in a live cell fluorescence as- 
say with sheep antiserum against HA and fluorescein-conju- 
gated rabbit anti-sheep IgG. Fixed cell fluorescence assay was 
performed on cells treated with 3.7% formaldehyde for 10 min 
and 0. 1% Triton X-100 for 5 min as described by Wehland et 

Hemagglutination Test. Lysates of recombinant virus-in- 
fected cells were prepared in phosphate-buffered saline and se- 
rially diluted in a microtiter plate. Guinea pig erythnx^es (0. 1% 
suspension in phosphate-buffered saline) were added to each 
dilution and hema^utination activity was examined after 30 
min at room temperature. 

RESULTS 

Constniction of Mutant HA Encoding Deleted Signal Se- 
quences. .The wild-type HA-SV40 recombinant cloned previ- 
ously at the unique BamHl site of pBR322 was used for the 
derivation. of HA deletion mutants. Fig. 1 illustrates, the strat- 
egy used to construct HA mutants lacking the signal peptide 
sequences. EndbR Mbo II cleaves the HA-SV40 plasmid once 
at a site 12 base pairs downstream of the initiation codon of the 
HA and several other sites outside the HA gene. Digestion of 
pHA-SV40 with Mbo U yielded two DNA fragments containing 
HA-specific seqaences. The DNA fragment downstream of the 
cleavage site was briefly treated with A exonuclease to shorten 
both DNA strands progressively, whereas the other Mbo U 
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no. 1. Construction of signal peptide deletion mutants of HAr-SV40. 
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fragment was not treated. After nuclease SI digestion, both HA 
DNA fragments were joined. In this manner, a series of dele- 
tion mutations were introduced at the Mbo U cleavage site while 
retaining the upstream Mbo II enzyme recognition site. The 
joined HA DNA from the h'gase reaction mixture was cleaved 
with Kpn I and Xba I to generate cohesive termini. This spe- 
cific DNA fragment was isolated and rebuilt into the pHA-SV40 
plasmid by joining it to the other fragment from the Kpn l/Xba 
I digest of pHA-SV40. In this manner, the signal deletion was 
introduced into the wild-type HA DNA molecule and repre- 
sented the only mutation in the HA-SV40 recombinant. The 
constructed DNA was used directly for cloning by transfor- 
mation of £. co/t. 

Sequence Analysis of Mutant HA Lacking the Signal Pep- 
tide. The constructed deletion mutant DNAs were isolated from 
£. coli tiansformants and characterized by digestion with EndoR 
Ava II and polyacrylamide gel electrophoresis. For compari- 
son, similar A 170 II digestion of the wild-type pHA-SV40 yielded 
a 338-nucleotide fragment containing the signal sequences and 
this type of analysis allowed us to estimate the size of the cor- 
responding fragments of mutant DNA. These estimates al- 
lowed us to determine the extent of deletion and served to guide 
us in choosing mutant recombinants for sequence analysis. 
Among six mutant isolates shortened by 30-100 nucleotides, 
we found one mutant that contained an in-phase deletion, 
whereas the other five showed either a +1 or a —1 frameshift 
deletion. The latter were not analyzed further. The wild-type 
nucleotide sequence and the encoded signal peptide sequence 
at the amino terminus of HA were compared with the se- 
quences of the HA-SV40 mutant (Fig, 2). TTie HA signal pep- 
tide consists of 16 amino acids — 1 charged amino acid (Lys), 11 
hydrophobic amino adds (Met, lie, Ak, Leu, Phe, Val, and Gly). 
and 4 uncharged polar amino acids (Thr, Ser, Tyr, and Cys). 
The proteolytic cleavage site is located at the Cly-Cln juncture 
because Gin is the amino terminus of the membrane-incor- 
porated HA molecules. The mutant HA sustained a deletion of 
33 base pairs or 11 amino acids, all within the signal peptide. 
As a result, the remaining signal region contains three hydro- 
phobic amino adds (Met, He, and Gly), a polar uncharged amino 
add (Tbr), and a charged amino add (Lys). Tbe signal peptide 
cleavage site of Gly-Gln and the subsequent sequences re- 
mained unchanged in the mutant. 

Analysis of HA by Indirect Immunofluorescence Assay. An 
indirect immunofluorescence assay was employed to identify 
the intracellular site of accumulation of the mutant HA in the 
infected cell. Initially, the deletion mutant genome of HA-SV40 
was packaged within SV40 virions and propagated together with 
the SV40 is helper virus at the restrictive temperature (40**C). 
AGMK cells were infected with the SV40 is helper and either 
the mutant or tbe wild-type HA-SV40 recombinant for 3 days 
prior to assay for HA by live cell fluorescence staining with HA 
antibody and an appropriate fluorescein conjugate. Fluores- 
cence was not detected on mutant infected ceUs, whereas cells 
infected with wild-type HA-SV40 showed positive surfece 
staining (data not shown). When infected cells were fixed and 
stained for HA fluorescence, wild-type HA was detected in the 
cytoplasm and on the membrane. In contrast, tbe mutant HA 
was found only in the cytoplasm. The mutant HA was stained 
as intensively as the wild-type HA (Fig. 3). These results dem- 
onstrate that mutant HA laddng the signal sequences accu- 
mulates in the cytoplasm but is not incorporated into tbe outer 
membrane. 

Analysis of HA by NaDodS04/PolyaGrylimiide Gel Elec- 
troi^resis. We estimated the molecular size of the mutant HA 
product to determine whether tbe mutant polypeptide is post- 
translationally modified by glycosylation in the same manner as 



Biochemistry: Seldkawa and Lai Froa NatL Acad. Sci, USA 80 (1983) 3565 



NboII 

Wt-HA ATG AAG ACT ATC ATT^GCT TT6 AGC TAC ATT T7C TST CT6 6TT CTC GGC CAA 6AC 

TRT TTC TGA TAG TAA CGA AAC TCG ATG TAA AAG ACA GAC CAA GAG CCG GTT CTG 

Met Lvs Thr lie lie Ala Leu Ser Tvr He Phe Cys Leu Val Leu Gly Gin Asp — 



dl-HA ATG AAG ACT ATC , „ * GGC CAA GAC 

TJ^ TTC TGA TAG ^ deletion ) — 

Met Lys Thr lie Gly Gin Asp — 

Fig. 2. Nucleotide sequences of the wild-type and the derived deletion mutant. Hie nucleotide sequences and the encoded amino acids at the 
amino tenninus are shown. Hie wild-type HA contains a signal peptide of 16 amino acids (underlined) whidi is cleaved by a protease. Also indicated 
is the Mbo U cleavage site where deletions are introdaoed to obtain this series of mutants. Hie mutant analyzed in this study lacks 11 of the 16 
amino adds. The mutant DNA retains an Mbo H deavage site because the G-A-A-G-A Af <6o D recognition sequence is still present, hp, base pair. 



the wild-type HA. Tunicamycin was used as an inhibitor of g)y- 
cosylation to determine if the mutant HA was modified by ad- 
dition of carbohydrate side chains. If the HA was glycosylated 
there should be a reduction in size of the HA in the presence 
of the inhibitor. On the other hand, if die mutant HA was not 
glycosylated it would be the same size whether or not tunica- 
mycin was present. Immunoprecipitation of radiolabeled ex- 
tracts from infected cells was performed and tjie precipitates 
were analyzed on NaDodS04/polyacrylamtde gels. As a con- 
trol, a lysate from AGMK cells infected with the wild-type HA- 
SV40 recombinant was used. Molecular mass of HA from the 
wild-type recombinant was estimated to be 70»000-75,000 dal- 




tons, similar in size to the glycosylated HA produced during 
influenza virus infection (Fig. 4). In the presence of tunica- 
mycin, an inhibitor that prevents polypeptide ^ycosylation (23), > 
this value was decreased to 56,000 daltons, the predicted size 
for unglyoosylated HA. It should be noted that addition of tuni- 
camycin also decreased the accumulation of unglycosylated wild- 
type HA by *»90%. Similar analysis showed that the mutant 
recombinant produced a HA polypeptide of 56,000 daltons 
equivalent to the predicted molecular size for unglycosylated 
HA. When tunicamycin was added during radiol^ling, the 
mutant HA product that corresponded to the most prominent 
band was not decreased in size and remained at 56,000 daltons. 
Other minor bands from the deletion mutant were probably 
derived from rearranged HA sequences that arose during pas- 
sages in AGMK cells. Hiese results provide evidence that the 
mutant polypeptide synthesized during infection was not mod- 
ified by giycosylation posttranslationally. In contrast to the wild- 
type HA tunicamycin did not show an inhibitory effect on the 
synthesis of mutant HA. 

Hemagglutination Activity of Unglycosylated Mutant HA. 
Lysates from AGMK cells were prepared after infection with 
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Fig. 3. Indirect inununofluorescence assay of HA in infected cells. 
AGMK cells were infected with the SV40 te helper and either the wild- 
type HA-SV40 virus ( Upper) or the mutant HA— SV 40 virus (Jxkvbt) for 
72 hr. Cells were fixed and stained as described in the text. Accumu- 
lation of the wild-type HA on the outer membrane is discernible. Ihe 
mutant HA stains as intensely as wild-type HA in the cytoplasm but 
does not accumulate on the membranes. 



Flo. 4. NaDodSOi/polyacrylamidegel analysisof the wild-type and 
mutant HAs. Monolayers of AGMK cells were infected with the wild- 
type HA-SV40 (wt-HA) and mutant HA-^40 (dl-HA> for 72 hr. In- 
fected cells were pretreated with tunicamycin (1 /ig/ml) for GO min prior 
to labding with [^jmethionine (100 MCi/ml) in methionine-free me- 
dium in the presence of 1 /ig of tunicamycin per ml. Cell lysates were 
immunopredpitated with HA antiserum and analyzed on 16% Na- 
DodS04/polyacrylamide gels. M indicates the polypeptide markers 
(shown in <failtons x 10'^) prepared from (^jmethiomne-labeled in- 
fluenza virions. 
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either the wild-type HA-SV40 or the mutant HA-SV40 and were 
examined for ability to agglutinate guinea pig erythrocytes. In 
a control experiment, a hemagglutination titer of 1:8 was ob- 
served for the wild-type HA. The iysate containing the ungly- 
cosylated mutant HA at an antigen concentration comparable 
to ^e wild-type HA did not show detectable hemagglutination 
activity (data not shown). 

DISCUSSION 

This study concerns the fimctional role of HA signal sequences 
in &e processing and maturation of the H3 subtype HA at the 
cell surfece. We constructed deletion mutants containing a 
deletion of the signal peptide at the Mbo II cleavage site. Mbo 
n cleaves at a site ei^t nucleotides downstream from its rec- 
ognition sequence 5' C-A-A-G-A 3' such that a one-nucleotide 
3' protrusion is generated. The current construction scheme 
allowed the retention of sequences including the Mbo II rec- 
ognition site and those encoding the first three amino acids, 
^er construction the deletion juncture should become a new 
Mbo II cleavage site; however, in our mutant, two additional 
bases were deleted from the upstream fragment for reasons not 
clear to us. This created a new Mbo 11 cleavage site three bases 
after the juncture. Nonetheless, it should be possible to apply 
the scheme used in our study for the construction of deletions 
d any length and thereby obtain a series of progressively ex- 
tended HA signal peptide mutants. Analysis of these HA mu- 
tants should yield information concerning the signal sequences 
required for functional activity. The mutant constructed in this 
study, with a deletion of 11 of the 16 amino acids in the signal 
sequence, was suitable for characterization of defective (unc- 
tions. This mutant retained the presumptive amino-terminal 
signal-peptide protease cleavage site and therefore specific 
cleavage at this site did not preclude the observed abnormal- 
ities in HA function. 

Our results demonstrate that the deleted HA polypeptide 
from the constructed mutant is not posttranslationaUy modified 
by glycosylation despite the presence of all normal Asn-X-Thr 
and Asn-X-Ser sequences. The signal hypothesis predicts that 
wild-type HA mRNA is translated on membrane-bound poly- 
ribosomes of the endoplasmic reticulum and the nascent HA 
polypeptides are subsequently transferred to the lumen by a 
mechanism of translocation and vectorial discharge. There is 
considerable evidence suggesting that glycosylation takes place 
initially on nascent polypeptides on endoplasmic reticulum 
membranes and subsequendy the carbohydrate components are 
modified in the Golgi apparatus (24, 25). It is interesting to note 
dvat production of wild-type, fully glycosylated HA (in the ab- 
sence of tunicamycin) was 10-fold higher than that of ungly- 
cosylated HA (in the presence of tunicamycin). One possible 
explanation is that tunicamycin inhibition freezes these gly- 
cosylation steps and hence translation of HA mRNA is slowed 
down, althou^ a low level synthesis of unglycosylated wild-type 
HA is eventually completed. Alternatively, tunicamycin in- 
creases the breakdown of unglycosylated wild-type HA because 
glycosylation confers intracellular stability and resistance to 
proteolytic digestion (26). On the other hand, translation of mu- 
tant HA mRNA probably proceeds on free polyribosomes and 
therefore the nascent polypeptide lacking the signal sequences 
&ils to translocate. Hie nascent as well as the completed mutant 
polypeptides do not encounter glycosyl transferases, which are 
membrane-associated, and the unglycosylated HA molecules 
remain in the cytoplasm. The finding that tunicamycin exerts 
no detectable inhibition on the accumulation of mutant HA fur- 
ther supports the model of synthesis on free polyribosomes. 

Our results show that the HA polypeptide lacking the tran- 
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sient signal sequences is not detectable at the cell surface. The 
transport of polypeptides from the cytoplasm to the outer 
membranes or for extracellular secretion is a complex process, 
presumably initiated by the signal sequences. Recent evidence 
suggests that the signal sequences at the amino terminus of 
polypeptides specify the initial recognition by a multiple-com- 
ponent structure known as signal peptide recognition particles 
(27, 28). This recognition process did not occur vnth the mutant 
HA polypeptide in simian cells. In £. coli, similar hydrophobic 
sequences that provide a transient signal for polypeptide trans- 
port also exist in several periplasmic proteins. Deletion or al- 
teration of these sequences results in accumulation of these 
proteins in the cytoplasm (29, 30). 

The HA-SV40 mutant produces unglycosylated HA that is 
fonctionally inactive, as assayed by the hemagglutination test. 
This failure to agglutinate erythrocytes suggests at least two 
possibilities. First, the mutant HA may be monomeric and thus 
M to assemble into a trimeric structure, which is required for 
ftinctional activity. Second, the defect of mutant HA may be 
due to the lack of carbohydrate components and as a result the 
mutant HA subunit fails to assemble properly. Because 50% of 
the carbohydrate side chains of HA can be removed without 
affecting hemagglutination of infectivity (31), it would seem that 
at least some carbohydrate components are not crucial. There- 
fore, it is most likely that the assembly process to form mature 
active surface HA may require a defined configuration of the 
polypeptide subunit that is programmed by posttranslational 
modifications such as glycosylation. 

Other observations support the view that deletion within the 
signal peptide of the HA polypeptide is most likely responsible 
for the functional defects described in this paper. Recently we 
have found that several HAs containing substitutions of amino 
adds in the signal peptide produced by point-mutation in HA- 
SV40 also showed similar defects of glycosylation and cell sur- 
&ce expression (unpublished data). This finding effectively rules 
out the possibdity that a mutation other than the signal se- 
quence deletion in our mutant was responsible for the observed 
fonctional abnormalities. ^ 

There are a number of differences evident when our results 
are compared with those of a similar recently published study 
(32). First, we constructed mutants that contained a simple in- 
ternal deletion in the HA signal sequence. The deletion HA 
that was studied extensively preserved the first four amino-ter- 
minal amino acids and the remaining sequences. Therefore, 
characterization of this mutant allowed us to correlate specific 
functional defects with this deletion. In contrast, using a dif- 
ferent construction procedure, other investigators derived a 
mutant tiiat sustained a deletion in the 5'-noncoding region of 
the HA recombinant. In addition, three different amino-ter- 
minal amino acids were added to replace the entire signal pep- 
tide (except the carboxyl-terminal Gly). Presumably as a con- 
sequence of either or both of these alterations, the mutant HA 
was synthesized at a level 0. 16% that of wild-type HA- Hence, 
it is difficult to interpret the lack of glycosylation of the small 
number of HA molecules that were produced. In contrast, mu- 
tant HA polypeptides directed by our mutant were made in the 
amount similar to that of wild-type HA molecules. Conse- 
quently, the absence of glycosylation of our mutant HA and its 
failure to migrate to the cell membrane can be convincingly at- 
tributed to deletion within the signal peptide. 

TTie authors thank Dr. Robert M. Chanock for helpful discussions 
and useful suggestions about the manuscript. We also thank Ms. Jo Ann. 
Beradt and Ms. Salome Kruger for their excellent technical assistance 
and Ms. Linda Jordan for typing the manuscript. 
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Sammary 

Engagement of the CD3/T cell antigen receptor complex on small, resting T celb is insufBcient 
to trigger cell-mediated cytotoxicity or to induce a proliferative response. In the present study, 
we have used genetic transfection to demonstrate that interaction of the B7-BB1 B cell activation 
antigen with the CD28 T cell differentiation antigen costimulates cell-mediated cytotoxicity 
and proliferation initiated fay either anti-CD2 or anti'CD3 monoclonal antibody (mAb). Moreover, 
a B7-negative Burkitt's lymphoma cell line that fails to stimulate an allogeneic mixed l3mnphocyte 
response is rendered a potent stimulator after transfection with B7. The mixed leukocyte reaction 
proliferative response against the B7 transfectant is inhibited by either anti-CD28 or B7 mAb. 
We also demonstrate that freshly isolated small, resting human T cells can mediate anti-CD3 
or anti*CD2 mAb-redirected cytotoxicity against a murine Fc receptor-bearing mastocytoma 
transfected with human B7. These preexisting cytotoxic T lymphocytes in peripheral blood arc 
present in both the CD4 and CD8 subsets, but are preferentially within the CD45RO* 
"memory** population. While small, resting T cells apparently require costimulation by CD28/B7 
interactions, this requirement is lost after T cell activation. Anti-CD3 initiates a cytotoxic response 
mediated by in vitro cultured T cell clones in the absence of B7 Uganda The existence of functional 
cytolytic T cells in the small, resting T cell population may be advantageous in &ciliuting rapid 
responses to immune challenge. 



T lymphocytes recognize antigen via the CD3/TCR com- 
plex. Binding of anti-CD3 or anti-TCR mAb to T cells 
results in a rapid increase in intracellular [Ca^^] and the 
generation of inositol triphosphate (IP3) (1-4). Although 
crosslinking of CD3/TCR alone is often sufhcient to induce 
inositolphosphate pathway activation, other signals are ap- 
parently necessary to induce functions such as cytokine secre- 
tion and proliferation (4). Both soluble &ctors and interac- 
tion vrith cell surface receptors have been implicated as 
costimulators of CD3/TCR-mediated activation. For example, 
allo-antigen-specific helper T cells fail to produce cytokines 
in response to HLA-DR7 antigen expressed in murine L cells. 
However, cotransfecdon of HLA-DR7 and CD54 (intracellular 
adhesion molecule 1 [ICAM-l]), a cellular ligand for CDll/18 
leukocyte function-associated molecule 1 [LFA-l]), restores T 
cell responsiveness (5). Thus, antigen-specific recognition alone 
m^ be insufficient to trigger effector funcdons since addi- 
tional accessory molecules are required for an efficient response. 

CD28 is a disulfide-linked homodimer that Is expressed 
on the majority of human peripheral blood T cells (6, 7). 



mAbs against CD28 in conjunction vidth phorbol ester in- 
duce T cell proliferation (7) and augment proliferarion in- 
duced by anti-CD3 or anri-CD2 mAb (8-12). Anti-CD28 
mAb-induced proliferation is IL-2 dependent (7, 8) and pos- 
sibly results from stabilization of messenger RNA for IL'2 
and other cytokines (13). 

Recently, it has been demonstrated that a B cell activation 
antigen. B7 (14) or BBl (15), is a natural ligand for CD28 
and that this receptor/ligand interacdon mediates heterotypic 
adhesion (16). Interaction of CD28 and B7 results in aug- 
mentation of T cell proliferation and cytokine production, 
and antibodies against CD28 or B7 inhibit alloantigen and 
mitogen-induced proliferadve responses (16, 17). B7 is weakly 
expressed on resting B cells and monocytes, and is elevated 
after stimulation with pokeweed mitogen, anti-Ig, anti-HLA 
class II, and EBV (14, 15, 17). Therefore, it is likely that T 
cell activation via CD28 will depend on the capacity of the 
AFC to upregulate expression of B7. The importance of the 
CD28/B7 interaction in polyclonal mitogenic responses 
prompted us to investigate whether the binding of B7 to CD2d 
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costiraulates the generation of a proliferative response against 
alloantigen and can augment the generation of T cdl-medi- 
ated cytotoxicity in small* resting T lymphocytes. 



Materials and Methods 

Frnaratum of T Lymphocytes. Human peripheral blood was ob- 
tained from the Stanford Blood Center (Pilo Alto, CA). Mononu- 
clear ceUs were isolated by density gradient centrilugation using 
Ficoll/Hypaque. Monocytes and B cells were depleted by plastic 
adherence and passage through nylon wool colunws (18). Nonad- 
herent lymphocytes were fractionated by centrifugation on discon- 
tinuous gradients consisting of 30% and 40% Percoll (Pharmacia 
Fine Chemicals, Piscataway, NJ) in PBS containing 10% PCS (18). 
High buoyant density lymphocytes enriched for small, resting T 
cells were isobted from the bottom of the fticoll gradients. Thymo- 
cytes from 1-yr-old children undergoing cardiac surgery were ob- 
tained from Stanford Medical Center (Palo Alto, CA). The biop- 
sies were minced with sdssors, the single cell suspension was passed 
through a nylon mesh, and mononuclear cells were isolated by 
Ficoll/Hypaque density centrifrtgation. Umbilical cord blood was 
collected from healthy full-term neonates immediately afrer vag- 
inal delivery. High buoyant density cord blood CD3^ T lympho- 
cytes were obtained as for peripheral blood. 

T Cell Clones, T cell clones were established from normal PEL 
or frtal hver, as described (19). The following clones were used: 
1375b2.22 is a TCR-7/5* CTL done established from fetal liver; 
1360b.l and 1320.3 are TCR-a/^* CTL clones esublished from 
fetal liver. Jp28a.6 is a TCR-a/0* CTL clone esublished from 
normal PBL. Antigen specificity of these T cell clones has not been 
determined. 

mAbs. BBl mAb (15) was generously provided by Dr. £d Clark 
(Univenity of Washington, Seattle, WA). L293 is a murine IgGl 
mAb directed against CD28 that was generated by immunizing 
BALB/c mice with the HPB-ALL T cell line and fosing immune 
splenocytes with Sp2/0 myeloma cells. L303 (IgG2a, x) and L304 
(IgGl, ic) aie murine mAbs directed against CD2 and CD2R, respec- 
tively, that were generated by immunizing BALB/c mice with anti- 
CD3-activated PBL and fusing inmiune splenocytes v^rith Sp2/0 
myeloma cells. Other mAbs were generously provided by Becton 
Dickinson Immunocytometry Systems (San Jose, CA). F(ab')2 
fragments of anti-CD3, CD28 and CD16 mAbs (aU IgGl isotype) 
were prepared using immobilized pepsin (Pierce Chemical Co., 
Rockford, IL). Purity of F(aV)2 fragments was verified by SDS- 
PAGE analysis. 

Thmsfeciion. P815, a murine mastocytoma cell line, and Ramos, 
a human Burkitt's lymphoma cell line, were obtained from Amer- 
ican Type Culture Collection (Rockville, MD). Human B7 cDNA 
(generously profvided by Dr. J. Allison, University of California, 
Berkeley) was subdoned into the pBJ expression vector (20). 10^ 
P815 and Ramos cells were transfected with 15 fxg pBJ/B7 plasmid 
by ekrti op o r ation (2.0 KV, 25 fiVD) using a Gene Pulser (Bio-Rad 
Laboratories, Richmond, CA) and selected in culture medium 
(RPMI 1640 [M. A. Bioinroducts, WalkersviUe, MD] 10% PCS 
[IR Scientific Woodland, CA], 1 mM sodium pyrruvate, 1 mM 
L-glutamine, 1% penidUin-streptomycin) supplemented with 2 
mg/ml G418 (Gibco Laboratories, Grand Island, NY). Afrer drug 
selection, B7-transfected P815 and Ramos cells were cloned and 
sdeaed for high cdl surface B7 expression by flow cytometry. Trans- 
fectants were maintained in culture medium containing 1 mg/ml 
G418. 

Anti-CDS-induced Redirected Cytotoxicity Assays. Cytotoxicity 



was measured in a 4-h ^*Cr-release assay (18). Anti-CD3 or anti- 
CD2 mAbs were added to "Cr-bbeled P815 or B7* P815 target 
cells 30 min before addition of e£Fector T cells, unless indicated 
otherwise. In mAb blocking studies, aU mAbs were used at a final 
concentration of S ^g/ml and were added at initiation of the assay. 

FroUfgmtion Assays. Irradiated (80 Gy) P815 and B7* P815 (3 
X 10* cells/well) were plated with anti-CD3 (anti-Leu-4; 5 
/ig/ml) or anti-CD2 (L303 and L304, 1.25 Mg^ml) mAbs in flat- 
bottomed 96-well micrptiter plates (Falcon Labware, Lincoln Park, 
NJ). Afrer incubation, for 30 min at rocmi temperature^ high buoyant 
density T cells were added at 10^ cells/well. Cultures were in- 
cubated at 37^C with 5% CO2 in a humidified atmosphere for 
72 h. For the mixed lymphocyte response, 10* high buoyant den- 
sity T cells and 10* irradiated (80 Gy) Ramos or B7^ Ramos cells 
were added co microtiter plate wells vnth or without mAbs and 
were incubated for 7 dl All cultures were bbeled for the final 18 h 
with 1 /iCi/well [^H]thymidine (New England Nuclear, Boston, 
MA) and were harvested on a 96-well plate harvester (LKB Instru- 
ments Inc, Gaithersburg, MD). Incorporated radioactivity was mea- 
sured in a beta plate scintillation counter (LKB Instruments, Inc.). 
Each value is the mean of tripticate wells. Standard mean error of 
the triplicates was <10%. 

Immunofluorescence and Flow Cytometry. Methods of immuno- 
fluorescence and flow cytometry have been described previously 
(21, 22). Flow cytometry was performed using a FACScan* or 
FACStar"-^ (Becton Dickinson Immuno^ometry Systems). 



Restdts and Diacnstion 

CD28/B7 Costimuhues Both CD2- and CDS-dependent T Cett 
Prolifenttion. Human B7 cDNA (23) was subcloned into a 
raammalian expression vector, transiSxted into an FcR-bcaring 
murine mastocytoma cell line, P815, and stable transfectants 
expressing high levels of surface B7 were established. A rep- 
resentative transfectant is shown in Hg. 1. To determine 
whether the B7'' P815 ceils were functionally competent to 
activate CD28, purified high buoyant density periphoal blood 
T cells were cocultured with irradiated B7'*' P815 or parental 
P815 cells in. the presence or absence of anri-CD3 mAb. Con- 
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Figon 1. B7'' P815 transfectants. Roenta] P815 {A) and B7^ P815 (B) 
truufiBctants were stained with control or BBl mAb, followed by FITC- 
coojugated goat anti-mouse Ig. Samples were analyzed by flow cytom- 
etry. The X-axis represents fluorescence (four-decade log scale) and the 
y-axis represents the relative cell number. Histograms from cells stained 
with control mAb (nearest the ordinate) are superimposed over the histo* 
gram of cells stained with BBl. 
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Fignn 2. B7 costimulation of anti-CD3-iiiduced T cdl proliferation. (A) Proliferation of small/ r«»ting peripheral blood T cells cocultuied with 
irradiated P815 (•) or B7* P815 (■) in the presence of anti-CD3 raAK (B) BBl mAb. but not IgM control mAb Leu-7, inhibits B7* P815 augmen- 
ution of anti-CD3-induced T cell prolifination. Note that BBl mAb is a murine IgM and does not interfere with binding of aiiti-CD3 mAb to IgG 
Fc TDcepton on P815. (C) Intact but not F(ab')2 fragments of anti<X>3 mAb stimulate praliferatibn of small, resting T cells in the ptesence of B7^ P815. 



sistent with prior findings (24), freshly isolated, small pe- 
ripheral blood T cells obtained from high buoyant density 
lymphocytes were minimally responsive to triggering via anti- 
Cp3. However, when T cells were stimulated by coculture 
with B?" P815. but not parental P815, we obs^i^ a pro- 
nounced effect on anti-CD3-induced proliferation (Fig. 2 A). 
Anti-CD3-induced proliferation costimulated by B?"*^ P815 
was substantially inhibited by anti-B7 mAb BBl (IgM) but 
not isotype-matchcd anti-CD57 mAb Leu-7 (IgM) (Fig. 2 
B). Similarly, the addition of anti>CD28 F(ab')2 fragments 
resulted in the inhibition of T cell proliferation (not shown). 
This indicated that the response was dependent upon the 
CD28/B7 interaction. Anti-CD3-tnduced proliferation was 
Fc dependent, in that no prolifieration was observed when 
T cells and B7* P815 were cultured with F(ab')2 anti-CD3 
firagments (Fig. 2 C). 

mAbs directed against certain CD2 epitopes induce T cell 
proliferation (25). To determine whether the CD28/B7 in- 
teraction could also stimulate T cell proliferation triggered 



via CD2, we examined the effect of anti-CD2 mAbs on B7- 
dependent T cell proliferation (Fig. 3). Saturating concen- 
trations of L303 (CD2) and/or L304 (CD2R) mAbs never 
induced substantial proliferation of small resting T cells in 
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Figure 3. B7 costimulation of anti-CD2-in<luced T cell proliferation. 
Small, resting peripheral blood T cells were cocultured with irradiated 
parental P815 (V). B7* P815 (■), or without P815 celU (H) in the 
presence or absence of anti-CD3. anti-CD2 (L303). and/or anti-CD2R 
(L304) mAb. 
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Hgore 4. B7 expression augments an allogeneic MLR. Smallp resting 
peripheral blood T cells firom two donors {A and B) were cocultured with 
irradiated Ramos or B7'*' Ramos in the presence or absence of BBl mAb, 
anti-CD28 F(ab')2 fragments, control IgM mAb Leu*7, or control anti- 
CD16 F(ab')2 fragments, as indicated. 
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the presence of P815. However, when resting T cells were 
costimulated with B7^ P815, L303 and L304 together in- 
duced significant proliferation » which was inhibited by BBl 
niAb (not shown). These results suggest that the binding 
of B7 to CD28 costimulates both the CD3 and CD2 activa- 
tion pathways. 

CD28/B7 Intemctions Augment Primary Allogeneic MLR. 
EBV-transfbrmed B cell lines are known to serve as potent 
stimulators of alloantigen-induced T cell proliferation. B lym- 
phoblastoid cell lines have been reported to express high levels 
of cell sur£ice B7 (15), suggesting the possibility that B7 ex- 
pression is necessary for the generation of a primary MLR. 
Ramos is an EBV-negative American Burkitt^s lymphoma ceU 
line that expresses high levels of class I and II MHC antigens, 
but does not express B7. Preliminary experiments demon- 
strated that this cell line was a poor stimulator of allogeneic 
MLR. Ramos was transfected with B7 cDNA« and stable 
transfectants expressing high levels of B7 were selected. Fig. 4 
shows the proliferative response of small» resting T cells from 
two donors cocultured with irradiated parental Ramos and 
B7- transfected Ramos. Parental B7-negative Ramos fuled to 
stimulate small, resting T cells. By contrast, small, resting 
T cells cocultured with B7-transfected Ramos mediated sub- 
stantial proliferative responses. These proliferative responses 
were substantially inhibited by either BBl or anti-CD28 
F(ab')2 fragments, but not isotype-matched anti-CD57 or 
anti-CD16 F(ab')2 fragments. 

CD28/B7 Intemctions Augment CeU-meJiated Cytotoxicity Trig- 
gered thwu gh CD3 and CD2. CTL clones can lyse ¥c receptor- 
bearing targets in the presence of anti-CD3 mAb in a **re- 
directed" cytotoxicity assay (26-29). Unlike CTL clones, 
freshly isolated small resting T lymphocytes do not mediate 
anti-CD3 redirected lysis (29, 30), although a small subset 
of 0056**^ or CI>S7* T cells that may represent in vivo ac- 
tivated T cdls do mediate this function (30, 31). Thus, en- 
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Figure 5. B7*depeDdent anti- 
CD3-rDdirected cytotcodchy. ^^Cr- 
labeled F815 (•) or B7^ P815 
(■) target cdls were added to 
plates widi citaed amount of ants' 
CD3 (Leu-4) mAb. After incuba- 
tion for 30 min at room tem|)era* 
ture, fireshly isolated small, testing 
peripheral blood T cells were 
added and the assay was harvested 
after 4 h. Resvdts &om expen- 
ments using two dififerent T cell 
donors {A and B) are shown. E/T 
ratio was 20:1. 



gagement of CD3/TCR alone is insufficient to trigger a cyto- 
lytic response in the majority of resting peripheral blood T 
lymphocytes. It is possible that small, resting T lymphocytes 
lack the cellular machinery necessary for cytotoxicity. Altcr- 
nativdy, additional signals m^ be necessary to generate a 
cytolytic response in resting T cells that are not required for 
stimulation of activated T cells. The ability of combinations 
of hetero-biiunctional anti-CD3/antitumor and anti'CD28/ 
antitumor mAbs to induce cytotoxicity suggested that inter^ 
action of CD28 with its natural ligand may provide the re- 
quired costiniulatory signal fi>r generation of CTL from resting 
peripheral blood T cells (32). As shown in Fig. 5, freshly 
isolated small resting peripheral blood T cells demonstrated 
potent anti-CD3-redirected cytotoxicity against B7* P815, 
but not p^ental P815, detecuble using a 4-h ^^Cr release 
assay. Comparable results were obtained using three indepen- 
dently derived B7^ P815 transfectanU; however, P815 trans- 
fectants expressing high levels of CD16-II were not lysed in 
the presence of anti-CD3 (not shown). Lysis of B7* P815 
transfectants was inhibited by BBl or anti-CD28 F(ab')2 
fragments (pig. 6 ^). Anti-CD3-induced (ytotoxicity was 
Fc dependent since anti-CD3 F(ab')2 fragments failed to 
trigger lysis against B7+ P&IS (Fig, 6 B). 

mAbs against CD4 (Leu-3a), CDS (Leu-1), CD7 (Leu-9), 
CDS (Leu-2a). CDlla (L7), CD18 (L130), and CD38 (Lcu- 
17) failed to induce redirected lysis against parental or 67"^ 
P815 cells (not shown). However, as with the proliferative 
response, appropriate combinations of anti-CD2 mAb (L303 
+ L304) coidd trigger cytotoxicity against B7* P815, but 
not parental P815 (Fig. 7). Anti-CD2-induced cytotoxicity 
was inhibited by BBl antibody, but not isotype-matched con- 
trol mAb (not shown). These data indicate that binding of 
B7 to CD28 costimulates CD3/TCR- and CD2-dependent 
T cell-mediated cytotoxicity and demonstrates that small, 
resting T cells are capable of cytotoxic funcdon. 

The mechanism whereby expression of B7 on the target 
cell enables small, resting T cells to initiate lysis has not been 
defined. It is possible that the CD28/B7 interaction increases 
or stabilizes heterotypic cellular adhesion between the effector 
and target cells. However, in the present experimental system 
this seems unlikely since effectors and Fc receptor-bearing 
targets are already ef&dently bridged by anti-CD3 or anti- 
CD2 mAbs. No obvious differences were observed in con- 
jugate formation between parental or B7'*' P815 cells and 
effector T cells in the presence of anti-CD3 mAb (unpub- 
lished observation). Alternatively, interaction of CD28 with 
B7 on the target may provide a cosignal to the T cell that 
is necessary to mobilize the cellular machinery necessary for 
cytotoxicity. Preliminary studies have indicated that cytotox- 
icity is unlikely to be mediated by a soluble cytotoxic factor, 
since no b3rstander cytotoxicity was observed when ^'Cr-la- 
beled parental P815 cells were admixfd virith tmlabeled B7* 
P815 cells in the presence of effector T cdls and anti-CD3 
mAb (unpublished observation). Further studies will be neces- 
sary to elucidate the mechanism of cell-mediated cytotoxicity 
initiated by small, resting peripheral blood T cells. 

Effectors of CD28/B7 -dependent Cytotoxicity Using B7* 
P815 targets, anti-CD3-reclirected cytotoxicity could also be 
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Fignre 6. Inhibidon of B7-depeiulent aiiti-CD3- 
redirected cytotoxicity. {A) Anti-CX)28 F(ab')2 frag- 
ments and BBl (IgM) mAb, but not contxol anti-CDl6 
F(ab')2 fragments or control Leu- 7 (IgM) rnAbs, in- 
hibit anti-CD3-redirected cytotcaddty against B?"^ 
P815. £/T ratio was 20:1. (B) Intact (•). but not 
F(ab')2 <0) anti-CD3. isduoes ged ir ected ototoxicity 
against '^Cr-bbeled B7^ P815 targets. (▲) Cytotox- 
icity against 87*^ P815 cells in the absence of anti- 
CD3 mAb. AD mAbf were used at a 6nal concentia* 
tion of 5 |ig/ml and were added at the start of the assay. 
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Figore 7. B7"depcndent anti-CD2'tedixected cytotoxicity. Anti-CD2 
mAb-iediitcted cytotoxicity mediated by small, resting peripheral blood 
T cells against P815 (■) or B7^ P815 (■). All mAbs were used at a final 
concentration of 5 /ig/ml and were added at the start of the assay. E/T 
ratio was 20:1. 

demonstrated iising freshly isolated thymocyte and cord blood 
T cell effectors, although the activity was lower than that 
observed using adult small, resting T cells (Fig. 8 A), Both 
small resting peripheral blood CD56~ JCD4^ and CD56', 
CDS'** T cdls mediated equivalent levels of aDti-CD3-in- 
duced cytotoxicity (Hg. 8 B), The ability of both CD4 and 



CD8 T cells to mediate cytotoxicity is consistent with the 
ability of both CD4 and CD8 T cell clones to mediate anti- 
CD3-redirectcd cytotoxicity, demonstrating that both major 
subsets of T lymphocytes possess the necessary cellular com- 
ponents for cytolytic function (27, 33, 34). 

Prior studies have indicated that difeential immune re- 
sponses can be mediated by subsets of T cells distinguished 
based on expression of CIM5 isoforms (35, 36). "Virgin'* 
or "naive" T cells are characterized by expression of high levels 
of CD45RA and the absence of CD45RO, vdiereas "memory" 
T cells express CIM5RO and lower levels of CD45RA. When 
small, resting peripheral blood T lymphocytes were separated 
into CD45RO- and CD45RO* fractions, CD45RO^ T 
celk denionstrated a significantly higher level of cytotoxicity 
against P815 Urgets when stimulated with either anti- 
CD3 (Fig. 9) or anti-CD2 mAbs (not shown). Comparable 
results were obtained using T cells isolated from four dissent 
individuals. It is likely that the B7-costimulated anti-CD3-re- 
directed killing observed in cord blood T cells is accounted for 
by the presence of a low number of CD45RO^ cells (9%) 
in this population. As shown in Hg. 10, both CD45RO~ 
and CD45RO^ T ceUs express CD28 on the cell surface, 
demonstrating that this di£ferential responsiveness cannot 
be explained simply by lack of CD28 expression in the 
CD45RO- T cell population. 



Fignre 8. T subsets demonstrating B7-dependent T 
cell-mediated cytotoxicity. {A) Cytotooddty mediated by 
thymocytes, cord blood T cells, and adult small, resting 
peripheral blood T cells against 37'*^ P815 in the pres- 




ence (■) or absence (■) of anti*CD3. Note that high 
buoyant density cord blood CD3 * T lymphocytes were 
87% CD4% 12% CDS*, and 9% CD45RO*. T cells 
fuled to demonstrate any cytotooddty against parental P815 
in the preiMnce of anti-CD3 niAb (not shown). (B) Anti- 
CD3-tedirdned cytotooddty mediated by CD4~ and 
CDS" smaU, testing peripheral blood T ceUs agaiiut 
P815 (m or B7^ P815 (■). High buoyant density T 
lymphocytes were stained with PE^njugated anti-Leu- 
19 (CD56) and FTIC-conjugated anti-Leu-2 (CDS) or anti- 
Leu -3 (CD4) mAbs. CD8*.CD56-; CD8-.CD56-; 



CD4 * .CD56 - . and CD4'JCX>56- subscu were isolated 
to >95% pxirity by flow cytometry. Data are presented for the CD4-.CD56- (i^., CDS*) T cell fraction and the CD8-JCD56- (i^» CD4*) 
T cell fraction. I^itively sorted CD8*,CD56~ and CD4*,CD56~ T cells were also tested and demonstrated equivalent cytotooddty against B7-P815 
targets in the presence of antt-CD3 (not shown). E/T ratio was 20:1. 
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Fignre 9. B7-dcpcndeDt cytotoxicity mediated by memory T cells. High 
buoyant density T lymphocytes from two individuals {A 2nd fi) were stained 
with PE-conjugated anti-CI>45RO (UCHU) and were sorted by flow 
cytometry. Purity of the sorted subsets was >98%. Umcparated T cells 
(•)» CD45RO- T cells (■). and CD4SRO'^ T cells (▲) were assayed 
for anti-CD3-redii«cted cytotoxicity against siCr-Iabded B7* P815 odls. 
Staining with anti-CD45RO mAb did not affect levels of anti-CD3- 
redirected lysis compared with unstained cells (not shown). 
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Figure 10. Cocxpression of CD28 on CD45RO- and CD45RO* 
small, resting peripheral blood T cells. High buoyant density T Ijonpho- 
cytes were suined with FlTC-conjugated anti-CD28 (9.3), PE-conjugated 
anti-CD45RO, and PerCP-conjugated anti-CD3, or appropriate control 
mAb. Samples were analyzed on a FACScan*. Markers were positioned 
to include >987c> of cells stained with FITC- and PE-conjugated isotype 
control antibodies in the lower left qiudrant (not shown). An electronic 
gate was set on PcrCP-suincd CD3* T cells, and correlated fluorescence 
of FITC (CD28) and PE (CD4SRO) were displayed as a contour map 
(four-decade log scales). Axialysis of CD45RO cxpxesuon in small, resting 
T cells isolatied from four individuals revealed that the proportion of 
CD4SRO'^ T cells ranged from 48 to 78%. 



memory T cell population is primarily responsible for this 
activity and both CD4 and CD8 T cells are capable of this 
function. It is unlikely that this cytotoxicity is mediated by 
in vivo activated T lymphocytes, since highly purified, small. 



Bl'independent Cytotoxicity Mediated by T Cell Clones. The 
above results si^gest that small, resting T cells require 
CD28/B7 costimulation to induce CD2- or CD3-dcpendent 
cell-mediated cytotoxicity. In accordance with prior obser- 
vations (26-29). TCR-a/j8* and CTL clones 
mediated anti-CD3-redirected cytotoxicity against parental 
P815 (Fig. 11). Thus, while resting T cells apparently require 
CD28/B7 interactions to trigger cell-mediated cytotaxicity, 
this is no longer essential for stimulation of activated T lym- 
phocytes. However, even with the T cell clones, we consis- 
tently observed that the presence of B7 on PB15 slightly aug- 
mented cytotoxicity compared with parental P815 targets 
(Fig. 11). 

In summary, interaction of CD28 with its natural ligand 
B7 costimulates the induction of cell-mediated cytotoxicity, 
as well as prolifieradon, of small, resting T lymphocytes. Both 
CD2 and CD3 activation pathways cooperate with CD28 
as a consequence of CD28 binding its natural ligand B7. 
With respect to induction of cytotoxicity, the CD45RO'^ 
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Fsgore 11. B7-indcpfndcnt anti-CD3-redirected cytotoaucity mediated 
by T cell clones. TCR-a/^^ (Jp28a.6, 1360b.l. and 1320.3) and TCR- 
7/6* (1375b2.22) T cell clones were incubated with siCr-labclcd B7* 
P815 (■) or P815 {■) in the presence or absence of anti-CD3 mAb, as 
indicated. £/T ratio was 10:1. 
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resdng T cells isobted by density giadient centrifugadon weir 
used for these experiments. Moreover, cytotoxicity was only 
observed using the B7'*' P815 trans£ectant and not with the 
parental P815 cells. Since in vitro cultured CTL, as well as 
the presumably in vivo activated CD56'*' T cells mediate 
redirected lysis against parental P815» it seems likely that the 
CD28/B7 interaction is necessary only for costimulation of 
small, resting T ceUs and is no longer required after activa- 
tion. Consistent with this concept, we have previously demon- 
strated that T lymphocytes that mediate potent 
anti-CD3-redizected cytotoxicity dowmegulate CD28 expres- 



sion after stimulation with IIr2 (37). Moreover, the subset 
of CD28- peripheral blood T cells that coexpress CDUb 
in vivo are predominantly granular, low buoyant density T 
cells that m^ represent activated T ceUs that have under- 
gone clonal expansion (38, 39). Therefore, the CD28/B7 in- 
teraction may be most critical in the early generation of a 
cell-mediated immime response, consistent with a possible 
role of CD28/B7 in the stimulation of immature thymocytes 
(40, 41). The existence of functional cytolytic T cells in the 
small, resting T cell population may be advantageous in 
fiictlitating rapid responses to inmiune challenge. 
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Through the interaction with its ligands* CD80/B7-1 
and CD86/B7-2 or B70. the human CD28 molecule plays a 
major functional role as a costimulator of T cells along 
with the CD3'TcR complex. We and others have previ- 
ously reported that phosphatidylinositol 3-kinase in- 
ducibly associates with CD28. This association is medi- 
ated by the SH2 domains of the p85 adaptor subunit 
interacting with a C3^oplasmic YMNM consensus motif 
present in CD28 at position 173-176. Disruption of this 
binding site by site-directed mutagenesis abolishes 
CD28-lnduced activation events in a murine T-cell hy- 
bridoma transfected with human CD28 gene. 

Here we show that the last 10 residues of the intracy- 
toplasmic domain of CD28 (residues 193-202) are re- 
quired for its costimulatory function. These residues are 
involved in interleukin-2 secretion, p85 binding, and 
CD28-associated phosphatidylinositol 3-kinase activity. 
In contrast, the CD28/CD80 interaction is unaffected by 
this deletion, as is the induction of other second messen- 
gers such as the rise in intracellular calcium and tyro- 
sine phosphorylation of CD28-specific substrates. Fur- 
thermore, we also demonstrate that, within these 
residues, the tyrosine at position 200 is involved in p85 
binding, probably together with the short proline-rich 
motif present between residues 190 and 194 (PYAPP). 



In the absence of a costimulatory signal, activation of the 
CD3'TcR' complex is not sufficient to induce the complete 
activation of T lymphocytes. The interaction between CD28 on 
T lymphocytes and its counter-receptors CD80 (B7-1) and 
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CD86 (B70 or B7-2) on antigen-presenting cells provides a 
costimulatory signal required for IL-2 production, T-cell prolif- 
eration, and effector functions such as T-cell-mediated cytotox- 
icity and differentiation of Th cells into Thl or Th2 subsets (for 
recent reviews, see Refs. 1-3). This CD28/CD80 interaction has 
also been shown to prevent anergy and to boost anti-tumor 
immunity (4-6). 

Sequence comparisons between human, rat, mouse, and 
chicken CD28 cytoplasmic domains (7-10) demonstrates high 
interspecies conservation, suggesting a crucial role for this 
domain in coupling to signal transduction pathways. In the 
absence of catalytic motifs in this sequence, an indirect cou- 
pling via adaptor molecules was the most likely mechanism of 
action. Indeed, we and others have demonstrated previously 
that ligand stimulation of the human CD28 molecule induces 
its association with PI 3-kinase activity (11-15) by means of a 
cytoplasmic YMNM motif at position 173-176 which, when 
phosphorylated. interacts with the SH2 domains of the p85 
adaptor subunit. Similarly, the SH2 domain of the adaptor 
protein Grb-2 has been shown to interact with this motif al- 
though with a lower affinity (16). and the CD28-associated 
Grb-2*Sos complexes are likely to link the activated CD28 re- 
ceptor to the activation of p2K** and downstream events such 
as Raf-1 hyperphosphorylation and ERK2 stimulation (17), as 
well as Jun kinase activation (18). 

The primary events leading to CD28 phosphorylation are 
becoming better understood. The T-cell-specific protein-tyro- 
sine kinase ITK has been shown to associate with CD28 and to 
be phosphorylated on tyrosine residues sifter CD28 stimulation 
(19), and the Src-related tyrosine kinases pSB'** and p59^'' 
have been found in CD28 immiine complexes from stimulated T 
cells (20). Recently, it has been shown that CD28 is phospho- 
rylated by pSS'*"* and p59^ in vitro leading to the recruitment 
of ITK. Grb-2, and p85 (21). Interestingly, the pattern of tyro- 
sine phosphoproteins induced by a CD28 stimulation is similar 
but not superimposable to that induced by a CD3-TcR stimu- 
lation (22, 23) and. among the identified products, are p36-38, 
p95*"**', and PLC-7I as well as a CD28-speciric 64-kDa protein 
which has yet to be formally identified (reviewed in Ref. 24). 

Using a murine T-cell hybridoma transfected with the hu- 
man CD28 gene, we have shown previously that a point muta- 
tion of the Tyr*^^ residue into phenylalanine abolished CD28- 
induced IL-2 secretion, suggesting that the PI 3-kinase 
pathway plays a major role in the CD28 function (12). Here we 
report the generation and functional characterization of a set of 
intracytoplasmic variants of the human CD28 molecule. We 
have generated mutants of CD28 containing progressive trun- 
cations of its intracytoplasmic tail (10. 21, 30, and 41 residues), 
as well as a point mutation of the tyrosine residue at position 
200. These variants were expressed in a murine T-cell hybri- 
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doma. By analyzing stable transfectants. we investigated 
whether these molecules were able to mediate cell adhesion to 
human CD80-transfected L-cells» to be phosphorylated, bind 
and activate PI 3-kinase, and to costimulate IL-2 production 
together with CD3-TcR. 

EXPERIMENTAL PROCEDURES 

Cells and mAbs — ^DC27. 1 used for transfection is a murine T-cell 
hybridoma derived by transfecting the TcR a/3 genes of KB5C20 in 
DOl 1.10.2 (kindly provided by B. Malissen. CIML. France). These cells 
were grown in Dulbecco's modified Eagle's medium containing 10% 
fetal calf serum, sodium pyruvate (1 mM), 0-mercaptoethanol (50 ^im). 
and antibiotics (penicillin-streptomycin, 10 lU/ml). supplemented by 
xanthine (250 /xg/ml). hypoxanthine (13.6 ^g/ml). and mycophenolic 
acid (2 fig/ml). LTK~ and LB7'*' cells are L cells respectively untrans- 
fected or transfected by a CD80/B7-containing expression vector.^ The 
human CD28 mAbs. CD28.1. CD28.2. CD28.3. CD28.5. CD28.6. and 248 
used in this study have been described previously (25). CUE. 4 and 
6A11.2 (anti-human IgGl and IgM, respectively) were derived in the 
laboratory and used as negative isotypic controls. The anti-murine 
mAbs were, respectively, 145-2C11 (a hamster IgG specific for CD3-€ 
chain) and 37.51 (specific for murine CD28. Pharmingen. San Diego, 
CA). 

OligonucleoUde-directed MutagenesJs—The human CD28 cDNA 
(kind gift of B. Seed. Ref. 7) was cloned into the SaR-BamHl open vector 
pBIuescript KS"^, We constructed deletion mutants using the "Muta- 
Gene Phagemid fn Vitro Mutagenesis Kit" (Bio-Rad). The point muta- 
tion of the Tyr^°® residue to phenylalanine was realized with the over- 
lap extension technique. The double mutant Tyr*^^^"*' was obtained by 
using these primers on the cDNA template containing the mutation 
Xyr*^^ -» Phe*'^ (12). The sequences of the oligonucleotides used are 
available upon request. Sequencing of mutated molecules was per- 
formed using Sequenase 2.0 (U. S, Biochemical Corp.). 

Piasmid Construction and Transfection — ^Wild type and mutated 
CD28 cDNA constructs were cloned into pH^APr-l-neo (26) at SaW 
BamHl sites, and recombinant genes were introduced by protoplast 
fusion into DC27.1 as described (27). Stable transfectants were selected 
for their resistance to 3 mg/ml geneticin G418 (Life Technologies. Inc.) 
and screened for CD28 expression by flow cytometry analysis. 

Flow Cytometry Analysis — 2 x 10* cells were incubated with satu- 
rating concentrations of mAbs at 4 'C for 1 h. After extensive washing, 
cells were stained with fluorescein isothiocyanate-conjugated goat anti- 
mouse Ig at 4 *C for 30 min (Jackson Laboratories. West Grove. PA). 
Samples were analyzed by flow cytometry using a FACScan (Becton 
Dickinson). Fluorescence data were collected with logarithmic 
amplification. 

Adhesion Assay — 4 x 10* transfected cells loaded with calcein AM 
(Molecular Probes. Eugene. OR) were added to 10^ LTK~ or LB7* cells 
seeded the day before in a microtiter plate, in the absence or presence 
of mAb CD28.2 in PBS without Ca^"^ and Mg^"^. Adherent cells were 
analyzed by the quantification of fluorescence (excitation at 485 nm and 
emission at 538 nm) by fluorimetry (Fluoroscan). 

Measurement of IL-Z Secretion — 10^ transfected cells were stimu- 
lated for 24 h at 37 'C in microtiter plates with various stimuli. 5x10* 
un transfected (LTK~) or CDSO-transfected (LB7'*^) L cells were used for 
stimulation of transfectants. Negative (anti-CD5) and positive (anti- 
CD3) controls were respectively purified mAbs CUE. 4 and 145-2C11 
used at a final concentration of 10 /xg/ml in combination with FcR^ B 
lymphoma cells LK35.2 (10^ cells). Soluble CD28.2 mAb (30 /xg/ml) was 
used in combination with soluble 145-2C1 1 (10 /xg/ml). Culture super- 
natants were collected and titrated, by serial 2-fold dilutions, for their 
ability to support proliferation of the lL-2-dependent murine T cell line, 
CTLL-2. as assessed by the cell growth determination 3-(4,5-dimethyl- 
Chiazol-2-yl)-2,5-diphenyltetrazolium bromide kit (Sigma). Results were 
expressed as Asjq „,„ obtained for each dilution of the supematants. 

PI 3'Kinase Assay — 1 0^ transfected cells were either unstimulated or 
activated by CD28.2, then immunoprecipitated with protein G-Sepha- 
rose beads (Pharmacia Biotech Inc.). or with a p85 antiserum (UBI). 
Measure of CD28-associated PI 3-kinase activity was performed as 
described in Ref. i2. 

Association of p85 Subunit and CD28 Phosphorylation— XQ'' cells 
were stimulated by a fibroblast cell line transfected (LB7'*') or not 
(LTK-) by the CD80/B7 cDNA. with or without CTLA-4Ig. Western 
blotting of p85-associated CD28 molecules was performed as described 
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in Ref. 12, Blots were then stripped and reprobed with an anti-phos- 
photyrosine antibodies (4G10. UBI). Integrated signal intensity of phos- 
phorylated proteins was determined using the Biolmage System (Mil- 
lipore) and expressed as arbitrary units. Values were normalized by 
integration of the actual amount of CD28 molecule detected by Western 
blotting for each mutant. Fold induction corresponds to the ratio of 
CD28 tyrosine phosphorylation in cells stimulated by CD80^-L cells 
versus unstimulated cells. 

Anti'phosphotyrosine Immunoblotting — ^Whole cell lysates from 2 X 
10*^ stimulated cells were run on SDS-PAGE and transferred to polyvi- 
nylidene difiuoride membranes as described in Ref. 12. Blots were then 
probed with an anti-phosphotyrosine antibodies (4G10, UBI), and im- 
munoreactive proteins were visualized by ECL. 

Binding ofCD28 Peptides to Purified p85 Domains— CB2B peptides 
(residues 166-180. 166-180 with phosphorylated Tyr»". 186-202. 
186-202 with phosphorylated Tyr^ were coupled on Actigel ALD 
beads (Sterogene). 15 /xl of beads were incubated with 5 /xg of either 
GST. GST-C-SH2. or GST-SH3 purified proteins (kind gift of Ivan Gout) 
for 4 h at 4 *C. Precipitates were then extensively washed, resuspended 
in sample buffer, denaturated by 3 min boiling, then run on 10% 
SDS-PAGE. and silver-stained. 

D-3 Phosphoinositide Labeling. Extraction, and HPLC Separa- 
tion — 2 X 10" transfected cells were labeled with 1 mCi of 
[^^Plorthophosphate (8500-9120 Ci/mmol. DuPont NEN) as described 
(28. 29). Following the labeling procedure, cells were washed and stim- 
ulated by the addition of CD80"**-L cells. Cell contact was achieved by 
low speed centrifugation in a microcentrifuge for 5 s. Phospholipids 
were extracted as described (29. 30). The samples were deacylated and 
analyzed by anion exchange high performance liquid chromatography 
(HPLC) as described (28). 

RESULTS 

Binding of B7/CD80- trans fected Cells and CD28 mAbs to 
CD28 Intracytoplasmic Deletion Mutants Transfected into a 
Murine Hybridoma — ^The high conservation of the CD28 cyto- 
plasmic domain among various species (human, rat, mouse, 
chicken) suggested its major role in signal transduction. None- 
theless, in the absence of a recognizable catalytic domain 
within these sequences, an indirect coupling via adaptor mol- 
ecules was suspected. We truncated 10 (de7 lOi, 21 {del 2f), 30 
(cte7 30i C- terminal residues, respectively, or the whole intra- 
cellular domain {del 41) (Fig. I A) and stably transfected these 
various constructs into the murine T-cell hybridoma DC27.1. 
Fig. IB shows CD28 expression profiles for one clone represent- 
ative of each transfection. after staining with the CD28.2 mAb. 
Deletion of 10, 21. 30. or 41 residues did not prevent surface 
expression of the transfected molecule (Fig. but for the del 
41 mutant, the mean fluorescence intensity was 7-fold lower 
than that observed for wild type CD28 (49 and 370. respective- 
ly). We also tested these cells for staining with a panel of 5 
distinct mAbs: CD28.1. CD28.4. CD28.5, CD28.6. and 248 iden- 
tifying at least 4 distinct epitopes on the CD28 molecule (25). as 
well as for binding of a B7-Ig fusion protein, and they all 
stained the various CD28 deletion mutants (not shown). 

CD28 is an adhesion molecule since CD28/CD80 interaction 
allows cell adhesion (31). Using L cells transfected with human 
CD80. we show that wild type CD28-expres5ing cells bound to 
huCD80'*' cells (LB7^, 34.5% of binding) but not to untrans- 
fected cells (LTK"). In addition, this binding was inhibited by 
the addition of the human mAb CD28.2 (Fig. IQ. The del 10 
and del 30 transfected mutants were still able to bind 
huCD80'^-L cells with almost similar efficacy to wild type 
CD28. We previously reported the involvement of the tyrosine 
residue at position 173 in the activation of the PI 3-kinase 
pathway (12). Fig. IC shows that this mutation did not affect 
CD28/CD80 interaction. Altogether, these data indicate that all 
deleted CD28 molecules still bind CD28 mAbs and B7-Ig and. 
in addition, are equally able to mediate the CD28/CD80 inter- 
action showing that their extracellular structure was not 
modified. 

The 10 C-terminal Residues of CD28 Are Required for IL-2 
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Fic. 1 . Intracytoplasmic truncations of the human CD28 mol- 
ecule. A, deletion mutants were produced by replacing original codons 
by stop codons {arrow^ using oligonucleotide-directed mutagenesis. 
Sequencing of mutated molecules before transfection was peiformed 
according to the classical dideoxy method. B. one clone representative of 
each transfection (wild type or deleted CD28 molecules) was analyzed 
by flow cytometry after staining with the CD28.2 mAb. These fluores- 
cence histograms were compared with staining of the untransfected 
murine T-cell hybridoma. DC27.1, C, adhesion assay was performed 
using untransfected {LTK-) or CD80*-L cells in the absence {LB7-^), or 
presence of the CD28.2 mAb {LB7+/CD28.2i. 

Secretion — We have previously shown that a point mutation of 
the tyrosine residue at position 173 abolished both PI 3-kinase 
binding and IL-2 secretion in a murine T-cell hybridoma trans- 
fected with human CD28 (12). Here we investigated whether 
other regions of the CD28 cytoplasmic domain were required 
for late events of activation. We therefore tested whether CD28 
stimulations (huCD80'^-L cells or CD28 mAb in combination 
with CDS mAb) could induce IL-2 secretion in transfected cells. 
Fig. 2 shows that both stimulations induced IL-2 secretion in 
cells transfected by the wild type CD28 construct {upper pane!), 
while IL-2 production was severely altered in del 10 cells what- 
ever CD28 stimuli was used {lower panel}. Similar data were 
obtained with cells transfected by molecules truncated by 21, 
30, and 41 residues (not shown). By contrast, cross-linked CD3 
stimulation resulted in strong IL-2 secretion in all these clones. 

Coupling of Deleted CD28 Molecules to Phosphatidylinositol 
3-Kinase and Tyrosine Kinases — ^We and others have shown 
previously that, upon stimulation, the human CD28 molecule 
was able to associate with a PI 3-kinase activity. This associ- 
ation involves the SH2 domains of the p85 subunit interacting 
with a YMNM motif present in the CD28 cytoplasmic domain. 
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Fig. 2. Function of wild type and deleted CD28 molecules. 

Transfected cells were stimulated by cross-linked CDS {closed circle^ or 
CD3 {open circle^ mAbs as negative and positive controls, respectively. 
CD28 stimulations were performed with CD80'*^-L cells {triangle^, or 
soluble C028 mAb in combination with soluble CDS {closed square^. 
Supematants were collected after 24 h of stimulation and titrated by 
serial dilutions for their ability to support proliferation of the IL-2-de- 
pendent cell line. CTLL-2. Results are expressed as A^-jq obtained for 
each dilution of the supernatants and correspond to the proliferation of 
CTLL-2 as assessed by the cell growth determination 3-{4,5-dimethyl- 
thiazol-2-yl)<2.5-diphenyltetrazolium bromide reagent. Stimulation by 
soluble CD28 or CDS mAbs on their own did not induce a significant 
IL-2 production. 

Here we tested gradually truncated CD28 molecules for their 
ability to associate with p85 and PI 3-kinase. Upon CD28 mAb 
stimulation PI 3-kinase activity associated with the wild type 
CD28 molecule (Fig. 3. lane ^. while a deletion of 30 C-termi- 
nal residues including the p85 binding site completely abol- 
ished this coupling {lane S^, Interestingly a deletion of only the 
10 last residues was also able to inhibit 90% of the PI 3-kinase 
activity coupled to the CD28 molecule {lane 4^. 

Since the total immunoprecipitable PI 3-kinase activity was 
equivalent in all these cells (data not shown), the observed 
defect in PI 3-kinase activity could be explained either by the 
inability of truncated molecules to activate the enzyme or by 
their failure to associate with its p85 adaptor subunit. p85 
Western blotting of CD28 immunoprecipitates revealed that 
deletion of 10 C-terminal residues decreased the CD28/p85 
association by more than 90% while a deletion of 30 C-terminal 
amino acids including residues 173-176 completely abolished it 
(Ref 12 and data not shown). 

We also examined the ability of other transducing pathways 
to associate with CD28 deletion mutants. A rise in Ca^"^ re- 
flecting PLCyl activation was detected in cells expressing ei- 
ther wild type CD28 or del 10 mutant upon stimulation by CDS, 
as well as by CD28 mAbs (data not shown). CD28 and CD3 
stimulations induce the tyrosine phosphorylation of specific 
substrates (17. 22. 23). A 2-min stimulation of both WT and del 
10 transfected cells by CDS mAbs induced the tyrosine phos- 
phorylation of several substrates, the two most prominent 
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Fig. 3. Association of CD28 mutants 
with phosphatidylinosltol 3-klnase. 

Each transfected clone, left unstimulated 
{lanes 1. 3, 5, 7, and ^ or stimulated with 
the CD28.2 mAb {lanes 2, 4, 6, 8, and iCJ, 
was tested for its ability to associate with 
PI 3-kinase. Immunoprecipitation of 
CD28 molecules was performed using pro- 
tein G, and measure of PI 3-kinase activ- 
ity associated with the various CD28 mu- 
tants was performed as described under 
'Experimental Procedures." 
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bands corresponding to molecular masses of 100 and 36 kDa 
(Fig. 4, Janes 2 and 5 (32. 33)). CD28 stimulation led to a strong 
phosphorylation of two proteins of 95 and 64 kDa, the former 
being vav.^ As shown in Fig, 4. deletion of the 10 C-terminal 
amino acids did not prevent phosphorylation of these 
substrates. 

Binding ofCD28 Peptides to p85 C-SH2 and SH3 Domains— 
The cytoplasmic CD28 sequence contains two short proline-rich 
segments between residues 178-183 (Px^'aP) and 190-194 
(PA3tf*P) which might serve as docking sites for the SH3 domain 
of p85 (34). Deletion of 10 C-terminal residues (amino acids 
193-202) disrupts one of these proline-rich sequences, and this 
may account for the observed defect in p85 binding. To test this 
hypothesis and to determine if individual SH3 and C-SH2 
domains of p85 could bind directly to the C-terminal part of 
CD28 in vitro, we tested whether a 1 7-mer peptide correspond- 
ing to residues 186-202 of CD28 could interact with recombi- 
nant SH2 and SH3 domain fusion proteins. As shown previ- 
ously, interaction of p85 with the YMNM consensus site was 
strictly dependent upon tyrosine phosphorylation since a 15- 
mer phosphopeptide corresponding to residues 166-180 
strongly bound the C-SH2 domain of p85 (Fig. 5, lane 4) while 
a nonphosphorylated fonn of the peptide did not {Jane ^. In a 
non-phosphorylated form, peptide 186-202 did not interact 
with the C-SH2 domain (Jane ^ while it did bind the p85 SH3 
domain {Jane 7). Interestingly, this p85 SH3 domain binding 
was decreased when peptide 186-202 was phosphorylated at 
position Tyr^°° {Jane JO^. Despite the absence of a consensus 
p85 binding site in the phosphopeptide 186-202. a weak bind- 
ing of the p85 C-SH2 domain was observed however {Jane 9i. 

InvoJvement of Tyt^^* in CD28 SignaJing— To examine If the 
tyrosine residue at position 200 was involved in the PI 3-kinase 
pathway in vivo, we mutated it to phenylalanine and expressed 
the mutated construct in DC27.1 cells. Stable cell lines were 
analyzed for surface expression, CD80 binding. IL-2 secretion. 
p85 binding, and PI 3-kinase activation. Fig. GA shows that 
point mutation of Tyr^*^ -» Phe^°° inhibited CD28-assoclated 
PI 3-kinase activity but did not abolish it {Jane 4). Western 
blotting of p85 demonstrated that this impairment was due to 



^ S. Klasen. F. Pagfes. J. F. Peyron. D. Cantrell. and D. Olive, manu- 
script in preparation. 
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Fig. 4. CD28-induced tyrosine phosphorylations in wild type 
and del 10 CD28 transfectants. Wild type and del 10 transfected cells 
were stimulated with CD3 {3, Janes 2 and 5) and CD28 {28, ianes 3 and 
6) mAbs and goat anti-mouse Ig antiserum for 2 min. or left unstimu- 
lated {NS, lanes J and 4). Whole cell lysates were separated on SDS- 
PAGE, transferred onto polyvinylidene difluoride, and probed with 
anti-phosphotyrosine monoclonal antibody (4G10) as described under 
"Experimental Procedures." 

a decrease in the quantity of CD28-associated p8S (not shown). 

It has never been proven that PI 3-kinase association with 
CD28 was necessary for activation of the enzyme. We have 
therefore tested CD28-induced accumulation of D-3 phosphoi- 
nositides in the transfectants expressing wild type or mutated 
(Tyr*^^. Tyr^^ CD28 molecules. In wild type transfectants. B7 
ligation induces a transient accumulation of PtdIns(3,4,5)P3 
(Fig. 6B). However, point mutation of the Tyr*"^^ residue com- 
pletely abolished CD28-induced PI 3-kinase activation as as- 
sessed by PtdIns(3.4.5)P3 accumulation. In contrast, a point 
mutation of the Tyr^°° residue, however, only delayed and 
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Fig. 5. Binding of CD28 peptides to purified p85 SH2 and SH3 
domains. Peptides corresponding to CD28 residues 166-180 {lanes 1 
and 2^ including a phosphorylated Tyr'" {lanes J and 0 and residues 
186-202 {lanes 5-7) with phosphorylated Tyr^"* {lanes 8-10) were 
coupled on beads and used to precipitate recombinant GST {lanes 1, 3. 
5, and 8). GST-C-SH2 {lanes 2. 4, 6. and 9). and GST-SH3 {lanes Zand 
10) p85 fusion proteins. Precipitates were run on 10% SDS-PAGE and 
revealed by silver staining. 

attenuated its activation. Thus, the defect in the PI 3-kinase 
activation observed in the DYF200 transfectant occurs at the 
level of p85 association. 

We also examined the ability of p85 C-SH2 domain fusion 
proteins to precipitate wild type or mutated CD28 molecules 
following ligation. Fig. 7A shows that upon ligation by 
CD80^-L cells, the GST-p85 C-SH2 fusion protein precipitated 
wild type CD28 {Jane 2). while a point mutation of Tyr^^^ 
strongly decreased this interaction {Jane 8^. Mutation of Tyr^°° 
did not prevent CD28 interaction with p85 C-SH2 domain {Jane 
5). Mutation of both Tyr"^ and Tyr^°° to phenylalanine abro- 
gated most of the CD28 ability to be recognized by the C-SH2 
domain of p85 {Jane It), 

CD28 is tyrosine-phosphorylated upon activation (12, 21) 
and mainly on the Tyr"" residue (21). Mutation of the Tyr^°° 
residue did not prevent CD28 phosphorylation, while point 
mutation of Tyr'^^ strongly decreased it. The double mutant 
Tyr'^^~^°** lost most of its ability to be tyrosine-phosphorylated 
after stimulation (Fig. IB). Hence, CD28 phosphorylation is 
further decreased by a double point mutation. 

The tyrosine residue at position 200 is therefore involved in 
PI 3-kinase binding and activation. We have also tested its role 
in CD28 function. Fig. 8 shows that CD28 mAbs in combination 
with CDS mAbs induced IL-2 secretion although to a lesser 
extent than wild type CD28. In contrast, stimulation by 
CD80^-L cells did not induce IL-2 secretion (Fig. 8) while 
binding to CD80^-L cells was retained (not shown). 

DISCUSSION 

In this report, we studied the structural requirements of the 
cytoplasmic domain of human CD28 for its signaling. For this 
analysis, the wild type CD28 molecule and various cytoplasmic 
mutants (deletion of 10. 21, 30. and 41 amino acids, or point 
mutation of Tyr^^ — > Phe) were expressed into the murine 
T-cell hybridoma DC27.1. We have shown previously that 
transfection of the full-length CD28 cDNA in these cells al- 
lowed surface expression of functional molecules which induce 
either early or late events of T-cell activation (27). Flow cyto- 
metric analysis showed that deletion of 10, 21. or 30 residues 
did not affect the cell surface expression of CD28. Deletion of 
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Fig. 6. Point mutation of the Tyi^** -» Phe^****. A, wild type and 
mutated transfected cells were left unstimulated {lanes 1 and 3) or 
stimulated with the CD28.2 mAb {lanes 2 and 4) before immunoprecipi- 
tation with protein G. PI 3-kinase activity associated with CD28 was 
analyzed as in Fig. 3. B. PtdIns(3,4.5)P3 generation. Wild type {open 
circle^ or mutated (Tyr'^^. open squares. Tyr^^. closed circle^ CD28 
transfectants were labeled with [^^P]orthophosphate as described un- 
der "Experimental Procedures." Cells were then stimulated by the 
addition of CD80'*'-L cells. Phospholipids were extracted and analyzed 
by HPLC as described in Ref. 28. 

the whole intracellular domain, however, impaired the expres- 
sion of the construct. This observation has previously been 
reported for mutational analysis of the human CD2 molecule 
and could be explained by a partial instability of the molecule 
due to the removal of positively charged amino acids which are 
responsible for transmembrane stabilization (35). 

After ligand binding and dimerization, many growth factor 
receptors phosphorylate several substrates on tyrosine resi- 
dues leading to a cascade of signaling events. The antigen- 
binding T-cell receptor does not possess intrinsic enzymatic 
activity, and its coupling to the cellular signaling machinery is 
mediated by adaptor molecules. Mutagenesis studies of several 
molecules involved in T-cell functions (CD3 C chain, CD2) have 
identified cytoplasmic consensus motifs which couple these 
receptors to early events of T-cell activation. The ITAM motif 
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Fig. 7. Association of CD28 molecules with the C-SH2 domain 
of p85 and CD28 phosphorylation. A, cells transfected by wild type 

{Janes 1-3), Tyr^™* [Janes 4-6). Tyr'" {Janes 7-S), or double-mutated 
'j'yj.173-200 y^jjgs JO- J 2) CD28 molecules were stimulated by CD80^-L 
cells in the absence {Janes 2, 5, 8. and J 1) or presence {Janes 3. 6, 9. and 
J 2) of CTLA-4/Ig or left unstimulated {Janes J. 4, 7, and JO^. Whole cell 
lysates were precipitated using a p85 C-SH2 fusion protein. Precipi- 
tates were run on SDS-PAGE. and transferred onto polyvinylidene 
difluoride. Membranes were blotted using CD28.6 mAbs and revealed 
by ECL. B, blots were stripped and reprobed using an anti-phosphoty- 
rosine mAb, and bands were quantified using the Biolmage System 
(Millipore). Fold induction corresponds to the ratio of CD28 phospho- 
rylation obtained in cells stimulated by CD80'^-L cells versus unstimu- 
lated cells. 
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Fig. 8. IL-2 secretion in DYF200 transfectant. Cells transfected 
by mutated CD28 molecule were stimulated by cross-linked CDS or 
CD3 mAbs (c/osec/and open circJes. respectively), by soluble CD28 mAbs 
in combination with soluble CDS {cJosed square^ or by CD80'*'-L cells 
{triangle^. Supematants were collected after 24 h of stimulation and 
analyzed as described in Fig. 2. 



Q{xx{\fL))2, present in several subunits of the CD3 complex 
(36-38) couples the T-cell receptor to tyrosine kinase activa- 
tion. Sequence comparison of CD28 with these molecules failed 
to identify any common motifs. Nonetheless, analysis of cyto- 
plasmic sequences from human, mouse, rat, and chicken CD28 
(7-10) showed high interspecies sequence similarity, suggest- 
ing a role for this domain in signal transduction. Functional 



characterization of clones carrying mutations of CD28 confirms 
that the CD28 cytoplasmic donnain plays a major role in signal 
transduction. We show that deletion of the 10 C- terminal 
amino acids severely impairs IL-2 secretion induced by a CD28 
stimulation. This impairment was not merely due to a modifi- 
cation of CD28 extracellular structure, since all epitopes rec- 
ognized by 6 different CD28 mAbs (25) were retained on the 
various deleted molecules, and since these transfectants were 
equally able to bind to B7-Ig and CDSO-transfected L cells. 
Furthermore, cells carrying a deletion of 10 C-terminal resi- 
dues were able to exhibit wild type levels of calcium mobiliza- 
tion as well as tyrosine phosphorylation of cellular substrates 
in response to CD 28 stimulation. This suggests that the most 
C-terminal region of CD28 (residues 193-202) is crucial for the 
coupling of this receptor to IL-2 secretion. Interestingly, this 
region of CD28 is also involved in PI 3-kinase binding and 
activation. Together with the previously described loss of CD28 
function following mutations of the PI 3-kinase binding site at 
residues Tyr*^^ (12) and Met*^® (39). our data argue for the 
major role of this enzyme and/or its associated molecules in 
coupling the CD28 receptor to the cellular events leading to 
IL-2 secretion. 

Upon ligand interaction, CD28 becomes tyrosine-phosphoryl- 
ated and associates with p85 via a *^^YMNM motif present in 
its cytoplasmic domain since a point mutation of Tyr*^^ com- 
pletely abolished p85 binding to CD28 (12. 13. 20). Recently. 
Raab et al (21) have shown that p56'''* and p59^" can phos- 
phorylate the Tyr^^^ residue of CD28 in vitro. Interestingly, 
mutation of this residue did not completely abolish CD28 phos- 
phorylation denoting the presence of otfier phosphorylation 
sites (21). Here we show that a deletion of 10 C-terminal 
residues greatly diminished the ability of CD28 to bind PI 
3-kinase without affecting other signaling pathways such as 
PLCyl activation and tyrosine phosphorylations. Within this 
region, we have further identified two putative motifs involved 
in p85 binding. The first is a short proline-rich region (residues 
190-194). and the second a tyrosine residue at position 200. We 
mutated this tyrosine residue (Tyr^^ and confirmed its in- 
volvement in p85 binding and PI 3-kinase activation Jn vJvo. 
Interestingly, deletion of the last 10 amino acids and point 
mutation of Tyr^^ only decreased PI 3-kinase binding to CD28. 
A low, but detectable, amount of the p85 still associated with 
mutated CD28. Furthermore, Jn vitro binding experiments 
showed that while the binding of the p85 SH2 domain to this 
Tyr^°** residue was dependent upon its phosphorylation, it was 
weak compared to binding to the *^^YMNM motif. This obser- 
vation was not surprising since Tyr^°° is not located within a 
consensus binding site for SH2 domains of p85 (YjoM. Ref. 40). 
Two alternative non-consensus binding sites. WAV (41) and 
YVNA (42), have also been described as novel p85 recognition 
motifs in the tyrosine kinase receptors HGF-R and Flt-1. 
respectively. 

The results we present here demonstrate that two regions in 
the intracytoplasmic domain of CD 28 are Involved in PI 3-ki- 
nase binding, one corresponding to the consensus p85 binding 
site *^^YMNM and another one at the C terminus of the mol- 
ecule (residues 193 to 202) including tyrosine 200 within a 
non-consensus p85 binding site. Although the CD28 *^^YMNM 
motif is sufficient to associate with p85 since individual N- or 
C-SH2 fusion proteins can coprecipltate CD28 after CD28-B7 
interaction and a 15-mer CD28 peptide including phosphoryl- 
ated Tyr^^^ precipitates PI 3-kinase from cell lysate (not 
shown), we propose that the two SH2 domains of p85 act in 
concert to associate with two distinct tyrosine residues of CD28 
in vivo. The first is present within the consensus sequence 
^^^YMNM and the other at position 200 is a non-consensus 
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binding site. This additional domain could either increase the 
affinity of p85/CD28 interaction or, alternatively, it could bind 
an adaptor molecule which interacts with p85. Although Tyr'^^ 
is the major phosphorylation site in the CD28 cytoplasmic 
domain, our data support the hypothesis that Tyr^°° is also 
phosphorylated upon CD28 stimulation even though we do not 
directly demonstrate it. The observation that CD28 phospho- 
rylation is further decreased by a double point mutation sug- 
gests that either the level of Tyr^°° phosphorylation is too weak 
to be detected in the presence of Tyr*^^ phosphorylation, or, 
alternatively, that Tyr*^^ plays a role in Tyr^°° phosphoryla- 
tion, for instance by recruiting a tyrosine icinase. A third hy- 
pothesis is that Tyr^°°, although not being a direct target for 
phosphorylation, may be involved in the phosphorylation of 
other tyrosine residues of CD28 (residues 188 and 191). 

Our in vitro binding experiments also showed that a nonphos- 
phorylated peptide corresponding to the 17 C-terminal amino 
acids of CD28 also interacts with a purified GST-SH3 domain, 
probably through an interaction with the short CD28 proline-rich 
segment at residues 190-194 (Pa:xPP). Unexpectedly, phospho- 
rylation of this peptide at position Tyr^°° decreased SH3 binding. 
The functional significance of this is at present unknown, and the 
in vivo relevance of this interaction has not been established 
since CD28 only associates with p85 after stimulation (12). This 
SH3/proline-rich interaction may. however, increase the affinity 
of p85 SH2 domains binding to CD28. 

It is noteworthy that IL-2 secretion induced by a CD28 mAb 
costimulation was reduced, but not abolished, by mutation of 
Tyr^°° whereas it was completely impaired in the del 10 mutant. 
One explanation for the inability of deleted CD28 molecules to 
couple to the IL-2 secretory pathway is that a deletion of 10 
C-terminal residues is sufficient to disrupt the CD28 intracyto- 
plasmic structure. Nonetheless, this hypothesis is unlikely since 
other second messengers such as Ca^"*" rise and tyrosine phos- 
phorylation of cellular proteins still occur upon CD28 activation. 
An alternative explanation is that, although PI 3-kinase is cru- 
cial for CD28 function, it is not the only transducing pathway 
involved in the coupling of CD28 to IL-2 secretion. Indeed, other 
enzymes such as sphingomyelinase have been reported to be 
involved in the CD28 costimulatory function (43), eind their cou- 
pling to CD28 might also involve the C-terminal domain of the 
molecule. Consistent with this hypothesis is the observation that 
CD28 can function independently of PI 3-klnase, eind that the 
CD28/PI 3-kinase association is not sufiicient to mediate the full 
costimulatory function of CD28 (44). 

Several reports have shown that SH3 domains of v-Src (45), 
pSQ^" (46). or p56^*'* (47) could interact directly with the p85 
subunit of PI 3-kinase, probably through proline-rich motifs 
identified at positions 88-97 and 299-309. This mechanism of 
PI 3-kinase coupling increases the complexity of possible inter- 
actions between transducing proteins. Yet another mechanism 
that might account for PI 3-kinase coupling to CD 28 may 
involve indirect binding of p85 through an interaction with 
SH3 domains of one of these kinases previously coupled to 
CD28 via its C-terminal part. It has recently been shown that 
CD28 was phosphorylated by the Src-related protein-tyrosine 
kinases pSG'*^* and p59^ in vitro, and that this phosphoryla- 
tion could increase the binding of p85. Grb-2. and ITK (21). In 
absence of a consensus binding site for SH2 domains of these 
kinases (Ya^I/L), Ref. 40) in the intracytoplasmic domain of 
CD28. one of the questions remaining unanswered is how the 
Src kinases are recruited to the CD28 receptor after its 
stimulation. 
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The 50-kD CD2 (TU) molecule, originally defined as the sheep erythrocyte receptor, 
plays an important role in T lymphocyte activation as well as in facilitating adhesion 
between T lymphocytes and their cognate partners (1-5). Perturbation of the ex- 
tracellular domain of CD2 by its ligand, LFA-3, or certain anti-CD2 mAbs provides 
signals that synergize to augment TCR-mediated stimulation (6, 7) and initiates 
a rapid turnover in polyphosphoinositide accompanied by an increase in cytosolic 
free calcium concentration (8, 9). In addition, this same combination of anti-CD2 
antibodies (with or without LFA-3) induces lymphokine production from, and clonal 
expansion of, resting T lymphocytes (10). 

Recently, the primary structures of human and murine CD2 were deduced from 
cDNA and genomic cloning (11-17). The predicted structure in both species is a type 
I integral membrane protein. In man, the mature protein consists of a hydrophilic 
185 amino acid extracellular segment, a hydrophobic 25 amino acid transmembrane 
segment and a 117 amino acid cytoplasmic domain. The highest degree of homology 
between human and murine species is found in the cytoplasmic domain (59% at 
the amino acid level) and in both species the cytoplasmic domain is unique with 
respect to its multiple proline (21%) and basic residues and by secondary structural 
predictions, suggesting that it has an extended nonglobular conformation. Thus, 
unusual but conserved features among species including man, mouse, and rat sug- 
gest an important role for the CD2 segment in signal transduction (11, 12, 18). For 
this reason, we have performed a series of deletion mutation studies on the CD2 
cytoplasmic domain in the present report. 

Materials and Methods 

Construction of Truncated and Mutated Human CD2 Molecules and Transfection into a Murine T 
Cell Hyhridoma, lb construct truncated CD2 molecules, the human CD2 cDNA, PB2 (11), 
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was digested with restriction enzymes Hph I, Ban II, Fok I, Stu I, and Ava I, respectively, 
and blunted with T4 DNA polymerase. The Hph I and Ban II digests were ligatcd to the 
linker 5'-CTAAGGATCCTTAG-3; while the Fok I, Stu I, and Ava I digests were ligated to 
the linker 5-TAAGGATCCTTA-3' to regenerate the last amino acid, introduce a termina- 
tion codon, and provide a Bam HI restriction recognition site. Subsequently, the DNAs were 
inserted into the Bam HI site of DOL vector (kindly provided by Dr. Thomas Roberts, Dana- 
Farber Cancer Institute). The substitution mutants were generated by oligonucleotide-directed 
in vitro mutagenesis as previously described (19). The full-length human cDNA PB2 insert 
was subcloned into BamHI site of M13 mpl8. The synthetic oligonucleotides used for muta- 
genesis were 5'-CAGGCACCTAGTGATGAGCCCCCGCCTCCT-3' for CD2 M271-2, which 
changes the wild-type sequence CATCGT (His-Arg) into GATGAG (Asp-Glu), and 5'- 
CCGCCTCCTGGAGATGAGGTTCAGCAGCAG-3' for CD2 M278-9, which changes the 
wild-type sequence CACCGT (His-Arg) into GATGAG (Asp-Glu). The Bam HI fragment 
of the replicative form of M13 mutant DNAs was subcloned into expressing vector DOL. 
The plasmids containing full-length or modified CD2 cDNAs were isolated and sequenced 
around the modified region by double-stranded sequencing before transfection by Ca^* 
precipitation into ^-2, a helper- free retrovirus packaging cell line. Both transiently expressed 
and permanent viral stocks were used to infect the murine T cell hybridoma, 3D054.8 (kindly 
provided by P. Marrack, National Jewish Hospital) (20) in the presence of 8 /ig/ml polybrene 
(Aldrich Chemical Co., Milwaukee, WI). Procedures for the growth of ^-2 cells, transfection 
of cells, harvest of virus, and infection of cells were performed as described (21). Selection 
was initiated 48 h after infection using 0.4 mg/ml G418 (Geneticin; Gibco, Grand Island, 
NY), and wells containing single colonies were expanded. The G418 resistant clones were 
screened by indirect immunofluorescence as described in the legend to Fig. 2, and positive 
clones were further sorted on the Epics V cell sorter. Cells were maintained in RPMI 1640 
(Gibco Laboratories) medium supplemented with 10% heat-inactivated FCS (Flow Labora- 
tories, McLean, VA), 50 fiM 2-ME, 1 mM sodium pyruvate, 2 mM L-glutamine, 1% penicillin- 
streptomycin (Whittaker, M. A. Bioproducts, Walkersvillc, MD) and 0.4 mg/nil G418. 

Flow Cytometric Anafysis. For cytometric analysis, ascites were used at a 1 *200 dilution and 
10^ cell incubated for 30 min at 4°C. After washing in RPMI 1640 with 2% FCS, bound 
antibodies were detected using a 1 'AO dilution of fluorescein-coupled goat anti-mouse IgG 
as a second antibody (Meloy, Springfield, VA). 10,000 cells were analyzed per sample on 
an Epics V cell sorter. Histograms represent the number of cells (ordinate) vs. logio fluores- 
cence intensity (abscissa). 

Immunoprecipitation ofCD2 MoUades. 10-20 x 10* cells were surface labeled with 1 mCi 
'251 (IMS 30; Amersham Corp., Arlington Heights, IL) for 15 min at room temperature 
using the lactopcroxidase method (11). Cell lysates were prepared in RIPA buffer containing 
0.15 M NaCl, 1% Triton X-100, 0.1% sodium deoxycholate, 1 mM NaF, 1 mM PMSF, and 
protease inhibitors. Cell lysates were preclcared once with a formalin-fixed Staphylococcus aureus 
suspension, then with Affigel protein A beads (Pharmacia Fine Chemicals, Piscataway. NJ) 
coupled to an irrelevant antibody (anti-CD8; 21Thy2D3) before overnight incubation at 4°C 
with the anti-CD2 (3T4-8B5) antibody coupled to beads. Immunoprccipitates were extensively 
washed with RIPA buffer and run on a 10% SDS-PAGE after treatment with 5% 2-ME. 
The gel was dried and the autoradiograph was exposed for 2 wk at - 70°C with intensifying 
screens. 

Measurement qfCytosolic Cu?* by Indo-l Fluorescence, Cytosolic Ca^* concentrations were de- 
termined according to Grynkiewicz ct al. (22). Briefly, 2 x 10« cells were loaded for 45 min 
at 37°C with 2 Mg/ml of the acetylmethyl ester of indo-1 (Molecular Probes, Junction City, 
OR) in 200 yX of RPMI 1640 plus 2% FCS. Cells were diluted 10-fold before analysis on 
an Epics V cell sorter. Upon Ca** binding, indo-1 exhibits changes in fluorescein emission 
wavelengths from 480 to 410 nm (22). The ratio of 410/480 nm indo-1 fluorescence was recorded 
vs. real time and expressed in arbitrary units. One arbitrary unit represents ~200 nM 
fCa2*]i. For each determination, the baseline was assayed by recording indo-l-loaded cells 
for 1 min. Anti-Tllj (lold24Cl) and anti-TUs (lmono2A6) ascites were added at a i:i00 final 
dilution. The Ca'* ionophore A23187 (Sigma Chemical Co., St. Louis, MO) was added at 
a 1 Mg/nil final concentration. 
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Modulation of CDS MoUcuUs. To modulate the murine CDS molecule from the surface 
CD2 FL, cclb were incubated overnight with the 145 2Cll-purifled antibody (20 in^fvnX) at 
37°C. Cells were washed twice and an aliquot of the cell was used to evaluate the effect of 
modulation on murine CD3 and human CD2 expression using a standard indirect im- 
munofluorescence assay (with the 145 2C11 and 3T4-8B5 mAbs» respectively). Modulated 
cells were loaded with indo-1 and cytosolic [Ca^^]i was followed upon various stimuli. Under 
such conditions, CD2 FL unmodulated cells or cells modulated vnth the anti-human CDS 
mAb (Leu4) showed a (Ca^^*]; comparable to Fig. 3 a (not shown) when triggered via CD2. 

Proliferation of CTLL-20 Cells, For quantitation of IL-2 production, 10* cells/well were in- 
cubated in 96-well round-bottomed plates for 24 h in the presence of either ovalbumin (1 
mg/ml final concentration) plus 10* A20-11 B lymphoma cells or antiTll2 + anti-Tlls (as- 
cites UlOO) or culture medium. Incubation with the nonstimulatory combination of anti- 
Tlli + anti-Til 2 antibodies did not induce any detectable IL-2 production, while stimula- 
tion with antiTlla + anti-Tllj antibodies resulted in clear IL-2 secretion by CD2 FL cells. 
Because addition of PMA (Sigma Chemical Co.) was found to induce a substantial increase 
in lymphokine production, 5 ng/ml final concentration of PMA was added to ail experimental 
samples including the media control. Subsequently, supematants were harvested and titrated 
in triplicate for their ability to support the growth of 10,000 CTLL-20 cells. Cultures were 
pulsed after 24 h incubation with 1 fiCi ['H]thymidine per well and harvested after an ad- 
ditional overnight incubation at 37**C over glass fiber filters on a Mash apparatus. Filters 
were dried and counted after addition of scintillation fluid on a /? counter. Results are ex- 
pressed as mean of triplicate determinations of cpm of ['HJthymidine incorporated. Stan- 
dard deviations were generally <5-10% and results are representative of five independent 
experiments. 

Results and Discussion 

To precisely chsiracterize the functional and structural relationship of the cyto- 
plasmic domain of the human CD2 molecule (Fig. 1 a\ a series of deletion mutation 
of the CD2 cDNA were produced, as shown in Fig. 1 A, encoding 98, 77, 43, 18, 
and -3 amino acids out of the 117 predicted cytoplasmic CD2 amino acid residues. 
Full-length cDNAs as well as modified cDNAs were obtained as described in Materials 
and Methods and inserted into the retrovirus expression vector DOL (Fig. 1 c) under 
the control of the MLV LTR promoter (24) and defective viruses were generated 
(21). The murine T cell hybridoma 3D054.8 cell line specific for ovalbumin in the 
context of the H-2 (I-A^) molecule (20) and lacking the human GD2 was then in- 
fected with these defective retroviruses. G418-resistant clones were selected in the 
presence of 0.4 mg/ml G41B and analyzed for surface CD2 expression using an in- 
direct immunofluorescence assay with anti-Tlli and anti-Tlla antibodies on an Epics 
V cell sorter. Multiple clones corresponding to each type of truncation and expressing 
clearly detectable levels of surface GD2 were selected for further characterization. 
Clones used in subsequent functional studies were designated based on the nature 
of their CD2 cDNA retroviral insert: CD2 FL resulted from retroviral infection with 
the fuU-length CD2 cDNA, whUe CD2 AC98, CD2 AC77, CD2 AG43, CD2 AC18, 
and GD2 AG-3 resulted from infection with retroviruses containing the entire extra- 
cellular and transmembrane segment of CD2 but only 98 or fewer of the 117 cyto- 
plasmic residues. A representative pattern of reactivity with the anti-GD2 mAb is 
shown in Fig. 2 a. All of these transfectants express comparable levels of CD2 
('^5-10,000 copies/cell) except GD2 AG18, which expresses on the order of 50% the 
copy number. Tlie GD2 AG-3 transfectant expressed levels of GD2 comparable to 
GD2 AG18. However, the surface expression of GD2 on AG-3 was unstable and did 
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Figure 1. Production and anal- 
ysis of the human CI>2 cytoplasmic 
domain, (a) Comparison of pre- 
dieted transmembrane and cyto- 
plasmic domain sequences of hu- 
man and mouse CDIZ. Amino acid 
residues are designated in single- 
letter code with the transmembrane 
regions underlined. Identical resi- 
dues in human and mouse CD2 se- 
quences are starred. (&) Schematic 
structure of the transmembrane 
and cytoplasmic regions of human 
CD2 and variant molecules. Con- 
structs of full-length, deletion, and 
substitution mutants of CD2 are di- 
agrammed. The region most con- 
served between human and mouse 
CD2 is stippled, and the two re- 
peating PPPGHR segments are 
marked in black. The H denotes the 
histidine residues thought to form 
a putative binding site. The restric- 
tion sites that generate the trun- 
cated CD2 molecules are marked 
by arrows with numbers in paren- 
theses corresponding to amino acid 
residues, (c) Structure of the DOL 
retroviral expression vector. Se- 
lected restriction sites on the vector 
shown are: B, Bam HI; RI, £co Rl; 
RV, Eco RV; H3, Hind III. The 
vector contains two promoters: 
MLV-LTR to drive the expression 
of the CD2 cDNA and the SV40 
promoter to express the neomycin 
resistance gene (24). 



not allow functional analysis (data not shown). For each clone, similar reactivities 
to those obtained with anti-Til i were found using the anti-Tll2 antibody, while none 
of the unactivated clones was stained by the anti-Tils antibody (data not shown). 

lb prove that individual clonal recipients of the truncated CD2 cDNAs express 
appropriately sized CD2 proteins, immunoprecipitation and SDS-PAGE analysis 
of the corresponding '**I-Iabcled surface CD2 molecules was carried out. As ex- 
pected, no human CD2 was immunoprecipitated from the murine 3D054.8 cell 
line. In contrast, a 53-kD band was identified in SDS-PAGE analysis of clone CD2 
FL. Note that the 70 kD band in anti-CD2 precipitations is unrelated to CD2 as 
it is present in the control immunoprecipitates. Furthermore, parallel analysis of 
CD2 protein expressed by CD2 AC98, ^77, i^43, and AC18 revealed that the 
molecular weights of surface CD2 (51, 47, 43, and 40 kD, respectively) correlated 
well with the expected truncations (Fig. 2 h\ Thus, the truncated CD2 cDNAs in 
DOL direct protein synthesis of the variant CD2 forms on the surface of the murine 
3D054.8 cells. 

Given that pertubation of the external domain of the CD2 molecule with a combi- 
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Figure 2. Expression of human CD2 molecules on murine T cells, (a) Flow cytometric analysis 
of CD2 expression on murine T cells. Indirect immunofluorescence assays were carried out using 
the anti-TU i mAb (3T4-8B5) {thkk line) and compared with an irrelevant antibody (1HT4-4E5) 
(thin line) as background, {b) Immunoprccipitation of CD2 from lysates of iodinated cell lines. 
Immunoprecipitates were obtained from solubilized murine T cells using a nonspecific {NS) an- 
tibody (mouse anti-human CDS) or an anti-CD2 antibody directed against the Til i epitope 
and run under reducing conditions over an SDS 10% polyacrylamide gel (PAGE). The resulting 
autoradiogram is shown. A contaminant band of 70 kD (as well as material resolving at 200 
kD, not shown) was regularly detected in the autoradiognuns despite extensive predearing, as 
previously reported by others (20). 



nation of anti-Tll2 + anti-Tlls results in a rapid rise in [Ca^^]i linked to IL-2 gene 
induction (1, 9), we examined whether such a mitogenic combination of mAbs was 
effective in stimulation of CD2 FL as well as CD2 AC cell lines. Fig. 3 a shows an 
analysis of alteration in [Ca^*]i after stimulation with anti-Tllz + anti-Tlls in var- 
ious cells lines as measured with a calcium sensitive dye indo-1 (22) and flow cyto- 
metric analysis in real time. A clear rise in [Ca^*]i (~200 nM increment) was ob- 
served upon stimulation of GD2 FL, CD2 AG98 and CD2 AG77 cells. The calcium 
rise occurs within 2 min after adding the stimulating antibodies, most likely corre- 
sponding to the time required for expression of the Til 3 epitope etfter anti-Tll2 stim- 
ulation, a phenomena observed previously for human T lymphocytes (1). In con- 
trast, CD2 AC43 and CD2 AC18 clones were not triggered by anti-Tll2 + anti-Tlh 
antibodies. As expected, the nontransfected line 3D054.8 was also not stimulated. 
Given that an immediate [Ga^'*']i rise was observed after addition of the Ga^**^ iono- 
phore A23I87 (1 fig/ird, final concentration), it is clear that cells were loaded with 
the fluorescent dye. These data establish that a significant rise in [Ga^*]i can be in- 
duced through human GD2 structures expressed on the membrane of murine T 
cells even in the absence of the GOOH-terminal 40 amino acid residues of the CD2 
cytoplasmic segment. The present results with human GD2 are consistent with two 
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Figure 3. Elevation in intracellular free calcium mediated by CD2 stimulation, {a) Measure* 
ment of cytosolic Ca^^ by indo*l fluorescence. Indo>l fluorescence changes in real time histo- 
grams of T cell lines. Black and open arrows correspond to addition of anti^lU + anti-Tlls 
antibodies and calcium ionophore A23187, respectively. (^) Influence of CDS modulation on in- 
duction of (Ca^'^Ji elevation triggered through the human CD2 molecule. L<cft panel represents 
indirect immunofluorescence detection of murine CD3 and human CD2 molecule, respectively, 
on the surface of unmodulated CD2 FL {top) and modulated CD2 FL (AoCtom). The latter was 
incubated with the 145 2CU monoclonal antibody overnight (28). Right panel represents mea- 
surement of [Ca^*^]! vs. time in modulated CD2 FL cells after stimulation with anti-Tlla -f anti- 
Tlls antibodies (filled armtv) or calcium ionophore (open arrow). 



independent reports. He et al. (25) demonstrated a requirement for the cytoplasmic 
tail of rat CD2 to trigger a rise in [Ca^*^]! after transfection of rat CD2 in the human 
Jurkat T cell line. Bierer ct al, (26) demonstrated a requirement for the cytoplasmic 
tail of human CD2 expressed in murine T cells for an IL-2 response to liposomes 
containing both HLA-DR and LFA-3 molecules. 

Since human CD2 function in human T lymphocytes requires expression of the 
CD3-Ti octp complex (1, 3, 27), we examined whether the function of a human CD2 
molecule within a murine cell line was linked to murine CD3-Ti. As shown in Fig. 
3 b, modulation of the murine CD3 molecule by the hamster anti-mouse GD3€ an- 
tibody (145 2C11) resulted in nearly complete loss of the CD3 molecule from the 
cell surface of CD2 FL after incubation for 16 h at 37^C, while CD2 expression 
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was unaltered or slightly increased. Such anti-CD3 modulated CD2 FL cells were 
no longer stimulated by anti-Tll2 + anti-Tils antibodies to increase [Ca^*]i. By con- 
trast, the incubation of CD2 FL cells under similar conditions with anti-human 
CDS antibody (Leu4) did not have any effect (Materials and Methods). This result 
suggests that regulation of the CD2 pathway by CD3-Ti is intact in the cellular model 
herein, further supporting the validity of this functional approach in the study of 
the GD2 intracellular domain. We cannot, of course, exclude the possibility that 
CDS modulation leads to a generalized disruption of subsequent T cell activation. 

We next examined if nuclear activation events including IL-2 induction and sub- 
sequent IL-2 secretion could be triggered through the human CD2 molecule in mu- 
rine CD2 FL cells, lb this end, clones were stimulated and IL-2 secretion into su- 
pematants assayed using the IL-2-dependent CTLL-20 cells (29). As shown in Fig. 
4 a, supematants from CD2 PL or 3D054.8 cells stimulated with ovalbumin in the 
presence of the H-2(I-A'*) expressing A20-11 B lymphoma are able to induce prolifer- 
ation of the IL-2-dependent CTLL-20 cells in a comparable way. By contrast, the 
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Figure 4. Stimulation of IL-2 production through the human CD2 molecule and its variants 
forms o n mu rine T cells, (a) Eifect of culture supematants from CD2 FL or 3D054.8 on prolifer- 
ation of CTLL-20 ceils. Serial twofold dilutions of culture supematants from stimulated or unac- 
tivated murine cells were tested for their capacity to support the growth of the IL-2-dependent 
murine CTLL-20 cell line (29), as evaluated by pH]thymidine incorporation, {b) IL-2 produc- 
tion upon antigen (ovalbumin) or anti-Tll2 > antiTlls stimulation. Under these experimental 
conditions, the limit of detection for IL-2 was <4 U/ml. M represents the murine T cell line 
CD2 M271-2. Analysis of IL-2 production by two independent clones of the CD2 M278-9 provided 
evidence that such cells could produce significant amounts of IL-2 upon CD2 triggering ('v50% 
of the amount secreted upon antigen stimulation). Experiments were performed as above, but 
precise quantification of IL-2 production %vas obtained by running culture supematants in par- 
allel to a titration curve of rIL-2 (Biogen Laboratdries, Cambridge, MA). 
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combination of anti-Til 2 + anti-Tlls antibodies is effective in inducing IL-2 produc- 
tion by CD2 FL cells but not the parental line 3D054.8. Fig. 4 b shows that all 
of the cell lines tested including CD2 FL, 3D054.8, and the CD2 AC series produce 
a high amount of IL-2 (ranging from 30 to 100 U/ml, corresponding to a clonal 
variation repeatedly observed) when stimulated with ovalbumin in the I*A^ context. 
This result demonstrates the integrity of the IL-2 synthetic pathway in each clone. 
Perhaps more importantly, after stimulation with anti-CD2 antibodies, the clones 
CD2 FL, CD2 AG98, and CD2 AG77 produce comparable levels of IL-2, while trans- 
fectants CD2 AC43 and CD2 AC18, like the untransfected 3D054.8, are not trig- 
gered through CD2. Analysis of five other independent clones expressing the AC 43 
CD2 molecule clearly established that these cells are not triggered through human 
CD2 either to secrete detectable levels of IL-2 or to elevate [Ca^**^]! (data not shown). 
Taken together, these data show that a fuU-length human GD2 molecule, as well 
as a CD2 molecule lacking 19 or 40 COOH-terminal amino acids from the cyto- 
plasmic domain^ is able to activate T lymphocytes after appropriate perturbation 
of the CD2 extracellular segment. Interestingly, the CD2 AC77 clones express human 
CD2 molecules lacking residues 289 to 316. The latter corresponds to the segment 
most conserved among human and murine molecules, with 24 out of 27 residues 
being identical (Fig. 1, a and b). Presumably, these conserved residues function in 
another facet of CD2 biology unrelated to IL-2 induction and/or secretion. In con- 
trast, the AC 43 truncated molecules as well as shorter truncations are nonftinctional 
with respect to stimulating a rise in [Ca^^Ji and IL-2 production. 

These data provide unequivocal evidence that the CD2 cytoplasmic domain is 
involved in signal transduction and that one essential sequence of the cytoplasmic 
domain necessary for CD2-mediated activation is located between amino acids 253 
to 287. This region contains four histidines at amino acid positions 264, 271, 278, 
and 282 £uid includes two tEindemly repeated segments (PPPGHR, amino acids 
260-265 and 274-279) (see Fig. 1, a and b). These histidine residues could represent 
a binding site for an ion, cyclic nucleotide, or other small regulatory molecule. 

lb evaluate the role of the histidine residues, substitution mutants of CD2 were 
produced by site-directed mutagenesis as described in Materials and Methods. Two 
different categories of mutants with a stable CD2 surface expression were obtained: 
CD2 M271-2 and CD2 M278-9 in which the positively charged histidine and argi- 
nine residues at position 271 and 272 or 278 and 279 were replaced by negatively 
charged aspartic acid and glutamic acid, respectively (Fig. 1 b). Functional charac- 
terization of these two mutants shows that IL-2 production can be induced by anti- 
Til 2 and antiTlls antibodies to a level comparable to that of the GD2 FL clones 
(Fig. 4 b and data not shown). Thus, a putative "cage," requiring four histidine residues, 
is not necessary for this CD2 activation. Furthermore, since the mutations at posi- 
tion 278-279 alter the structure of the second repeat without affecting T cell activa- 
tion, the mg^COOH -terminal repeat is not required. Therefore, it may be sp>ecu- 
lated that perhaps only one PPPGHR sequence is necessary for induction of IL-2 
activity. This notion can be tested by further site-directed mutagenesis studies. Al- 
ternatively, the repeats and/or other histidine residues could function to regelate 
CD2 interaction with cytoskeletal, plasma membrane, cytosolic, nuclear, or other 
cellular components not examined by the present assays. It also remains to be deter- 
mined if the GD2 cytoplasmic region influences the CD2 extracellular segment and 
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hence, interaction with LFA-3. The present modd system will be useful at addressing 
these latter possibilities. 

Summary 

lb evaluate those residues in the 117 amino acids of the CD2 cytoplasmic domain 
required for transduction of T lymphocyte activation signals, a full-length human 
CD2 cDNA and a series of deletion and substitution mutants were inserted into 
the ovalbumin-specific, I-A**-restricted murine T cell hybridoma 3D054.8 using a 
retroviral system. The resulting cells express surface CD2 protein and unlike the 
parental murine line, are reactive with murine anti-human CD2 andbodies. Anti- 
TII2 plus and-Tlls antibody stimulation of cells expressing a full-length CD2 cDNA 
results in a ch£U*acteristic rise in cytosolic-frec calcium ([Ca**]i), and subsequent IL- 
2 secretion that accompany CD2 stimulation in human T lymphocytes. TVansfec- 
tants expressing CD2 AC98 and CD2 AC77, partially deleted CD2 molecules con- 
taining the entire extracellular and transmembrane GD2 segments but only 98 and 
77 amino acids of the cytoplasmic domain, respectively, are also activated by anti- 
CD2 ntAbs. In contrast, clones expressing more severely truncated CD2 structures, 
GD2 AC43 and CD2 AC18, are not stimulated. These data show that the cytoplasmic 
domain plays an essential role in transduction of activation signals via CD2, and 
that the segment between amino acid residues 253 and 278 is necessary for activa- 
tion. This region contains two tandem repeats of the sequence PPPGHR, thought 
to form part of a putative cationic site. Disruption of the latter by site-directed muta- 
genesis does not affect IL-2 gene induction, suggesting that only one of the repeats 
is required for activating this function of the CD2 molecule. 

Received Jor publication 30 November 1988 and in revised form 16 February 1989. 

References 

1 . Meuer, S. C, R. E. Hussey, M. Fabbi, D. Fox, O. Acuto. K. A. Fitzgerald, J. C. Hodgdon, 
J. P. Protentis, S. F, Schlossman, and E. L. Reinhcrz. 1984. An alternative pathway of 
T cell activadon: a functional role for the 50KD Til sheep erythrocyte receptor protein. 
CelL 36:897. 

2. Siliciano^ R. R, J. C. Pratt, R. E. Schmidt, J. Riu, and E. L. Reinherz. 1985. Activadon 
of cytolytic T lymphocytes and natural killer cell function through the TU sheep erythro- 
cyte binding protein. Nature (Lond), 317:428. 

3. Brotuer, P., L. Boumsell, C. Gelin, and A. Bernard. 1985. T cell activation via CD2 
(T, gp50) molecules: accessory cells are required to trigger T cell activation via GD2- 
D66 plus CD2-9.6/Tn (1) epitopes. J. Immunol 135:1624. 

4. Selvaraj, P.. M. L. Plunkctt, M. Dustin, M. E. Sanders, S. Shaw, and T A. Springer. 
1987. The T lymphocyte glycoprotein CD2 binds the cell surface ligand LFA-3. Nature 
(Lond.). 326:400. 

5. Dustin, M. L., M. E. Sanders, S. Shaw, and T. A. Springer 1987. Purified lymphocyte 
function-associated andgen 3 binds to GD2 and mediates T lymphocyte adhesion. J, 
Exp. Med 165:677. 

6. Yang, S. Y., S. Chouaib, and B. DuPont. 1986, A common pathway for T lymphocyte 
activation involving both the CD3-Ti complex and the CD2 sheep erythrocyte receptor 
determinants, y. Immunol 137:1097. 

7. Bierer, B. E., J. Barbosa, S. Herrmann, and S. J. Burakoff. 1988. The interaction of 



2082 CD2 INTRACELLULAR DOMAIN 

CD2 with its ligand, LFA-3, in human T cell proliferation. J. Immunol 140:3358. 

8. Alcover, A., M.J. Weiss, J. F. I>a]ey, and E. L. Reinherz. 1986. The Til surface glyco- 
protein is functionally linked to a calcium channel in precursor and mature T lineage 
cells. Proc. Nad. Acad. Set. USA. 83:2614. 

9. Pantaleo, G., O. Olive, A. Pbggi, W. J. Kozumbo, L. Moretta, and A. Moretta. 1987. 
Transmembrane signalling via the Tll-dependent pathway of human T cell activation. 
Evidence foe the involvement of 1,2-diacylglycerol and inositol phosphates. Eur. J. Im- 
munoi 17:55. 

10. Hunig, T, G. Ticfenthaler, K. H. Meyer Zum Buschcnfclde, and S. C. Meuer. 1987. 
Alternative pathway activation of T cells by binding of CD2 to its cell surface ligand. 
Nature (Land.). 326:298, 

11. Sayre, P. H., H. C. Chang, R. E. Hussey, N. R. Brown, N. E. Richardson, G. Spagnoli, 
L. K. Clayton, and £. L. Reinherz. 1987. Molecular cloning and expression ofTU cDNAs 
reveal a novel receptor-like structure on human T lymphocytes. Proc. NaiL Acad. Set. USA. 
84:2941. 

12. Clayton, L. K., P. H. Sayre, J. Novotny. and E. L. Reinherz. 1987. Murine and human 
Til (CD2) cDNA sequences suggest a common signal transduction mechanism. Eur. J. 
Immunol 17:1367. 

13. Sewell, W. A., M. H. Brown, J. Dunne, M.J, Owen, and M. J. Cnimpton. 1986. Molec- 
ular cloning of the T lymphocyte surface CD2 (Til) antigen. Proc. Natl Acad. Set. USA. 
83:8718. 

14. Seed, B., and A. Aruffo. 1987. Molecular cloning of the CD2 emtigen, the T cell erythro- 
cyte receptor, by a rapid immunoselection procedure. Proe. Natl. Acad Set. USA. 84:3365. 

15. Diamond, D. J., L. K. Clayton, P. H. Sayre, and E. L. Reinherz. 1988. Exon-intron 
organization and sequence comparison of human and murine Til (CD2) genes. Ptoe. 
Natl Acad. Sei. USA. 85:1615. 

16. Lang, G., D. Wotton, M. J. Owen, W. A. Sewell, M. H. Brown, D. Y. Mason, M. J. 
Crumpton, and D. Kioussis. 1988. The structure of the human CD2 gene and its expres- 
sion in transgenic mice. EMBO (Eur. Mol Biol Organ.) J. 7:1675. 

17. Sewell, W. A,, M. H. Brown, M.J. Owen, P.J. Fink, C. A. Kozak, and M.J. Crumpton. 
1987. The murine homologue of the T lymphocyte CD2 antigen: molecular cloning, 
chromosome assignment and ceU surface expression. Eur. J. Immunol 17:105. 

18. Williams, A. K, A. N. Barclay, S. J. Clark, D. J. Paterson, and A. C. Willis. 1987. Similar- 
ities in sequence and cellular expression between rat CD2 and CD4 antigens. J. Exp. 
Med 165:368. 

19. Taylor, J. W., J. Ott, and F. Eckstein. 1985. The rapid generation of oligonucleotide- 
directed mutations at high frequency using phosphorothionate- modified DNA. Nucleic 
Acids Res. 13:8764. 

20. Haskins. K., R. Kubo, J. White, M, Pigeon, J. Kappler, and P. Marrack. 1983. The 
major histocompatibility complex-restricted antigen receptor on T cells. I. Isolation with 
a monoclonal antibody. J. Exp. Med. 157:1149. 

21. Cepko, C. L., B. E. Roberts, and R. C. Mulligan. 1984. Construction and applications 
of highly transmissible murine retrovirus shuttle vector. Cell 37:1053. 

22. Grynkiewicz, G., M. Poenic, and R. Y. Tsien. 1985. A new generation of Ca'^ indicators 
with greatly improved fluorescence properties. J. Biol Chem. 260:3440. 

23. Pantaleo, G., D. Olive, A. Poggi, T. Pozzan, L. Moretta, and A. Moretta. 1987. Antibody- 
induced modulation of the CD3/T cell receptor complex causes T cell refractoriness by 
inhibiting the early metabolic steps involved in T cell activation. J, Exp. Med 166:619. 

24. Korman, A. J., J. D. Frantz, J. L. Strominger, and R. C. Mulligan. 1987. Expression 
of human class II major histocompatibility complex antigen using retrovirus vectors. 
Proc. Natl Acad Sei. USA. 84:2150. 



CHANG ET AL. 



2083 



25. He, Q., A. D. Beyers, A. N. Barclay, and A. F. Williams. 1988. A role in transmembrane 
signalling for the cytoplasmic domain of the CD2 T lymphocyte surface antigen. Ceil. 



26. Bierer, B. A., A. Peterson, J. C. Gorga, S, H. Herrmann, and S. J. Burakoff. 1988. Syn- 
ergistic T cell activation via the physiological ligands for CD2 and the T cell receptor. 
J. Exp. Med. 168:1145. 

27., Alcover, A., C. Alberini, O. Acuto, L. K. Clayton, C. Transy, G. Spagnoli, P. Moingeon, 
P. Lopez, and E. L. Reinherz. 1988. Interdependence of CD3-Ti and CD2 activation 
pathways in human T lymphocytes. EMBO (Eur. Mol. Bioi Organ.) J. 7:1973. 
28. Leo, O., M. Foo, D. H. Sachs, L. E. Samelson, and J. A. Bluestone. 1987. Identification 
of a monoclonal antibody specific for a murine T3 polypeptide. Proc, Nail. Acad. Set. USA. 



29. Gillis, S., M. Perm, W. Ou, and K. A. Smith. 1978. T cell growth factor: parameters 
of production and a quantitative microassay for activity. J. Immunol. 120:2077. 



54:979. 



84:1374. 



Journal of Virology. Sept. 1991. p. 5007-5012 Vol. 65, No. 9 

0022-538X/91/095007-06$02.0Q/0 

Copyright O 1991, American Society for Microbiology 

Effects of Changes in gpl20-CD4 Binding Affinity on Human 
Immunodeficiency Virus Type 1 Envelope Glycoprotein 

Function and Soluble CD4 Sensitivity 

MARKUS THALI, UDY OLSHEVSKY, CRAIG FURMAN, DANA GABUZDA. JOHN LI, 

AND JOSEPH SODROSKI* 

Division of Human Retroviroiogy, Dana-Farber Cancer Institute, and Department of Pathology, 

Harvard Medical School, Boston, Massachusetts 021 15 

Received 15 March 1991/Accepteu 14 June 1991 

Mutant gpl20 glycoproteins exhibiting a range of a£Bnities for CD4 were tested for ability to form syncytia 
and to complement an env-defective provinis for replication. Surprisingly, gpl20 mutants that efficiently 
induced syncytia and/or complemented virus replication were Identified that exhibited marked (up to 50-fold) 
reductions in CD4-binding ability. Temperature-dependent changes in gpl20, which result hi a seven- to 
ninefold increase In affinity for CD4, were shown not to be necessary for subsequent membrane talon or virus 
entry events. Mutant glycoproteins demonstrating even relatively small decreases in CIM-blnding ability 
exhibited reduced sensitivity to soluble CD4. The considerable range of CIM-binding affinities tolerated by 
replication-competent HIV*! variants has important Implications for antiviral strategies directed at the 
gpl20-CD4 interaction. 



Human immunodeficiency virus type 1 (HIV-1) is the 
etiologic agent of AIDS, which is characterized by a deple- 
tion of CD4-positive lymphocytes (3. 16, 17. 27, 35). HIV-1 
exhibits a tropism for CD4-positive cells due to a high affinity 
{Ka - 4 X 10'*' M) interaction between the CD4 glycopro- 
tein, which serves as the viral receptor, and the HIV-1 gpl20 
exterior envelope glycoprotein (13, 23, 24, 28, 30, 31). 
Following CD4 binding, the gpl20 glycoprotein and gp41, 
the transmembrane envelope glycoprotein, mediate the fu- 
sion of viral and target cell membranes, which is pH inde- 
pendent and necessary for virus entry (18, 25a, 40). Similar 
events mediated by the envelope glycoproteins expressed on 
the surface of an infected cell result in fusion of infected cells 
with CD4-positive cells to form syncytia (18, 25a, 29, 39). 

The gplio molecule binds to a protruding ridge of the CD4 
glycoprotein analogous to the CDR2 loop of immunoglob- 
ulins (1. 2. 6-8. 21, 26, 32, 34. 36, 42). The CD4-binding site 
on gpl20 is discontinuous, and changes in specific residues 
located in the second, third, and fourth conserved gpl20 
regions affect CD4 binding without grossly disrupting the 
gpl20 conformation (10, 11, 18, 25a, 33). Here, we examine 
the effects of changes in gpl20-CD4 affinity on envelope 
glycoprotein functions involved in syncytium formation and 
virus entry. We also examine the effect of changes in 
CIM-binding affinity on the sensitivity of gpl20 variants to 
soluble CD4, which has been shown to exhibit antiviral 
activity in vitro (4, 14, 15, 19, 38, 41). 

MATERIALS AND METHODS 

Envelope glycoprotein expression and CD4-bhidlng assays. 
For the binding assays, COS-1 cells were transfected with 
pSVIIIenv DNA expressing wild-type or mutant envelope 
glycoproteins as previously described (12, 18, 33). For initial 
experiments examining the effect of temperature on binding 
of the wild-type gpl20, supematants of transfected cells that 
were radiolabeled with [^^S]cysteine were incubated for 
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various lengths of time with 5 x 10'' SupTl lymphocytes at 
either 4 or 37*C. The cells were washed once with phos- 
phate-buffered saline (PBS), lysed, and then precipitated 
with an excess of a mixture of sera derived from AIDS 
patients (33). The unbound firaction of the supematants was 
also used for inununoprecipitation. The amount of gpl20 
precipitated from both bound and unbound fractions was 
measui^ by densitometry of sodium dodecyl sulfate-poly- 
acrylamide gels. Previous studies indicated that the ratio of 
bound to free gpl20 did not vary under these conditions over 
a greater than 20-fold range of wild-type gpl20 concentra- 
tions. CIM-binding abilities of the mutant glycoproteins were 
assessed as described above, by using a 90-min incubation at 
either 4 or 37'C with SupTl lymphocytes. 

Syncytium-fonning ability of envelope glycoproteins. To 
assess the syncytium-forming ability of mutant envelope 
glycoproteins, 5 x 10^ COS-1 cells in 100-mm^ dishes were 
transfected with 10 of the envelope expressor plasmid 
(pSVIIIenv). Forty-eight hours after transfection, the cells 
were rinsed twice in PBS and incubated with 50 mM EDTA, 
pH 7.5. at 37°C for approximately 40 min. Cells were 
removed from the plate by trituration, centrifiiged, washed 
in 5 ml of PBS, and suspended in 5 ml of Dulbecco modified 
Eagle medium-10% fetal calf serum (DMEM-10% FCS). To 
these cells was added 2 x 10"^ SupTl lymphocytes suspended 
in 5 ml of DMEM-10% FCS. The cells were transferred to a 
T25 flask and returned to 3T*C in a 5% COj incubator, and 
syncytia were scored in 6 to 8 h. 

Replication complementation assays. Complementation of 
the single-step replication of the HXBAenvCAT provirus 
into different lymphocytes was performed as previously 
described (18). Equivalent amounts of reverse transcriptase 
activity representing recombinant virions in COS-1 supema- 
tants were added to Jurkat, H9, or SupTl lymphocytes, and 
chloramphenicol acetyltransferase (CAT) activity was mea- 
sured at 72 h following infection. 

Soluble CD4 Inhibition of syncytium formation and virus 
replication. To examine soluble CD4 inhibition of syncytium 
formation, transfected COS-1 cells detached from tissue 
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Time (min) 

FIG. 1. Effects of temperature on relative CD4 binding. (A) Autoradiogram of wild-type gpl20 glycoprotein (lanes 1 through 4) or 
supematants from mock-transfected COS-1 cells Ganes 5 and 6) either bound Oanes B: 1. 3. and 5) or unbound (lanes F: 2, 4» and 6) to SupTl 
lymphocytes. Experiments were conducted at either AX: (lanes 1 and 2) or 3T*C Ganes 3 through 6). (B) Graph showing the ratio of the amount 
of wild-type gpl20 glycoprotein bound to CD4-positive SupTl lymphocytes to the amount of unbound gpl20 for various times of incubation 
at either 4 or yrc. The datum points represent the mean values of data from two independent experiments. 



culture dishes as described above were incubated with 
different concentrations of soluble CD4 (American BioTech- 
nologies. Inc.) in 2.5 ml of DMEM-10% FCS for 1 h at 37°C. 
To this mixture was added 2 x 10^ SupH lymphocytes 
suspended in 1 ml of DMEM-10% FCS. Cells were trans- 
ferred to a T25 flask, and syncytia were scored in 8 h. 

For inhibition of vims replication, recombinant HXBAenv 
CAT virus produced in COS-1 cells (18) was incubated with 
different concentrations of soluble CD4 for 1 h at 37*C. The 
virus preparations were then incubated with Jurkat lympho- 
cytes, and CAT activity was measured 48 h later as previ- 
ously described (18). 

RESULTS 

Temperature effects on CD4 binding. The effect of temper- 
ature on the ability of radiolabeled soluble gpl20 derived 
from the HXBc2 HIV-1 virus to bind to CIM-positive SupTl 
lymphocytes is shown in Fig. 1. Binding equilibrium was 
achieved by approximately 2 h at either 4 or 37*C. A seven- 
to ninefold increase in the ratio of bound to free gpl20 was 
observed at 37"^ relative to that observed at 4**C. This 
observed increase at 37*C was only marginally affected by 
fixing the SupTl lymphocytes in methanol prior to use (data 
not shown). 

Effect of changes in gpl20 on CD4 bfaidfaig at different 
temperatures. To examine whether changes in the HIV-1 
gpl20 glycoprotein previously shown to alter its affinity for 
CD4 (33) affected the temperature-dependent component of 
the binding process, the binding of gpl20 mutants to SupTl 
lymphocytes was examined at both 4 and 37'C, using a 
90-minute binding assay. The results presented in Table 1 
and Fig. 2 demonstrate that although several of the mutants 
exhibited CD4-binding abilities comparable with or even 
better than that of the wild-type envelope glycoprotein at 
4°C» significant increases in binding at 3T*C were not ob- 
served for these mutants (e.g., 257 T/R. 368 D/T, 368 D/P, 
368 D/E, 370 E/Q, 430 V/S, 457 D/R. and 457 D/A). The 
relative CD4-binding abilities of the mutants at 37*C, from a 
90- min assay, were similar to those previously reported by 
using a 60-min binding assay (33). 

Function of the envelope ^ycoprotein mutants. The ability 
of the envelope glycoprotein mutants to induce the forma- 



tion of syncytia was examined (Table 1). For all of the 
mutants exhibiting less than one percent of wild-type CD4- 
bmding ability, syncytium-forming ability was undetectable 
(e.g.. 368 D/K, 368 D/R. 370 E/R. 427 W/S, and 427 W/V). 
Several of the mutants that exhibited smaller decreases in 
CD4 binding were capable of forming syncytia, although this 
occurred at a lower efficiency than that observed for the 
wild-type glycoproteins (e.g., 257 T/R. 368 D/T, 368 D/E, 368 
D/N, 370 E/D, 430 V/S, and 457 D/A). 

The ability of the mutant envelope glycoproteins to com- 
plement the entry of an ^ nv-defective vinis into Juricat, 
SupTl, and H9 lymphocytes was assessed. For the mutants 
exhibiting less than 1% of the wild-type CD4-binding ability, 
the ability to complement virus entry was less than 10% that 
of the wild-type envelope glycoprotein in all the target cell 
types examined (e.g., 368 D/K, 368 D/R. 370 E/R, and 427 
W/V). The other mutants, which exhibited CD4-bindlng 
ability at 37'C of greater than 1% of the wild-type value, 
were able to complement virus entry into the target cells 
examined to various degrees. The 368 D/T, 368 D/E, and 370 
E/Q mutants all allowed efficient entry into Jurkat and SupTl 
lymphocytes despite approximately 3-, 2(^, and 90-foid 
reductions, respectively, in relative Cl>4-binding ability. 
Several of the mutants that did not exhibit temperature- 
related affinity increases were capable of complementing 
virus entry into some of the target cell types (257 T/R, 368 
D/T, 368 D/P, 368 D/E, 370 E/Q, 430 V/S, and 457 D/A). For 
these and for other mutants retaining some degree of repli- 
cation competence, the efficiency of entry into the various 
target cells utilized in these studies exhibited the pattern 
Jurkat s SupTl s H9 lymphocytes. 

Soluble CD4 sensitivity of mutant envelope glycoproteins. 
The ability of syncytium induction and of virus replication to 
be inhibited by soluble CD4 was assessed for those mutant 
glycoproteins competent for these functions (Table 2 and 
Fig. 3 and 4). Some mutants (120/121 VK/LE, 269 E/L, and 
485 KAO that exhibited mild (less than twofold) reductions in 
CD4-binding ability were inhibited by soluble CD4 to the 
same extent as was the wild-type glycoprotein. The func- 
tional mutants that exhibited greater than twofold decreases 
in relative CD4-binding ability were less sensitive than the 
wild-type glycoprotein to soluble CD4 inhibition of syncy- 
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TABLE 1. Phenotypes of gpl20 mutants" 



Mutant 


Relative CD4 binding at (fold increase)* 


Syncytium 
formation 


Replication complementation'^ in cell tine 


4'C 


3TX: 


Jurkat 


SupTl 


H9 


wad type 


0.14 


1.00 (7.1) 


100 


100 


100 


100 


257 T/R 


0.12 


0.06 (0.50) 


5 


40 


25 


3 


368 D/T 


0.31 


0.30 (0.97) 


16 


32 


23 


7 


368 D/P 


0.18 


0.10 (0.56) 


<1 


18 


19 


2 


368 D/E 


0.05 


0.05 (1.00) 


51 


70 


62 


22 


368 D/N 


ND 


0.22 (ND) 


38 


14 


11 


5 


368 D/K 


ND 


<0.005 (ND) 


<1 


7 


9 


2 


368 D/R 


ND 


<0.0004 (ND) 


<1 


2 


2 


0 


370 E/D 


0.14 


0.42 (3.0) 


73 


87 


81 


50 


370 E/Q 


0.06 


0.011 (0.18) 


<1 


33 


33 


3 


370 E/R 


ND 


<0.003 (ND) 


<1 


1 


3 


1 


427 W/S 


ND 


<0.012 (ND) 


<1 


ND 


ND 


ND 


427 WA^ 


<0.006 


<0.006 (ND) 


<1 


1 


ND 


1 


430 V/S 


0.41 


0.65 (1.6) 


80 


100 


ND 


ND 


457 D/R 


0.06 


0.10 (1.67) 


<1 


12 


9 


3 


457 D/A 


0.08 


0.10 (1.25) 


47 


36 


22 


6 



' Data reported represent the means of at least two independent experiments. 

* The relative CD4-bindtng ability, determined by using a 90-min binding assay as described in Materials and Methods, is shown. The relative CD4-binding 
ability was calculated according to the following formtila: 

(free wild-type spl20hrc 
(bound wild-type fSplTOhrx: 

The fold increase shown in parentheses represents the ratio of the relative CD4-binding ability of a given mutant at 37^ to that observed at 4**C. 

Syncytium formation and replication complementation are represented as the percentage of wild-type values observed for a given target cell. The percent 
activity observed for a ccmtrol plasmid not expressing the HIV-1 envelope glycoproteins was less than 1% for syncytium formation and less than 2% for replication 
comirfementation in all three target cell lines. 



, . .... [(bound mutant gpl20)tanpx 

relative CD4 buidingteavx - -rr- — — 

I (free mutant gpl20)teiivx 



tium fonnation or virus entry (e.g., 257 T/R, 368 DH", 368 
D/E, 368 D/N. 370 E/D, 370 E/Q, 457 D/A, and 457 D/N). 
The 430 V/S and 457 D/E mutants exhibited soluble CD4 
sensitivities intermediate between that of the wild-type gly- 
coproteins and those of mutants with larger decreases in 
CD4-binding ability. Mutations affecting the threonine 257 or 
aspartic acid 457 residues (257 T/A, 457 D/N) that did not 
decrease CD4-binding ability did not result in soluble CD4 
resistance, whereas other changes in these residues (257 
T/R, 457 D/A. 457 D/G, and 457 D/E) yielded mutants that 
exhibited decreased CD4 binding and resistance to soluble 

cn>4. 

DISCUSSION 

The interaction of gplZO with CD4 was reduced to unde- 
tectable levels by some changes in gpl20 residues 368, 370, 



or 427. These changes dramatically reduced the ability of the 
envelope glycoproteins to form syncytia or to complement 
an env-defective virus for replication, consistent with the 
miuor contribution of CD4 binding to these processes. 

While the gpl20 changes that reduced CD4-binding ability 
by 100-fold or more uniformly abrogated syncytium-forming 
and/or replicative functions, amino acid changes resulting in 
less dramatic effects on CD4 binding exerted more variable 
effects on these envelope glycoprotein functions. The ability 
of the 368 D/E, 370 E/Q, 370 E/D, and 457 D/A mutants to 
complement the replication of the e#iv-defective virus and 
the ability of the 368 D/T. 368 D/E, 368 D/N, 370 E/D. 430 
V/S, and 457 D/A mutants to form syncytia indicate that 
even 20- to 50-fold decreases in CD4-binding ability are not 
necessarily accompanied by abrogation of these functions. 
These results are consistent with reports that replication- 
competent HIV-1 isolates can exhibit a broad range of 
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TABLE 2. Soluble CD4 inhibition of envelope 
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* Rel^ive CD4-binding abilities vrc derived from experiments in which a 
60-min binding assay was used (33). 

^ Values shown are the averages of the results of three independent 
experiments. Values represent the concentration of soluble CD4 required to 
inhibit syncytium formation or virus entry by 50%. ND, not determined. 
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FIG. 4. Effects of gpl20 amino acid changes on sensitivity to 
soluble CD4 inhibition of replication complementation. The effect of 
different soluble CD4 concentrations on the CAT activity trans- 
ferred to Jurkat lymphocytes by recombinant virions carrying the 
wild-type or mutant envelope glycoproteins is shown. Each value 
represents the percentage of CAT activity observed for a ^ven 
mutant in the presence of soluble CD4 relative to the activity 
observed for the mutant in the absence of soluble CD4. 



CD4-btnding abilities (20). The decreases in syncytium- 
forming or replicative abilities for some of the mutants (e.g., 
257 T/R, 368 D/P, and 457 D/R) that exhibit less than 50-fold 
reductions in CE>4-binding ability are likely to result from the 
effect of the changes on envelope glycoprotein functions 
other than receptor binding. 

For some of the mutants (257 T/R, 368 D/P, 370 E/Q. and 
457 D/R), syncytium formation is affected more dramatically 
than is the ability to complement virus replication in the 
same target cells. A similar result has been seen for amino 
acid changes in the HIV-1 gp41 amino terminus that affect 
the membrane fusion process (25). These results probably 
reflect the requirement for a greater number of successful 




257 T/R 

370 E/D 
368 (VE 

457 O/A 



368 CVN 
388 D/T 




1 10 too 

[sCD4) (micrograms/ml) 

FIG. 3. Effects of amino acid changes in gpl20 on inhibition of 
syncytium formation by soluble CD4. The percentage of the number 
of syncytia scored in the absence of soluble CD4 is shown for 
increasing concentrations of soluble CE>4. Values are shown for a 
tsrptcal experiment with the wild-type (w.t.) or mutant envelope 
glycoproteins. 



envelope glycoprotein-receptor interactions in the process of 
cell-cell fusion than in the process of virus entry. 

Decreases in CD4-binding ability exerted greater effects 
on vims entry into some target cells than they did in other 
cell types. For all of the mutants examined, however, the 
relative efficiency of entry into various target cells followed 
the pattern Jurkat ^ SupTl ^ H9 lymphocytes. Thus, the 
efficiency of entry of the mutant virus into these cell types 
appears to be governed by quantitative factors, one of which 
may be the level of target cell CD4 expression. 

At 4'*C, the HIV-1 gpl20 glycoprotein bound to CD4. 
positive cells with an affinity that was approximately seven- 
to ninefold lower than is seen at FoUowing a shift to 
3TC, a g^^ual increase in the association of the wild-type 
gpl20 glycoprotein with CD4-positive cells was observed. 
This phenomenon cannot be simply explained by endocyto- 
sis of cell surface gpl20-CD4 complexes, since methanol 
fixation of the target cells did not eliminate the observed 
increase. If the effect of low temperature is to limit confor- 
mational changes in gpl20 or CD4, the results would suggest 
that induced changes in gpl20 and/or CD4 contribute to a 
higher affinity interaction at Mutations affecting sev- 

eral gpl20 residues (Thr-257, Asp-368, Glu-370, or Asp-457) 
decreased the efficiency of these induced changes, suggest- 
ing that multiple elements in the protein-protein interface 
contribute to the affinity increase. The gpl20-C:i>4 interac- 
tion, as is true for binding of some antibodies to their 
antigens (5, 9, 37), might be more appropriately described by 
an induced fit rather than by a static *Mock-and-key'* model. 
Several envelope glycoprotein mutants that did not undergo 
this temperature-dependent change exhibited significant lev- 
els of syncytium-forming or replicative ability, indicating 
that this conformational alteration is not a necessary step in 
the membrane fusion process. 

Relatively small decreases in CI>4-binding ability were 
associated with envelope glycoprotein mutants resistant to 
inhibition by soluble CD4. A good correlation was observed 
between mutants that exhibited a lower affinity for CD4 and 
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resistance to soluble CD4. This correlation was also ob- 
served for mutations affecting the same gpl20 amino acid but 
resulting in different effects on CD4 binding. For example, 
the 257 T/R, 457 D/A. 457 D/G, and 457 D/E mutants 
exhibited decreased CD4 binding and were resistant to 
soluble CD4, whereas the 257 T/A and 457 D/N mutants 
exhibited approximately wild-type C£>4-binding abilities and 
remained sensitive to soluble CD4. While these results do 
not rule out other mechanisms for the generation of soluble 
CD4-resistant HlV-1 variants, they strongly imply that 
changes in affinity for CD4 represent one such mechanism. 
Since fresh patient isolates of HIV-1 are often less sensitive 
to soluble CD4 than are HIV-1 isolates passaged extensively 
in T-lymphocyte cell lines (12a), some of these primary 
isolates might exhibit small but consequential decreases in 
binding affinity for CD4. The sixfold-lower ClM-binding 
affinity reported for the primary monocyte-propagated Ba-L 
isolate compared with the T-cell-passaged BHIO isolate 
might contribute to soluble CD4 resistance (20). HIV-2, 
which is less sensitive to soluble CD4 than is HIV-1, has 
been reported to have a 25-fold-iower affinity for CD4 (32a). 
Such changes in affinity apparently result in mcyor effects on 
soluble CD4 sensitivity, while exerting only minor effects on 
virus entry or syncytium formation. This difference may 
result from the higher cooperativity of envelope glycopro- 
tein-receptor interactions during the latter processes com- 
pared with the interaction of soluble CD4 and the envelope 
glycoproteins. Similar results have recently been reported 
for the inhibition of polio virus replication by soluble recep- 
tor molecules, in which replication-competent viruses resis*- 
tant to soluble receptor resulted from altered receptor af- 
finity (22). 

For some of the mutants exhibiting small changes in 
relative CD4-binding ability (e.g., 368 D/T and 370 E/D), the 
50% inhibitory dose (ID^^) for soluble CD4 inhibition of 
syncytium formation is affected more than the corresponding 
ID50 for virus replication. Soluble CD4 has been reported to 
act both as a competitive inhibitor and as an irreversible 
inhibitor, the latter activity resulting from dissociation of the 
gpl20 and gp41 subunits (22a, 32b). Since the soluble CD4- 
induced subunit dissociation is more efficient for envelope 
glycoproteins on the virions than on the ceil surface (32b), 
the observations for the mutants could be accoimtedi for by 
differences in the contributions of reversible and irreversible 
inhibitory activities of soluble CD4 in the syncytium forma- 
tion or virus entry process. 

Our results have implications for antiviral ther^ies di- 
rected at the gpl20-CD4 interaction. The net result of either 
a reduction in gpl20-CD4 affinity or the presence of a 
competitive inhibitor of the gpl20-CP4 interaction is to 
decrease the ratio of gpl20|^^gpl20fr^. One would expect 
that the effects of an identical reduction in this ratio on virus 
replication would be less for a competitive inhibitor than for 
a mutant exhibiting decreased affinity, since in the former 
case, the bound gpl20 retains wild-type function. Thus, our 
results imply that competitive inhibitors of the gpl20-CD4 
interaction will need to be extremely effective to significantly 
abrogate virus replication, especially in target cells express- 
ing high levels of CD4. 
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Fibroblastic proliferation accompanies many an^o- 
genesis-related retinal and systemic diseases. Since con- 
nective tissue growth factor (CTGF) is a potent mitogen 
for nbrosis, extracellular matrix production^ and angio- 
genesis, we have studied the effects and mechanism by 
which vascular endothelial growth factor (VEGF) regu- 
lates CTGF gene expression in retinal capillary cells. In 
our study, VEGF increased CTGF mRNA levels in a time- 
and concentration-dependent manner in bovine retinal 
endothelial cells and pericj^es, without the need of new 
protein S3mthesis and without altering mRNA stability. 
VEGF activated the tyrosine receptor phosphorylation 
of KDR and Fltl and increased the binding of phosphati- 
dylinositol 3-kinase (PI3-kinase) p85 subunit to KDR 
and Fltl, both of which could mediate CTGF gene induc- 
tion. VEGF-induced CTGF expression was mediated pri- 
marily by PI3-kinase activation, whereas PKC and ERK 
pathways made only minimal contributions. Further- 
more, overexpression of constitutive active Akt was suf- 
Hcient to induce CTGF gene expression, and inhibition 
of Akt activation by overexpressing dominant negative 
mutant of Akt abolished the VEGF-induced CTGF ex- 
pression. These data suggest that VEGF can increase 
CTGF gene expression in bovine retinal capillary cells 
via KDR or Fit receptors and the activation of PI3-ki- 
nase-Akt pathway independently of PKC or Ras-ERK 
pathway, possibly inducing the fibrosis observed in ret- 
inal neovascular diseases. 



Angiogenesis and fibrosis are key components in develop- 
ment, growth, wound healing, and regeneration (1). In addi- 
tion, these processes commonly occur together in many disease 
states where neovascularization is believed to initiate the path- 
ological cascade. Some of these diseases are proliferative dia- 
betic retinopathy (2), rheumatoid arthritis (3), and age-related 
macular degeneration (4). Thus, it is possible that the factors 
that regelate angiogenesis may also induce factors that stim- 
ulate extracellular matrix production and fibrosis. Accordingly, 
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we have studied the ability of vascular endothelial growth 
factor (VEGF),^ an established angiogenic factor, to regulate 
the expression of connective tissue growth factor (CTGF), a 
growth factor with known actions on fibroblast proliferation, 
matrix production, and associated with fibrotic disorders. 

VEGF is expressed as a family of peptides of 121, 145, 165, 
189, and 206 amino acid residues (5). Its expression is induced 
by hypoxia (6) and is essential in the vasculogenesis process 
during development (7). Several receptors have been shown to 
mediate the action of VEGF, and most of them belong to the 
tyrosine kinase receptor family (8). Upon the binding of VEGF 
to its receptors, multiple signaling cascades are activated, in- 
cluding the tyrosine phosphorylation of phospholipase Cy, ele- 
vation of intracellular calcium and diacylglycerol, activation of 
protein kinase C (PKC), and extracellular signal-regulated ki- 
nase (MAPK/ERK) for endothelial cell proliferation (9-12). In 
addition, VEGF also stimulates activation of phosphatidyli- 
nositol (PI) 3-kinase leading to Akt/PKB activation and possi- 
bly enhancing endothelial cell survival (13—15). However, in 
non-endothelial cells such as capillary pericjiies that predomi- 
nantly express Fltl receptor, the action of VE(iF is poorly 
understood. 

Connective tissue growth factor (CTGF), a member of CCN 
family (CYR61, CTGF, and NOV) (16, 17), is a potent and 
ubiquitously expressed growth factor that has been shown to 
play a unique role in fibroblast proliferation, cell adhesion, and 
the stimulation of extracellular matrix production (18, 19). The 
38-kDa protein was originally identified in conditioned medium 
from human umbilical vein endothelial cells (20), and the ex- 
pression was shown to be selectively stimulated by transform- 
ing growth factor-/3 (TGF-0) in cultured fibroblasts (21). Due to 
its mitogenic action on fibroblasts and its ability to induce the 
expression of the extracellular matrix molecules, collagen type 
I, fibronectin, and integrin (18), CTGF is supposed to play an 
important role in connective tissue cell proliferation and extra- 
cellular matrix deposition as one of the mediators of TGF-J3 



^The abbreviations used are: VEGF, vascular endothelial growth 

factor; CTGF, connective tissue growth factor; Fit, /ms-like tyrosine 
kinase; KDR/Flkl, fetal liver kinase/kinase domain-containing recep- 
tor; VEGFR, vascular endothelial growth factor receptor; PKB, protein 
kinase B; MAPK, mitogen-activated protein kinase; ERK, extracellular 
signal-regulated kinase; kb, kilobase pair; PKC, protein kinase C; PI, 
phosphatidylinositol; TGF-/3, transforming growth factor-/}; BREC, bo- 
vine retinal endothelial cells; PIGF, placenta growth factor; BRPC, 
bovine retinal pericytes; PCR, polymerase chain reaction; CAAkt, con- 
stitutive active Akt; DNAkt, dominant negative Akt; DNRas, dominant 
negative K-Ras; DNERK, dominant negative extracellular signal-regu- 
lated kinase; DNPKC^, dominant negative PKCf. 
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(22). CTGF also seems to be an important player in the patho- 
genesis of various fibrotic disorders, since it was shown to be 
overexpressed in scleroderma, keloids, and other fibrotic skin 
disorders (23), as well as in stromal rich mammary tumors (24), 
and in advanced atherosclerotic lesions (25). Recently, the in- 
tegrin a^^^ has been reported to serve as a receptor on endo- 
thelial cells for CTGF-mediated endothelial cell adhesion, mi- 
gration, and angiogenesis (26, 27). 

Besides TGF-0, the expression of CTGF is reported to be 
regulated by dexamethasone in BALB/c 3T3 cells (28), high 
glucose in human mesangial cells (29), kinin in human embry- 
onic fibroblasts (30), factor Vila, and thrombin in WI-38 fibro- 
blasts (31), tumor necrosis factor a in human skin fibroblast 
(32), and cAMP in bovine endothelial cells (33). Since many of 
these cytokines are known to induce VEGF, it is possible that 
increased VEGF expression can regulate the expression 'of 
CTGF. In the present study, we have investigated the regula- 
tion of CTGF by VEGF in retinal endothelial cells and pericytes 
via the PI3-kinase and several other signaling pathways. 

EXPERIMENTAL PROCEDURES 

Materials — Endothelial cell basal medium was purchased from Clo- 
netics (San Diego, OA). Endothelial cell growth factor was purchased 
from Roche Molecular Biochemicals. Dulbecco's modified Eagle's me- 
dium and fetal bovine serum were obtained from Life Technologies, Inc. 
VEGF, placenta growth factor (PIGF), TGF-^1, and anti-CTGF anti- 
body were ordered from R&D Systems (Minneapolis, MN). Anti-KDR 
(Flkl) and anti-FItl antibodies were purchased from Santa Cruz Bio- 
technology (Santa Cruz, CA). Protein A-Sepharose was purchased from 
Amersham Pharmacia Biotech. Anti-phospho-ERK, anti-ERK, anti- 
phospho-Akt, and anti-Akt were purchased from New England Biolabs 
(Beverly, MA). Anti-p85 and anti-phosphotyrosine were purchased from 
Upstate Biotechnology, Inc. (Lake Placid, NY). Phosphatidyl inositol 
(PI) was purchased from Avanti (Alabaster, AL), and PD98059, wort- 
mannin, and OF 109203X were obtained from Calbiochem. All other 
materials were ordered from Fisher and Sigma. 

Cell Culture — Primary cultures of bovine retinal endothelial cells 
(BREC) and pericytes (BRPC) were isolated by homogenization and a 
series of filtration steps as described previously (34). BREC were sub< 
sequently cultured with endothelial cell basal medium supplemented 
with 10% plasma-derived horse serum, 50 mg/liter heparin, and 50 
pig/ml endothelial cell growth factor. BRPC were cultured in Dulbecco's 
modified Eagle's medium with 5.5 mM glucose and 20% fetal bovine 
serum. Cells were cultured in 5% COg at 37 *C, and media were 
changed every 3 days. Cells were characterized for their homogeneity 
by immunoreactivity with anti-factor VIII antibody for BREC and with 
monoclonal antibody 3G5 for BRPC (35). Cells remained morphologically 
unchanged under these conditions, as confirmed by light microscopy. Only 
cells from passages 2 through 7 were used for the experiments. 

Recombinant Adenoviruses — cDNA of constitutive active Akt 
(CAAkt, Gag protein fused to N-terminal of wild type Akt) was con- 
structed as described (36). cDNA of dominant negative Akt (DNAkt, 
substituted Thr-308 to Ala and Ser-473 to Ala) was constructed as 
described (37). cDNA of dominant negative K-Ras (DNRas, substituted 
Ser-17 to Asn) was kindly provided by Dr. Takai (Osaka University) 
(38). cDNA of dominant negative extracellular signal-regulated kinase 
(DNERK, substituted Lys-52 to Arg in ATP-binding site) was con- 
structed as described (39). cDNA of Ap85 was kindly provided by Dr. 
Kasuga (Kobe University) (40). cDNA of PKC^ was kindly provided by 
Dr. Douglas Ways (Lilly). cDNA of dominant negative PKC^ (DNPKC^, 
substituted Lys-273 to Trp in ATP-binding site) was constructed as 
described (41). The recombinant adenoviruses were constructed by ho- 
mologous recombination between the parental virus genome and the 
expression cosmid cassette or shuttle vector as described (42, 43). The 
adenoviruses were applied at a concentration of 1 x 10^ plaque-forming 
units/ml, and adenoviruses with the same parental genome carrying 
the lacZ gene or enhanced green fluorescein protein gene (CLONTECH, 
Palo Alto, CA) were used as controls. Expression of each recombinant 
protein was confirmed by Western blot analysis and increased about 
10-fold compared with cells infected with the control adenovirus. 

Immunoprecipitation — Cells were washed three times with cold 
phosphate-buffered saline and solubilized in 200 fd of lysis buffer (1% 
Triton X-100, 50 mmol/liter HEPES, 10 mmol/liter EDTA, 10 mmol/liter 
sodium pyrophosphate, 100 mmol/liter sodium fluoride, 1 mmol/liter 
sodium orthovanadate, 1 /ig/ml aprotinin, 1 fig/m\ leupeptin, and 2 



mmol/liter phenylmethylsulfonyl fluoride). After centrifugation at 
12,000 rpm for 10 min, 1.0 mg of protein was subjected to immunopre- 
cipitation. To clear the protein extract, protein A-Sepharose (20 fx\ of a 
50% suspension) was added to the cell lysates, after which they were 
incubated for 1 h, followed by centrifugation and collection of the 
supernatant. A specific rabbit anti-KDR or Fltl antibody was added 
and rocked at 4 "C for 2 h; 20 ^1 of protein A-Sepharose was then added, 
and the sample was rocked for another 2 h at 4 °C. For denaturation, 
protein A-Sepharose antigen-antibody conjugates were separated by 
centrifugation, washed five times, and boiled for 3 min in Laemmli 
sample buffer. 

Western Blot Analysis — Immunopredpitated proteins or 30 /tg of 
total cell lysates were subjected to SDS-gel electrophoresis and electro- 
transferred to nitrocellulose membrane (Bio-Rad). The membrane was 
soaked in blocking buffer (phosphate-buffered saline containing 0.1% 
Tween 20 and 5% bovine serum albumin) for 1 h at room temperature 
and incubated with primary antibody overnight at 4 *C followed by 
incubation with horseradish peroxidase-conjugated secondary antibody 
(Amersham Pharmacia Biotech). Visualization was performed using 
the enhanced chemiluminescence detection system (ECL, Amersham 
Pharmacia Biotech) per the manufacturer's instructions. 

PI3-Kinase Assay — PI3-kinase activities were measured by the in 
vitro phosphorylation of PI (10). Cells were lysed in ice-cold lysis buffer 
containing 50 mM HEPES, pH 7.5, 137 mM NaCl, 1 mM MgClj. 1 mM 
CaClg, 2 mM Na3V04, 10 mM NaF, 2 mM EDTA, 1% Nonidet P-40, 10% 
glycerol, 1 mM phenylmethylsulfonyl fluoride, 2 /xg/ml aprotinin, 5 
p.g/ml leupeptin, and 1 ^tg/ml pepstatin. Insoluble material was re- 
moved by centrifugation at 16,000 X g for 10 min at 4 *C. PI3-kinase 
was immunopredpitated from aliquots of the supernatant with an- 
tiphosphotyrosine antibodies. After successive washings, the pellets 
were resuspended in 50 /x.1 of 10 mM Tris, pH 7.5, 100 mM NaCl, and 1 
mM EDTA. 10 /il of 100 mM MgClg and 10 ItX of PI (2 /xg/juil) sonicated in 
10 mM Tris, pH 7.5, with 1 mM EGTA was added to each pellet. The 
PIS-kinase reaction was initiated by the addition of 5 of 0.5 mM ATP 
containing 30 /xCi of [y-^^P] ATP. After 10 min at room temperature with 
constant shaking, the reaction was stopped by the addition of 20 ptl of 8 
N HCl and 160 /jlI of chloroform/methanol (1:1). The samples were 
centrifuged, and the organic phase was removed and applied to silica 
gel TLC plates developing in CHCVCH3OH/H2O/NH4OH (60:47:11:2). 
The radioactivity in spots was quantified by Phosphorlmager (Molecu- 
lar Dynamics, Sunnyvale, CA). 

Amplification of Human CTGF cDNA Using Reverse Transcriptase- 
Polymerase Chain Reaction (PC/?>— cDNA templates for PCR were syn- 
thesized by reverse transcriptase (First Strand cDNA Synthesis Kit, 
Amersham Pharmacia Biotech) from human fibroblast according to the 
method recommended by the manufacturer. A standard PCR was per- 
formed (PCR optimizer kit, Invitrogen, Carlsbad, CA) using 5'-AGGG- 
CCTCTTCTGTGACTTCG-3' (sense primer) and 5'-TCATGCCATGTC- 
TCCGTACATC-3' (antisense primer) (20). The PCR products were 
then subcloned into a vector (pCRII, Invitrogen) and sequenced in 
their entirety, and comparison with the published human sequences 
revealed complete sequence identity. This cDNA probe was used for 
hybridization. 

Northern Blot Analysis — Total RNA was isolated using acid-guani- 
dinium thiocyanate, and Northern blot analysis was performed as de- 
scribed previously (44). Total RNA (20 /utg) was electrophoresed through 
1% formaldehyde-agarose gels and then transferred to a nylon mem- 
brane. ^'''P-Labeled cDNA probes were generated by use of labeling kits 
(Megaprime DNA labeling systems, Amersham Pharmacia Biotech). 
After ultraviolet cross-linking using a UV cross-linker (Stratagene, La 
Jolla, CA), blots were pre-hybridized, hybridized, and washed in 0.6X 
SSC, 5% SDS at 65 *C with 4 changes over 1 h. All signals were 
analyzed using a Phosphorlmager, and lane loading differences were 
normalized using the 36B4 cDNA probe (45). 

Analysis of CTGF mRNA Half-life—CTGF mRNA half-life experi- 
ments were carried out using BREC and BRPC. The cells were exposed 
to vehicle or VEGF (25 ng/ml) for the indicated periods prior to mRNA 
stability measurements. Transcription was inhibited by the addition of 
actinomycin D (5 Mg/ml). For inhibition of protein synthesis, cells were 
treated with cycloheximide (10 pig/ml) for the times indicated. 

Statistical Analysis — Determinations were performed in triplicate, 
and all experiments were repeated at least three times. Results are 
expressed as the mean ± S.D., imless otherwise indicated. Statistical 
analysis employed Student's t test or analysis of variance to compare 
quantitative data populations with normal distributions and equal vari- 
ance. Data were analyzed using the Mann-Whitney rank sum test or the 
Kruskal-Wallis test for populations with non-normal distributions or un- 
equal variance. A p value of <0.05 was considered statistically significant. 
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Fig. 1. Stimulation of CTGF mRNA expression by VEGF and 
PIGF. A (BREC) and B (BRPC), time-dependent induction of CTGF 
mRNA after stimulation by 25 ng/ml VEGF. C (BREC) and D (BRPC), 
dose-dependent induction of CTGF mRNA 6 h after stimulation by 
VEGF. E (BREC) and F (BRPC), time-dependent induction of CTGF 
mRNA after stimulation by 25 ng/ml PIGF. Size of mRNAs corre- 
sponded to -2.4 (CTGF) and -1.6 kb (36B4), Representative Northern 
blots and control 36B4 (top) and quantitation of three experiments after 
normalization to the control signals are shown (bottom). Asterisks indi- 
cate significant differences at p < 0.05 (*), p < 0.01 (**), and p < 0.001 



RESULTS 

CTGF mRNA Expression by VEGF and PIGF— The effects of 
VEGF on the expression of CTGF mRNA were studied by 
Northern blot analysis in BREC and BRPC. As shown in Fig, 1, 
A and B, 25 ng/ml VEGF increased CTGF mRNA (-2.4 kb) 
levels in a time-dependent manner, reaching a maximum after 
6 h in BREC (3.1 ± 0.70-fold, p < 0.001, Fig. lA) and after 9 h 
in BRPC (2.0 ± 0.22-fold, p < 0.01, Fig. LB). 

The dose response to VEGF-induced CTGF mRNA expres- 
sion was studied after 6 h of VEGF stimulation. As shown in 
Fig. 1, C and D, the expression of CTGF mRNA was up-regu- 
lated in a dose-dependent manner, with significant increases 
observed at concentrations as low as 0.25 n^ml in both BREC 
(Fig. IC) and BRPC (Fig, ID). Maximal increases were ob- 
served at VEGF concentrations of 25 ng/ml in both BREC and 
BRPC. 

Since BREC and BRPC may express both KDR and Fltl, we 
examined the effects of PIGF, a Fltl-specific ligand (46-48), on 
the induction of CTGF gene expression in vascular cells. As 
shown in Fig. IE, CTGF mRNA levels were not affected after 
stimulation of 25 ng/ml of PIGF in BREC. In contrast, PIGF 
increased CTGF mRNA after 3 h of stimulation, which peaked 
after 9 h in BRPC (1.9 ± 0.30-fold, p < 0.01, Fig. IF), suggest- 
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Fig. 2. Effect of VEGF on CTGF protein expression. A, confluent 
monolayers of BREC ileft) or BRPC {right) in serum-free media were 
studied after 10 h, with and without 25 ng/ml of VEGF. An equal 
amount of protein for each sample was subjected to 10% SDS-gel elec- 
trophoresis and transferred to a nitrocellulose membrane. Signals were 
detected by Western blot analysis with anti-CTGF antibody. The size of 
the CTGF protein corresponded to -38 kDa. B, VEGF (25 ng/ml) and 
TGF-131 (10 ng/ml) were studied as described in Figs. 1 and 2A. VEGF 
and TGF-/31 increased (TTGF protein and mRNA expression by a sim- 
ilar amount. 



ing that VEGF-induced CTGF gene expression was mediated 
primarily by KDR in BREC and Fltl in BRPC, 

VEGF Induction of CTGF Protein Production—To determine 
if the effects of VEGF on CTGF mRNA were correlated with its 
protein level, CTGF protein expression was assessed by West- 
em blot analysis using anti-human CTGF antibody. As shown 
in Fig. 2A, the detected size of CTGF protein was -38 kDa in 
both BREC and BRPC. VEGF (25 ng/ml) increased the level of 
CTGF protein after 10 h in both BREC and BRPC. Compara- 
tive studies were performed on the effects of VEGF (25 ng/ml) 
and TGF-/31 (10 ng/ml) on the expression of CTGF mRNA and 
protein. As shown Fig. 2B, VEGF and TGF-pi increased CTGF 
protein expression by a similar amoimt (2.5 ± 0.4- and 2.8 ± 
0.8-fold, respectively, in BREC). CTGF mRNA levels were also 
increased a similar extent (3.0 ± 0.3- and 3.3 ± 0.5-fold, 
respectively). 

Effects of VEGF on the Half-life of CTGF mRNA— The effects 
of VEGF on the stability of CTGF mRNA were examined. 
Northern blot analyses were performed with addition of acti- 
nomycin D (5 /xg/ml) after 6 h of VEGF (25 ng/ml) stimulation. 
In BREC (Fig. 3A) and BRPC (Fig. SB), the half-life of CTGF 
mRNA was 1.7 and 3.6 h, respectively. There was no significant 
difference between VEGF-treated and -untreated cells. 

Effects of Cycloheximide on CTGF mRNA Regulation — In 
order to examine the possibility that VEGF regulates CTGF 
mRNA expression through new protein synthesis of cytokines 
or transcription factors, cells were treated for 6 h with VEGF 
(25 ng/ml) and a protein sjnithesis inhibitor, cycloheximide (10 
^^ml). Fig. 3, C and D, shows that cycloheximide did not 
prevent the increase of CTGF mRNA. Addition of both VEGF 
and cycloheximide increased CTGF mRNA 2.4 ± 0.41-fold in 
BREC (Fig. 3C) and 2.5 ± 0.40-fold in BRPC (Fig. 3Z)) after 6 h 
as compared with cycloheximide alone (p < 0.01). These data 
suggest that the stimulation of CTGF mRNA expression by 
VEGF was not induced by increased synthesis of a regulatory 
protein. 

Involvement of ERK and PIS-Kinase-Akt in VEGF Signal- 
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Fig. 3. A and J3, decay of CTGF mRNA in the presence of actinomycin 
D. C and effects of cycloheximide on CTGF mRNA regulation. A 
(BREC) and B (BRPC). cells were incubated with or without 25 ng/ml 
VEGF for 6 h, and de novo mRNA transcription was inhibited by 
addition of actinomycin D (5 ^g/ml). Total RNA was extracted at 2 and 
4 h after administration of actinomycin D. The values in the gpraph 
indicate the percentage of initial CTCjtF mRNA signal remaining in the 
specific conditions. C (BREC) and D (BRPC), cells were incubated with 
or without VEGF (25 ng^ml) and cycloheximide iCHX, 10 jxg/ml) for 6 
and total RNA was analyzed. Representative Northern blots and con- 
trol 36B4 {top) and quantitation of three experiments after normaliza- 
tion to the control signal are shown {bottom). Asterisks indicate signify 
icant differences at p < 0.05 (*) and p < 0.01 (**). 

ing — Since ERK and PI3-kinase-Akt pathways have been re- 
ported to play central roles in VEGF signaling and biological 
actions (9—15), we investigated whether or not VEGF can ac- 
tivate ERK and PI3-kinase-Akt pathways equally in BREC and 
BRPC. As shown in Fig. 4A, immunoblot analysis of immuno- 
precipitates of KDR from BREC stimulated with VEGF or 
PIGF using an antibody to phosphotyrosine and PI3-kinase p85 
subunit demonstrated that VEGF, but not PIGF, promoted the 
tyrosine phosphorylation of KDR and interactions of KDR and 
p85 subunit of PI3-kinase. In contrast, as shown in Fig. 4B, 
immunoblot analysis of immunoprecipitates of Fit 1 from BRPC 
stimulated with VEGF or PIGF demonstrated that both VEGF 
and PIGF increased the tyrosine phosphorylation of Fltl and 
interactions of Fltl and p85 subunit of PI3-kinase. These data 
suggest that VEGF can activate the receptor tyrosine phospho- 
rylation and interaction with PI3 -kinase p85 subunit in both 
KDR and Fltl. 

To investigate the activation of Akt and ERK, we next per- 
formed immunoblot analysis with anti-phosphorylated Akt or 
anti-phosphorylated ERK antibodies using total cell lysates 
from BREC or BRPC stimulated with VEGF. As shown in Fig. 
4, C and £>, VEGF induced phosphorylation of both Akt and 
ERK in BREC by 3.1- and 5.8-fold (Fig. 4C), but only induced 
phosphorylation of Akt in BRPC by 2.6-fold (Fig. 4Z». No effect 
on ERK phosphorylation was observed in BRPC. These data 
suggest that VEGF activated both ERK and PI3-kinase-Akt 
pathways in BREC, but stimulated only PI3-kinase-Akt path* 
way in BRPC. 

Since the activation of PI3-kinase by VEGF has not been 
reported in BRPC. we studied the effects of VEGF on PI3- 
kinase activity in BRPC. As shown in Fig. 5, the addition of 
VEGF (25 ng/ml) increased PI3-kinase activity in a time-de- 
pendent manner by 2.1 ± 0.27-fold (p < 0.01) after 5 min and 
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Fig. 4. VEGF effects on the ERK and PI3-kinase-Akt. A (BREC) 
and B (BRPC), cells were incubated with VEGF (25 ng/ml) or PIGF (25 
ng/ml) for 6 min. Proteins immunopFecipitated UP) with KDR (A) or 
Fltl (B) antibodies were fractionated by SDS-polyacrylamide gel elec- 
trophoresis. Immunoblots were probed with an antibody to phosphoty- 
rosine (pY, top) and reprobed with antibody to p85 {bottom) and KDR (A, 
middle) or Fltl (S, middle). C (BREC) and D (BRPC), cells were incu- 
bated with VEGF (25 ng/ml) for 15 min. Immunoblots were probed with 
an antibody to phosphorylated Akt {top) and reprobed with antibody to 
total Akt {middle) or phosphorylated ERK (bottom). WB^ Western blot. 

BRPC 




FfG. 5. Effects of VEGF on PIS-kinase activation in BRPC. 
BRPC were incubated with VEGF (25 ng/ml) for the indicated times and 
harvested. Equal amounts of lysates were immunoprecipitated with 

anti-phosphotyrosine antibody, and immunocomplexes were assayed 
for their ability to phosphorylate PI to phosphatidylinositol phosphate 
{PIP). Representative autoradiogram {right) and quantitation of three 
experiments in the percentage of intensity of control {left) are shown. 
Asterisks indicate significant differences atp< 0.05 (*) and p < 0.01 
(*•). 

by 1.6 ± 0.17-fold (p < 0.05) after 10 min in BRPC. 

Effects of PKC, ERK, and PI3 Kinase Inhibition on VEGF- 
induced CTGF Expression — ^To investigate the signaling path- 
ways involved in VEGF-induced CTGF expression, the effects 
of inhibition of PKC, ERK, and PIS-kinase were determined. 
Cells were treated with 25 ng/ml VEGF for 6 h after pretreat- 
ment with the kinase inhibitor GF 109203X, a classical and 
novel PKC-specific inhibitor (1 /xM) (49, 50); PD98059, a MAPK/ 
ERK kinase inhibitor (20 ^m); or wortmannin, a PI3-kinase 
inhibitor (100 nM) (Fig. 6, A and B). Neither GF 109203X nor 
PD98059 had significant effects on VEGF-induced CTGF 
mRNA expression, but wortmannin inhibited the effects of 
VEGF by 88 ± 6.5% {p < 0.01) in BREC (Fig. 6A) and 78 ± 22% 
(p < 0.01) in BRPC (Fig. 6B). To confirm further the involve- 
ment of PIS-kinase in VEGF-induced CTGF expression, we 
used recombinant adenoviruses encoding DNRas, DNERK, or 
Ap85 of PIS-kinase. BREC were transfected with each adeno- 
viral vector, followed by stimulation with 25 ng^ml VEGF for 
6 h. Neither DNRas nor DNERK had significant effects on 
VEGF-induced increase in CTGF mRNA, but Ap85 of PIS- 
kinase completely inhibited VEGF-induced CTGF expression 
(p < 0.001, Fig. 6C). 

Role ofPKCiandAktfPKB in VEGF-induced CTGF Expres- 
sion — Since it has been reported that atypical PKC (51-55) and 
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Fig. 6. Characterization of the role of PKC, ERK, and PIS- 
kinase in VEGF-induced CTGF expression. A (BREC) and B 
(BRPC), cells were pretreated with GF 109203X {GFX, 1 /xm), PD98059 
(PD, 20 /xm), or wortmannin (WM, 100 dm) followed by stimulation with 
25 ng/ml VEGF for 6 h. C, BREC were infected with control (C), DNRas, 
DNERK, or Ap85 by adenoviral vectors. After 24 h, cells were starved 
overnight, followed by stimulation with 25 ng/ml VEGF for 6 h. Repre- 
sentative Northern blots and control 36B4 (top) and quantitation of 
three experiments after normalization to the control signals are shown 
(bottom). Asterisks indicate significant differences atp< 0.01 (**) and 
p < 0.001 (***). 



Akt^KB (13, 14, 56) have significant roles as signaling mole- 
cules downstream of PI3-kinase, we examined the involvement 
of PKC^ and Akt in this process, BREC were infected with each 
adenoviral vector, followed by stimulation with 25 ng/ml VEGF 
for 6 h. Neither wild type PKC^ nor DNPKC^ had significant 
effects on VEGF-induced increase in CTGF mRNA (Fig. 7 A). In 
contrast, as shown in Fig. 7B, infection with CAAkt increased 
CTGF mRNA expression 2.1 ± 0.21-fold (p < 0.01) without 
VEGF and 2.5 ± 0.40-fold with VEGF. Overexpression with 
adenoviral vector containing DNAkt inhibited VEGF-induced 
CTGF expression by 86 ± 13% ip < 0.01). 

DISCUSSION 

In this study, we have shown that VEGF can increase the 
mRNA expression of CTGF in a time- and concentration-de- 
pendent manner in both microvascular endothelial cells and 
contractile cells (capillary pericytes) possibly indicating that 
the effects of VEGF on CTGF expression may occur in all cells 
with VEGF receptors. This possibility is supported by the re- 
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Fig. 7. Role of PKCf and Akt/PKB in VEGF-induced CTGF 

expression. BREC were infected with adenoviral vectors containing 
cDNAs of control, wild type PKC^, and DNPKCf (A) or control, DNAkt, 
and CAAkt (B). After 24 h, cells were starved overnight, followed by 
stimulation with 25 ng/ml VEGF for 6 h. Representative Northern blots 
and control 36B4 (top) and quantitation of three experiments aft«r 
normalization to the control signals are shown (bottom). Asterisks indi- 
cate significant differences at p < 0.01 (**). 

suits showing that both Fltl (VEGFRl) and KDR/Flkl 
(VEGFR2) can mediate the increases in CTGF mRNA expres- 
sion. The ability of Fltl to induce increases in CTGF mRNA 
levels is demonstrated in the pericytes that have predomi- 
nantly Fltl receptors and where the expression of KDR/Flkl 
receptors were not significantly high enough to be determined 
by Northern blot analysis as reported in a previous publication 
(57). In addition, PIGF, a Fltl receptor-specific ligand (48), was 
able to induce CTGF mRNA levels in BRPC but not in BREC, 
again supporting the postulate that VEGF can induce CTGF 
mRNA by activating through Fltl in pericytes. The KDR/Flkl 
receptors in the endothelial cells can also induce CTGF gene 
expression since KDR/Flkl receptors are the predominant 
VEGF receptors in endothelial cells (58), and PIGF was not 
effective in inducing CTGF mRNA expression in endothelial 
cells. Further studies will be necessary to determine whether 
other types of VEGF receptors, such as Flt4 (59) and neuropi- 
lin-1 (60), which are present in endothelial cells, can also in- 
duce CTGF expression. 

The VEGF dose-response curves for CTGF in both BRPC and 
BREC are similar and suggest that VEGF binds to high affinity 
receptors, consistent with the known Kj values of Fltl and 
KDR/Flkl at 10-100 pM (58, 61). VEGF-induced CTGF mRNA 
is most likely due to an induction of transcription rather than 
altering the half-life of CTGF mRNA since the addition of 
VEGF failed to change the degradation rates of CTGF mRNA. 
The time course of the action of VEGF on CTGF (which re- 
quired 6—9 h) suggests this is potentially a chronic action of 
VEGF. In addition, the time needed to achieve maximum effect 
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is also consistent with the calculated mRNA half-life of CTGF 
mRNAof2-4h. 

From a biological perspective, the effects of VEGF on CTGF 
mRNA could potentially have important physiological impact 
for several reasons. First is that the increase in CTGF mRNA 
results in increased protein levels. Second, the VEGF concen- 
tration that was minimally active (0.25 ng/ml) can easily bind 
and activate a significant percentage of the VEGFR-1, -2 re- 
ceptors (58, 61). Third, this low level of VEGF may exist even in 
non-pathological states, suggesting that low levels of VEGF 
may have physiological actions on maintaining extracellular 
matrix production via the induction of CTGF. At 2.5—25 ng/ml 
VEGF which are encountered in hypoxic and angiogenic states 
(62), the induction of CTGF expression by VEGF could poten- 
tially induce the fibrosis that frequently accompanies neovas- 
cularization. This possibility is supported further by the dem- 
onstration that the protein levels of CTGF expression were 
increased 10 h after the addition of VEGF that was consistent 
with the maximum increase in the mRNA levels at 6-9 h. In 
addition, the potency of VEGF on CTGF expression appeared to 
be similar to TGF-/31, suggesting that both of them could in- 
duce fibrosis associated with neovascularization. 

The activation of the endogenous tyrosine kinases of KDR/ 
Flkls can stimulate multiple signaling pathways, including 
Ras-ERK (63), PI3-kinase-Akt (13-15), and phospholipase Cy- 
PKC (9-11) cascades. Much less is known regarding the regu- 
lation of Fit 1 receptors. The results in BREC confirmed previ- 
ous publications that VEGF can increase the tyrosine 
phosphorylation of KDR/Flkl and its interaction with p85 sub- 
imit of PI3-kinase. In addition, VEGF also activates the 
ERKl/2 pathway confirming earlier reports from many labora- 
tories, including ours. In contrast, VEGF was unable to acti- 
vate ERKl/2 but stimulated the activation of PI3-kinase and 
phosphorylation of Akt in BRPC. These results have provided 
further direct evidence that the signaling pathways for Fltl in 
vascular cells are different from those for KDR/FIkl. The lack 
of effect on ERKl/2 activation also supports the h3rpothesis that 
Fltl, unlike KDR/Flkl, is not involved in mitogenic actions 
(64). Further studies will be needed to determine the structural 
differences responsible for the inabihty of VEGFRl to engage 
the Ras-ERK pathway. 

The present report does provide the strong evidence that 
VEGF is inducing CTGF gene expression in both endothelial 
cells and pericytes via VEGFRl or -R2 by the activation of 
PI3-kinase and Akt. This evidence includes the ability of wort- 
mannin, a PI3-kinase inhibitor, to inhibit the effects of VEGFs 
in both cell types, whereas PD98059, a MAPK/ERK kinase 
inhibitor, and GF109203X, a general classical PKC and novel 
PKC inhibitor (1 ^m) (49, 50), did not have significant actions. 
Adenovirus containing dominant negative mutants of p85 sub- 
unit of PI3-kinase or Akt inhibited the action of VEGFs, 
whereas overexpression of dominant negative mutants of Ras 
and ERKl by adenovirus vectors did not inhibit CTGF mRNA 
expression. Conversely, the overexpression of constitutive ac- 
tive Akt increased CTGF mRNA expression by 2.5-fold. The 
molecular steps between Akt activation and the enhancement 
of CTGF gene expression in the nucleus remain unclear, al- 
though the PKC^ isoform is most likely not involved since the 
overexpression of either the wild type or dominant negative of 
PKC^ isoform did not alter the effects of VEGF on CTGF mRNA 
levels. 

The molecular processes between Akt phosphorylation and 
CTGF gene expression in the nucleus have not been studied. 
However, Pendurthi et al. (31) reported that factor VII and 
thrombin induced CTGF gene expression through a PI3-kinase- 
dependent pathway. TGF-^ has been reported to increase the 



transcription rates of CTGF. A promoter element of CTGF, 
which is responsive to TGF-^ stimulation, has been reported to 
be present between —162 and — 128 nucleotides in the 5' region 
(65). However, it is unlikely that the effects of VEGF on CTGF 
mRNA levels are mediated via the expression of TGF-0 since 
the addition of cycloheximide did not change these effects. 

In summary, these results have provided the first evidence 
that VEGF can induce the expression of CTGF via both Fltl 
and KDR/Flkl by the selectively activated PI3-kinase-Akt 
pathway but independent of the Ras-ERK pathway. In addi- 
tion, the spectrum of signaling pathways may be different 
between Fltl and KDR/Flkl, possibly reflecting their physio- 
logical roles. Biologically, these results support the conclusion 
that VEGF, through its effects on CTGF expression, may have 
physiological roles such as the maintenance of capillary 
strength and wound healing via the extracellular matrix pro- 
duction. In disease states, VEGF-induced CTGF may cause the 
proliferation of fibrocellular components in retinal neovascular 
diseases such as proliferative diabetic retinopathy and age- 
related macular degeneration. 
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Basic fibroblast growth factor is a substrate for protein 
phosphorylation and is phosphorylated by capillary 
endothelial cells in culture 
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abstract a phosphorylated basic fibrobla^ growth 
factor (FGF) can be detected In extracts of bovine capillary 
endothelial ceUs and human hepatoma cells. Accordingly, 
human basic FGF contains consensus sequences that account 
for its phosphorylation on Thr^ll2 by the catalytic sabunit of 
the cAMP-dependoit protein kinase A (PK- A) and on Scr-64 by 
the calcium- and phosphoUpld-dependent protein Icinase C 
(PK-C). A kinetic analysis of both of these reactions revealed 
that basic FGF Is among the better substrates for these 
enzymes. Although the kinase responsible for the phosphoryl- 
ation III vivo has not yet been Identified, we examined the effects 
of pboqihorylation on the biological activity, heparin-UndIng 
capacity, and receptors-binding capacity of pho8|diorylated 
basic FGF. No effects of phosphorylation were observed when 
the mitogen was phosphorylated by PK-C. In contrast, when 
basic FGF was phosphorylated In the receptor-binding domain 
with PK-A, the growth factor was 3-8 times better at displacing 
radiolabeled basic FGF In the radioreceptor assay. No effects 
were seen on the binding of this FGF to immobilized heparin 
or cell-associated glycosaminoglycans, suggesting that this 
phosphorylation modifies the affinity of basic FGF for its 
receptor. Biological assays for basic FGF failed to Identify 
differences between the phosphorylated and unphosphorylated 
forms of recombinant bask FGFs presumably because of the 
presence of ectophosphatases and the experimental conditions 
of proliferation and mitogenic assays (37^C, 24-96 hr). Because 
the relative affinity of basic FGF for Its receptor and cell- 
associated glycosaminoglycans may regulate Its activity, the 
identification of a modified form of basic FGF may t>e of 
particular importance In understanding the mechanisms that 
regulate Its biological activity, bioavailability, and processing 
to and from the extracellular matrix. 



Although the acidic and basic fibroblast growth factors 
(FGFs) have been clearly established to participate in the 
regulation of the proliferation of many cell types, the struc- 
tural characterization of these molecules (1, 2) has raised 
many questions regarding their possible mechanism of ac- 
tion. As an example, both mitogens lack a classical signal 
sequence (3, 4) that would allow their secretion and access to 
their putative receptor located on the extracellular surface of 
the plasma membrane (5, 6). Because both acidic and tiasic 
FGFs have been found in the extracellular nuttrix, it has been 
suggested that they utilize an alternative secretion pathway to 
the outside of the cell and that their activity is tightly 
regulated by limiting their bioavailability (7-10). In view of 
the fact that both FGFs are also characterized by their high 
affinity for immobilized heparin and that they remain cell- 
associated even when expressed with consensus signal pep- 
tide sequences (11, 12), we have been investigating the 
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possibilities that these proteins are cytoplasmic and that 
posttranslational changes might modulate their processing to 
and from the extracellular matrix. 

Protein phosphorylation is a posttranslational modification 
that has been implicated in the regulation of almost all steps 
of cell division (13-16). Recently, Plouat et ai. (17) reported 
that the release of acidic FGF from rod outer segment 
membranes requires ATP-dependent protein phosphoryl- 
ation. Because basic and acidic FGFs contain consensus 
sequences for phosphorylation by the phospholipid- and 
calcium-dependent protein kinase C (PK-C) and basic FGF 
contains a consensus sequence for the cAMP-dependent 
protein kinase A (PK-A), we tested the hypothesis that 
growth factors, and in particular FGFs, are potential sub- 
strates for protein phosphorylation. 

MATERIALS AND METHODS 

Materials. PK-C was a gift from G. Walton (University of 
California-San Diego, La Jolla) and I. Vtlgrain and J. M. 
Pelarin (Unit6 244, Institut National de la SanU et de la 
Recherche M^icale, Grenoble, France). The catalytic sub- 
unit of PK-A was provided by S. Taylor (University of 
California-San I>iego, La Jolla). Casein kinases I and II were 
purified from bovine lung as previously described and were 
provided to us by C. Cochet (Unit^ 244, Institut National de 
la Sant6 et de la Recherche M^icale, Grenoble) (18, 19). 
Recombinant human basic and acidic FGFs (20) were obtained 
from P. Barr and L. Cousens (Chiron). Synthetic transforming 
growth factor a (TGF-a) was supplied by Nicholas Ling (Salk 
Institute), and platelet-derived growth factor (PDGF) was a 
gift of R. Ross (University of Washington, Seattle). Recom- 
binant insulin-like growth factor I (IGF-I) was obtained from 
Fbjisawa Pharmaceuticals (Osawa, Japan); nerve growth fac- 
tor (NGF), from D. 3chubert (Salk Institute); and tumor 
necrosis factor a (TNF-a), from Suntory Institute for Biomed- 
ical Research (Osaka, Japan). [y-^^JATP (4000 Ct/nmiol; 1 Ci 
= 37 GBq) and [r-^^]orthophosphate were purchased from 
ICN. Antiserum 773 is an antiserum against basic FGF-(l-24) 
coiuugated to bovine serum albumin and was raised in n^bits. 
Protein A-Sepharose was obtained from Pharmacia. Reagents 
for sodium dodecyl sulfate/polyacrylamide gel electrophore- 
sis (SDS/PAGE) were obtained from Bio-Rad, and all other 
materials were from Sigma. 

Phosphorylation of Growth Factors by PK-C and PK-A. 
Different growth factors (*«0.5 mS) were incubated for 15 min 
at 30*'C in the presence of purified PK-C from bovine brain 



Abbreviations: FGF; fibroblast growth factor; PK-A, cAMP- 
dependent protein kinase; PK-C, calcium- and phospholipid- 
dependent protein kinase; IGF, insulin-like growth factor; TGF, 
transforming growth factor; PDGF, platelet-derived growth factor; 
EOF, epidermal growth factor; NGF, nerve growth factor; TNF, 
tumor necrosis factor; ACE cells, adrenocortical capillary endothe- 
lial cells. 
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(21)» the catalytic subunit of PK-A purified from porcine 
skeletal muscle (22), or bovine lung casein kinases I and II 
(18, 19). PK-C assays were performed in 20 fil of 10 mM 
Tris HCl (pH 7.5) containing 10 mM [>-^^]ATP (1500 
cpm/pmol), 10 mM MgCb, 0.6 mM CaCl2, 40 ^8 of phos- 
phatidylserine per ml, 0.8 /ig of dioctanoylglycerol per ml; 
and 1 /tg of PK-C per ml. PK-A assays were performed in 20 
fjLl of 10 mM Tris HCl (pH 7.5) containing 5 [y'^JATP 
(3000 cpm/pmol), 10 mM MgCl2, and 3 /ig of the catalytic 
subunit of PK-A per ml. Casein kinase assays were per- 
formed in 20 m1 of 10 mM Tris-HQ (pH 7.5) containing 10 /iM 
[y-^^P]ATP (3000 cpm/pmol), 50 mM Mga2, and either 10 Mg 
of CK-I or 4 ^ig of CK-II per ml. The reactions were stopped 
with Laenunli sample buffer, and the phosphorylated pro- 
teins were separated by 15% or 20% SDS/OPAGE (22) on a 
mini-slab-gel apparatus (Idea Scientific, Corvallis, OR) and 
visualized by overnight autoradiography. 

Phosphoamino Add Analysis and Tryptic Mapping. Radio- 
labeled peptides were extracted fix>m the polyacrylamide gel 
in 0.05 M ammonium bicaibonate (pH 7.3-7.6) supplemented 
with 0.1% SDS and 1% 2-mercaptoethanol. After precipita- 
tion with 50% trichloroacetic acid, the pellet was dissolved in 
6 M HCI, and the protein was hydrolyzed for 60 min at 110°C. 
Phosphoamino acids were separated by two-dimensional 
high-voltage electrophoresis as describeld (23). The plates 
were run for 20 min at 1.5 kV in pH 1.9 buffer in the first 
dimension and then for 16 min at 1.3 kV in pH 3.5 buffer in 
the second dimension. 

PhospborylatkHi of Basic FGF by Cells in Cultnre. Three 
10-cm dishes of SK-HEP-1 hepatoma cells and adrenocorti- 
cal capillary endothelial cells (ACE cells) (21 x 10^ cells) 
were labeled overnight at 3T'C in phosphate-free Dulbecco*s 
modified Eagle's medium (DMEM) supplemented with 2% 
dialyzed calf serum, 0.5% (SK-HEP-1) or 5% (ACE) normal 
DMEM, and 0.25 /iCi (SK-HEP-1) or 0.40 ftCi (ACE) of 
P^PJorthophosphate per ml. At the end of the labeling period, 
cells were washed twice with phosphate-bufifered saline and 
lysed in radioimmunoprecipitation assay (RIPA) buffer (24) 
for 30 min at 4'*C. After centrifiigation in an Eppendorf 
Microfiige, the supematants were inmiunoprecipitated by 
using the polyclonal antibody 773 (1:2(X)) in the presence or 
absence of 5 /ig of basic FGF. Twenty-five microliters of 
protein A-Sepharose suspension (50% in RIPA buffer) was 
added to the tube and mixed for 30 min. At the end of this 
incubation, the solution was centrifuged, and the pellet was 
washed three times with RIPA buffer, resuspended in Laenun- 
li sample buffer and analyzed by 15% SDS/PAGE (22); 

Biological Assays ci FGF Acttrity. Basic FGF was phos- 
phorylated with PK-C or the catalytic sul>unit of PK-A in the 
presence of unlabeled ATP under the conditions described 
above. Controls consisted of basic FGF treated with an 
identical incubation, but in the absence of enzyme. The 
phosphorylated and nonphosphorylated mitogens were 
tested for their capacity to bind the FGF receptor by using the 
BHK cell assay described by Moscatelli (25). Their effects on 
the proliferation of capillary endothelial cells was performed 
as described (26), and cell numbers were determined with a 
Coulter Counter. 

RESULTS 

Addk and Basic FGFs Are Substrates for Protein Pho^ho- 
rylation. Human basic FGF was phosphorylated by both 
PK-C and the catalytic subunit of PK-A (Fig. 1). In the 
absence of substrate, there was clear self-phosphorylation of 
PK-C, PK-A, and casein kinase II (lanes 1, 2, and 4), and as 
expected (19, 21) casein kinase I showed little, if any, 
setf-phosphorylation Qane 3). Basic FGF was a good sub- 
strate for PK-C (lane 5) as shown by the appearance of an 
intense radiolabeled band corresponding to 18 kDa. This 
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Fig. 1. Phosphorylation of recombinant acidic FGF and basic 
FGF by various purified protein kinases: Enzymes alone (lanes 1-4) 
or with 0.5 of basic (lanes 5-8) or acidic (lanes 9-12) FGF were 
tested for their protein kinase activity. Purified PK-C (lanes 1, 5, and 
9), PK-A (lanes 2, 6, and 10), casein kinase I Qsoics 3, 7, and 11) or 
casein kinase II (CK II) (lanes 4, 8, and 12) were incubated under the 
standard phosphorylation conditions described in Materials and 
Methods^ and the radiolabeled proteins were separated by SDS/15% 
PAGE and visualized by autoradiography. Arrowheads show the 
position of the autophosphorylated enzymes and of the phosphoryl- 
ated growth factor. 

phosphorylation was also shown to be strongly phospholipid 
dependent (not shown). Similar results shown in lane 6 of Fig. 
1 establish that human basic FGF is a good substrate for 
PK-A as well. There was no evidence that basic FGF is a 
substrate for casein kinase I (lane 7), and only trace amounts 
of the mitogen were phosphorylated by casein kinase II Oane 
8). An analysis of the results with acidic FGF are presented 
in lanes 9-12. While human acidic FGF was a substrate for 
PK-C-dependent phosphorylation (\me, 9), PK-A and casein 
kinase I were ineffective Qanes 10 and 11), and casein kinase 
il had a small but negligible effect Qasic 12). 

Identifkation of the Site of Phosphorylation. Phosphoamino 
acid analysis of radiolabeled human basic FGF revealed the 
presence of phosphoserine after incubation with PK-C and 
phosphothreonine after incubation with PK-A (Fig. 2). Thus, 
the amino acid target sites phosphorylated by PK-C and 




basic FGF/PK-C basic FGF/PK-A 



r^. -■ . .. ^ 



Fig. 2. Identification of the amino acids targeted by PK-C and 
PK-A. Recombinant human basic FGF was phosphorylated by PK-C 
or PK-A, and recombinant acidic FGF was phosphorylated by PK-C. 
Radiolabeled amino acids were identified by two-dimensional elec- 
trophoresis after partial hydrolysis in 6 M HCI. PK-A has no effect 
on acidic FGF and was not analyzed. 
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PK-A in human basic FGF are distinct. Three sites in the 
sequence of basic FGF are compatible with the consensus 
sequence (27) required for PK-C-dependent phosphoryl- 
ation: Ser-64, Ser-108, and Ser-143. Reverse-phase HPLC 
analysis of the firagments generated by Staphylococcus Vg 
protease digestion of the basic FGF phosphorylated by PK-C 
identified Ser-64 as the target amino acid for this enzyme 
(J.-J.F., unpublished data). 

The primary structure of human basic FGF contains two 
potential sites that meet the criteria for PK-A-dependent 
phosphorylation (Thr-112 and Ser-113). Because threonine 
was identified as the target amino acid for the PK- 
A-dependent phosphorylation (Fig. 2), it is possible to pro- 
pose Thr-112 as the targeted amino acid for PK-A. This is 
ftirther supported by the observation that bovine basic FGF, 
in which Thr-112 has replaced Ser-112, was phosphorylated 
by PK-A on a target serine rather than threonine (result not 
shown). It is particulariy interesting to note that this PK-A 
site (11U--112) is located in the receptor-binding donuun of 
basic FGF (28) and that all forms of basic FGF (human, 
bovine, ovine, rat, and frog) contain this potential site of 
phosphorylation. This is in contrast to the other members of 
the FGF family including acidic FGF (2) and the proteins 
encoded by the oncogenes int'2 (29), FGF-5 (30) and hst/ks 
(31, 32). 

Spedfidty of FGF Phosphorylation. Several protein kinases 
were tested for their ability to phosphorylate acidic and basic 
FGFs in an effort to establish the possible specificity of PK-A 
and PK-C. Casein kinases I and II, enzymes that usually 
prefer acidic substrates (18, 21), did not significantly catalyze 
the phosphorylation of either mitogen (Fig. 1). The tyrosine 
kinase of the epidermal growth factor (EGF) receptor was 
also unable to phosphorylate basic FGF (C. Cochet, personal 
communication), although gag-fps, on oncogenic fusion pro- 
tein with broad protein tyrosine kinase activity (provided to 
us by K. Gould, Salk Institute), was able to phosphorylate 
basic FGF and, to a larger extent, acidic FGF (not shown). 

Kinetics of Phosphorylation. The and Vnax values were 
deduced from Lineweaver-Burke analyses of the reaction, 
and are shown in Table 1. The values (1.5 and 3.4 ^M) 
measured for the FK-C-dependent phosphorylation of basic 
and acidic FGFs rank these two factors among the better 
substrates for this enzyme (27). As an example, EGFRl, a 
peptide analog of the site of PK-C-dependent phosphoryl- 
ation in the EGF receptor, has a /Tn, of 15 ^tM (21). The Vmax 
for the phosphorylation of basic FGF was 50 times greater for 
basic FGF than for acidic FGF, suggesting that it can be more 
extensively phosphorylated. The stoichiometry of phosphate 
incorporation was 0.5-0.6 mol of phosphitte per mol of basic 
FGF, suggesting one site of phosphorylation. 

The Km value obtained for the phosphorylation of basic 
FGF by PK-A (17 /tM) ranks basic FGF among the better 
substrates for this kinase (33). Although the rate (V^axf 0.06 



Table 1. Kinetic parameters of the phosphorylation of FGFs by 
PK-C and PK-A 









Vmax* nmol/min 


Protein kinase 


Substrate 


mM 


per mg 


PK-C 


bFGF 


1.5 


18 




aPGF 


3.4 


0.37 


PK-A 


bFGF 


17.0 


0.06 




aFGF 


NS 


NS 



Various amounts of human recombinant acidic FGF (aFGF) and 
basic FGF (bFGF) were phosphorylated for 15 min at 30^C in the 
presence of [y-'^]ATP (10 fiM; 1500 cpm/pmol) and PK-C or PK-A 
under the conditions described in Fig. 1. Radioactivity incorporated 
into the trichloroacetic acid-precipitable proteins was determined 
and corrected for the autophosphorylation of the kinase. The kinetic 
parameters were calculated from Lineweaver-Burke double- 
reciprocal plots of the results. NS, not a substrate. 



nmol/min per mg) of this reaction is relatively low, 0.8-1.2 
mol of phosphate are incorporated per mol of basic FGF. 
Acidic FGF is not a substrate for this enzyme. 

Basic FGF Exists as a Phosphorylated Protein in Cells 
Grown in Culture. The hepatoma cell line SK-HEP-1 and 
boviiie ACE cells were selected for analyses because they 
synthesize basic FGF (7, 8, 34). Detergent extracts of the 
phosphate-labeled cells were inununoprecipitated with a 
specific anti-basic FGF antibody (35, 36), and two m^or 
radiolabeled bands were detected by autoradiography after 
electrophoresis (Fig. 3). In each instance, one of the bands 
could be identified as being basic FGF by virtue of antibody 
specificity, the estimated molecular weights of the radiola- 
beled bands (16-18 kDa), and their displacement by the 
addition of unlabeled FGF during the immunoprecipitation. 
The phosphorylated basic FGF synthesized by ACE cells 
was further characterized by showing that it could be ex- 
tracted with 2 M NaCl and eluted with a characteristic 1.4- 
1.6 M NaCl from the heparin-Sepharose column (not shown). 
No radiolabeled basic FGF was detected in conditioned 
media. 

Effects of Phosphorylation on the Biological Activities of 
Basic FGF. No differences were found when PK-C- 
phosphorylated basic FGF was tested on capillary (Fig. 4A) 
or vascular (not shown) endothelial cell proliferation assays. 
There also were no effects on thymidine incorporation into 
3T3 cells (not shown). Because each of these assays are 
long-term assays consisting of incubations of 24-96 hr, this 
basic FGF was also tested in a radioreceptor assay (Fig. 4B). 
There were no differences between this form of phosphoryl- 
ated basic FGF and the recombinant material as determined 
by their capacity to displace labeled basic FGF bound to 
either hi|^- or low-affinity sites on BHK cells. There were 
also no effects on the binding of basic FGF to immobilized 
heparin (not shown). 

It was of particular interest to determine the effects of 
PK-A-phosphorylated FGF because its target amino acid 
(Thr-112) is located in the receptor-binding domain of basic 
FGF (28). As expected, the time course and experimental 
conditions of the proliferation (Fig. 4Q and mitogenesis (not 
shown) assays precluded any detection of difference between 
the unphosphorylated and PK-A-phosphorylated basic FGF. 
In contrast, thie phosphorylated FGF was 3-8 times more 
potent at displacing ^^I-labeled FGF binding to its receptor 
than the unphosphorylated recombinant basic FGF. The 
effect of PK-A-phosphorylated FGF was shown to be specific 
for the high-affinity receptor because the displacement of 
FGF bound to cell-associated glycosaminoglycans was not 
different between the phosphorylated and unphosphorylated 




SK.Hop ACE 

Fig. 3. Phosphorylation of basic FGF in cultured bovine ACE 
cells and human hepatoma SK-HEP-1 cells. Immunoprecipitates 
fix>m '^P-Iabeled cells were prepared as discussed, analyzed by 
SDS/15% PAGE, and visualized by autoradiography. Inmuinopre- 
cipitations were performed in the absence (-) or In the presence (+) 
of an excess of nonradioactive basic FGF. The position of the 
molecular weight standards (xlO^^) is shown on both sides of the 
figure, and radioiodinated recombinant FGF is shown in the far-lefl 
lane. 
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basic FGFs. Accordingly, the affinity of the phosphorylated 
FGF for immobilized heparin was also found to be unchanged 
by phosphorylation (not shown). We also exploited the fact 
that the site of phosphorylation of basic FGF by PK-A can be 
modified by heparin so that the enzyme phosphorylates 
Ser-64 rather than Thr-112 (37). Under these conditions, 
there were no dififerences in any of the assays between the 
phosphorylated and unphosphorylated forms of basic FGF 
(data not shown). 

Growth Factors Are Sabstrates for Phosphorylation. TGF- 
a, TGF*/3. and IGF-I were substrates for PK-C-mediated 
phosphorylation (Fig. 5A), Not all growth factors were 
substrates; however, EGF, PDGF, and NGF were either 
poorly or not-at-all phosphorylated by this kinase. In the case 
of PK-A, TGF-a and TNF were substrates for phosphoryl- 
ation, but EGF, PDGF, and NGF were not. The spectrum of 
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Fig. 5. Phosphorylation of growth factors by purified PK-C (A) 
and PK-A (0). The growth factors indicated (0.5 m8 per lane) were 
phosphorylated as described in the text. The reactions were stopped 
by the addition of Laemmli sample buffer, and the phosphorylated 
proteins were visualized by autoradiography after separation by 
SDS/15% PAGE {A) or SDS/20% PAGE {B), PosiUons of the 
molecular weight markers (shown x 10~^) are indicated in the 
far-right lane. Arrowheads indicate the position of the autophospho- 
rylated protein kinases. Because PDGF was stored in the presence 
of bovine scrum albumin, only 0.1 ^ of this factor was used, and the 
phosphorylation of bovine scrum albumin can be detected as a 
67-kDa band. Ctl, control; bFGF, basic FGF; aFGF, acidic FGF. 
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Fig. 4. Biological activities of phos- 
phorylated basic FGFs. Recombinant hu- 
. nian basic FGF was phosphorylated (■) 
by PK-C and tested for its capacity to 
stimulate the proliferation of ACE cells 
(i4) and for its ability to displace ^I- 
labeled FGF binding to its receptor in 
BHK cells {B), Basic FGF was phospho- 
rylated by PK-A (•) and tested for its 
capacity to stimulate the proliferation of 
ACE cells (Q and to displace '"Mabeled 
FGF binding to the FGF receptor on 
BHK cells (C). In each instance, control 
samples (□, o) were treated under iden- 
tical conditions, but in the absence of 
enzyme. 



growth factors phosphorylated by different kinases varied 
presumably by virtue of the presence of consensus phospho- 
rylation sequences. Structurally related factors exhibited 
very different substrate specificities. Basic FGF and TGF-a 
were phosphorylated by both PK-C and PK-A, yet the 
structural homolog acidic FGF, which has 55% structural 
identity with basic FGF, was not phosphorylated by PK-A, 
and EGF, which is 33% homologous to TGF-a, was not 
phosphorylated by either kinase. 

DISCUSSION 

It is not known at what step in its synthesis basic FGF is 
phosphorylated or in fact what enzymes are responsible for 
the phosphorylation of basic FGF in intact cells. Attempts to 
identify the amino acid phosphorylated in vivo have so far 
failed because of the small amounts of growth factor made by 
the cells (7-10) and the low recoveries from immunoprecip- 
itation, peptide hydrolysis, and two-dimensional electropho- 
resis. The availability of transfected cells with a high expres- 
sion of basic FGF (11, 12, 38) should alleviate this problem. 

Little, if anything, is known about the mechanisms that 
regulate synthesis, secretion, and bioavailability of growth 
factors, and the results presented here suggest that it will be 
important to consider the possible roles of phosphorylation. 
In the case of basic FGF, one such role includes targeting to 
specific subcellular organelles. The metabolism of basic FGF 
^er binding to its receptor is different from many other 
growth factors (39). It is specifically metabolized to three 
long-lived fragments that continue to retain their capacity to 
bind heparin. Bouch^ et aL (40) have suggested that it is 
targeted to and accumulates in the nucleolus after binding to 
its receptor on the plasma membrane. Whether this transport 
is kinase dependent is not known. With the observation that 
FGFs have no obvious signal sequence, remain cell- 
associated, and may not be secreted in a classical sense (1, 2, 
7), the targeting of these mitogens to specific locations for 
processing in the cell may be of particular importance. 

The observation that the PK-A-phosphorylated basic FGF 
has a greater capacity to displace FGF from its receptor 
supports the notion that it possesses a higher affinity for the 
FGF receptor. If this is the case, then this posttranslational 
change may well contribute to the regulation of FGF activity. 
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Increased binding of FGF to its receptor has recently been 
proposed as one of the mechanisms that might enable cells to 
recruit FGF from the extracellular matrix. Thus, it is inter- 
esting to speculate that protein kinases mediate the release of 
FGFs from the basement membranes. This notion is sup- 
ported by the work of Plou6t et al. (17), who recently have 
demonstrated that the release of acidic FGF from retinal rod 
outer segment membranes requires ATP-dependent phos- 
phorylation. Any involvement of protein kinases would 
invoke the existence of these enzymes either in the extra- 
cellular milieu or associated with the outer plasma mem- 
brane. 

The results presented here may have several important 
implications for our understanding of the mechanisms regu- 
lating growth factor activity. Because basic FGF, and pos- 
sibly other growth factors, are phosphorylated in intact cells, 
their biological activities, bioavailability, and/or specificities 
may well be different from the unphosphorylated forms 
currently being examined in experimental models. The failure 
to establish that the phosphorylation of basic FGF resulte in 
modified activities in the proliferation and mitogenic assays 
most likely reflects the lability of this posttranslational 
modiHcation. This supports a potential role for ectokinases 
and ectophosphatases in regulating the biological activities of 
basic FGF. The balance between kinases and phosphatases 
would effectively force a local action for phosphorylated 
basic FGF. Because the site of phosphorylation of basic FGF 
by PK-A is modified by heparin (37) and this site of phos- 
phorylation is required for increased receptor binding, pro- 
tein kinase action is ineffective at modifying FGF activity 
when it is bound to glycosaminoglycans in the extracellular 
matrix. The generation of basic FGF free of any heparin or 
related glycosaminoglycans will be prerequisite to observing 
any effect of PK-A on basic FGF in vivo and, thus, supports 
the potential role of heparinases in regulating FGF activity 
(8). 

It is important to note that the experimental parameters 
examined here represent only three (binding, autogenesis, 
and proliferation) effects of basic FGF. It will be of impor- 
tance to establish whether the phosphorylation of FGF 
modulates cell differentiation (41) and cell adhesion (42). The 
phosphorylation of potentially cytoplasmic and certainly 
cell-associated growth factors like the FGFs may well pro- 
vide a critical regulatory step on signal transduction and any 
one of their many diverse biological end points. 
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ABSTRACT Epidermal growth factor (EGF) and type a 
tnmsforming growth factor <TGF-a) bind to a specifk regjkw 
In snbdomaiii III of the ertrardlwlar iwrtidn tbe EGF 
receptor (EGFR). Binding leads to receptor dimcriiatlon, auto- 
and transpiiospliorylation on intraccDtilar tyrosine reridnes, 
and activation of signal transduction pathways. We compared 
the bindiii« and Moiogical action of EGF and TGF-a hi 
CliiiMse hamster ovary (CHO) ceils eipressing either wfld-Qfpe 
hnman EGFR (HER497R) or a viuteit EGFR that has an 
arginine-to-lysine solistitntlon in tlie ektraoeilniar domain at 
codon 497 (HER497K) witfafai snbdomain IV of EGFR. Both 
rMeptors oiiiblted two orders of binding sites with radlolo- 
dhiated EGF (^^-EGF). Sfa nilar r esults were obUdned with 
^•TGF-a in ceils eipressing HER497R. In contrast, only oiie 
order of hm-afllnify Unding sites was seen with *^-TGF-a in 
the case of HER497K. Although EGF and TGF-a enhanrfd 
tyi twlne phosphor y lation of both receptors, CHOeeilseipiess» 
ing fiER497K ediibited an attenuated growdi response to EGF 
and TGF-tt and a reduced Induction of the protoonoogenes 
FOS, JUNt and M^C. Moreover, high concentratldns <tf 
TGF-a (5 nM) h ihibited gr<»wth in these ceils bat not hi oefls 
eipresaing HER497R. Tbew findings indicate that a leghn in 
subdomain IV of EGFR regulates signal transduction across 
tlie ccd membrane and selectively modulates the binding char- 
acteristlGS of TGF-o. 



The family of transmembrane tyrosiiie Idnase receptors con- 
stitutes an important group of regulatory proteins, which 
include the insulin receptor and tbe epidermal growth factor 
receptor (EGFR). Ligand specificity is confeited by the 
presence of unique sequences in the extracellular domain of 
each receptor. Often, more tlian one ligand can bind to the 
same receptor. Thus» the insulin receptor binds insulin and, 
with a lesser afiSnity, insulin-lilce growth factors I and II (1). 
EGFR binds EGF, type a transforming growth factor fTGF- 
a), amptiiregulin, betaceUulin, and heparin-binding EGF-lii&e 
growth fiactor (2, 3). Ligand binding in the extracellular 
domain leads to receptor autophosphorylation on tyrosine 
residues located within the intracellidar domain and initiation 
of a cascade of biochemical reactions that mediate the 
biological actions of many hormones and growth factors 
(1-3). In the absence of Ugand, the extracellular donuun acts 
tlurough unknown mechanisms to prevent constitutive and 
unregulated activation of the intrinsic tyrosine kinase activity 
of the intracellular domain (4, 5). 

The extracellular domain of transmembrane tyrosine ki- 
nase receptors often contains different structural subdo- 
mains. In the case of EGFR, there are four contiguous 
regions consisting of subdomain I at the N terminus and the 
cysteine-rich subdomains II and IV flanking subdomain III 
(6). Cross-linking studies have identified a 47-amino acid 
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sequence (residues 321-367) within subdomain m as the 
EGF/TOF-a binding region (6, 7). The specific functions erf 
the other extracellular subdomains are not known. 

Recently, a variant EGFR has been identified that has an 
aiginine-to-lysine substitution at codon 497 (8). This variant 
receptor was found in normal human lymphocytes and a 
number of cultured human cancer cell lines but not in A431 
ceUs (8). Inasmuch as the potential role of this region of 
EGFR with respect to modulation of cell proUferation and 
gene activation is not known, in the present study we sought 
to characterize the biological properties of HER497K. Ac- 
cordingly, we expressed the wild-type human EGFR 
(HER497R) and HER497K in Chinese hamster ovary (CHO) 
cells dev(Hd of endogenous EGFR and determined whether 
there are differences between the two receptors with respect 
to ligand bindinig, cell growth, and induction of the immediate 
eariy response genes FOS, JUN^ and MYC. 

MATERIALS AND METHODS 

Materials. The following materials were purchased: recep- 
tor grade murine EGF from Collaborative Research; GF/F 
filters from Whatman; FY-20 anti-phosphotyrosine antibod- 
ies from ICN; ^^l, [o-^^pj^icTP, ECL blotting kit, and 
Hyperfilm-ECL from Amersham; lodo-Gen from Pierce; 
pGEM-7Zf(+) plasmid from Promega; pSVK 3 expression 
vectorfirom Pharmacia; human MFC, and/l/NcDNAs 
ftom the American Type Culture Collection; GeneScreen 
membranes fix>ro New En^and Nuclear; (^iagen minicol- 
umns ; 3-(4,5-dimethylthiazol-2-y l)-2,5-diphenyltetrazolium 
bromide (MTT) from Sigma; and Ecoscint scintillation fluid 
fix>m National Diagnostics. Recombinant human TGF-a, 
13A9 monoclonal anti-EGFR antibodies (9), and CHO cells 
were gifts from M. Winkler and B. Fendley (Genentech, 
South San Francisco). 

Generation of Expression Vectors. Two oveiis^yping oligo- 
nucleotides were synthesized (8) and used to generate a linker 
that contained the following restriction sites: Sph \/HindSXl/ 
Sac l/Ecom/Bgl ll/Sma 1/Sal l/BstXi, The linker was 
subcloned into the cloning vector pG£M-72X at the Sph 
i/BstXI sites, generating the vector pGEM7.2Zf. Four EcoKt 
fraffments (HERA, HERB, HERC, and HERD) encompass- 
ing the entire EGFR coding region were isolated &x}m a A 
library derived from TiiA4 himian pancreatic cancer cells, 
with HERB containing the nucleotide base change (8). The 
wild-type EGFR expression plasmid fragment was prepared 
by substituting HERB with fragment HERO, ori^nally 
cloned from A431 cells (2, 25). The four EGFR fira^^ents 
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were reassembled as follows: the 314-bp Sac l/EcoRl frag- 
ment of HERA, the 1841-bp EcoU HERB (or HERG) 
fragment, and the 637-bp EcoKl/Bgl U fragment of HERC 
were sequentially subcloned into pGEM-7.2Zf, generating 
plasmid HER7.2ABC. The 771-bp EcoKl HERC fri«ment 
and the 884-bp EcoKl/Sca I fragment of HERD were se- 
quentially subcloned into pGEM-7.2Zf, generating plasmid 
HER7.2CD. A 1025-bp Bgl ll/Sal I fragment from 
HER7.2CD was then subcloned into HER7.2ABC, yielding a 
full-lengUi EGFR cDNA. The two 3.8-kb Hindm/Sal I 
EGFR cDNAs were then subcloned into the pS VK 3 expres- 
sion vector under the control of the simian virus 40 eariy 
promoter, yielding pHER497K (variant EGFR) and 
pHER497R (wild-type EGFR). Authenticity of the constructs 
was confirmed by sequendng. Transfection of pHER497K, 
pHER497R, and pSV2-d^ DNA into CHO cells deficient in 
synthesis <^ dihydrofolate reductase was earned out by the 
calcium phosphate precipitation technique (10). After readi- 
ing confluency , cells were plated at a 1:10 dilution in selection 
medium [Dulbecco's modified Eagle's medium (DMEM)/ 
Ham's F-12 medium (1:1) lacking hypoxanthiiie, glycine, and 
thymidine] supplemented with 5% dialyzed fetd bovine se- 
nmi (FBS). After 2 weeks, single clones of cells were 
isolated, and eadi individual clone was plated separately. 
After clonal expansion, cells from each individual clone were 
tested for EGFR expression by Northern blot analysis and 
^I-labeled EGF (^I-EGF) binding. Hie selected clones 
were shown to express only the variant HER497K or wild- 
type HER497R by subjecting RN A samples to reverse tran- 
scriptase PGR amplification followed by asymmetric PGR 
and sequencing (8). 

BfaMUng Experiments. BiologicaUy active ^I>EGF (40-50 
fiC\fii%\ 1 Ci = 37 GBq) and ^^I-TGF-a (50-70 MCi/fig) were 
iodinated with lodo-Gen and chloramine T, respectively (11). 
Binding was performed at 4'^C for 5 hr on cells in monolayer 
culture (70-80% confluent) with increasing concentrations of 
labeled Ugand and DMEM containing 0.1% bovine serum 
albumin (BSA) and 20 mM Hepes (11). Incubations were 
stopped by washing cells with phosphate-buffered saline 
containing 0.1% BSA. Nonspecific binding, determined in 
the presence of a 1000-fold excess of unlabeled EGF, did not 
exceed 10% of total binding. Equilibrium binding data were 
analyzed with the lioand program (12). 

Ceil Growth Assay. CeU proliferation was assayed by 
measuring the metabolism of the tetrazoliimi salt MTT and by 
monitoring pH]thymidine incorporation into DNA (13). The 
MTT assay measures the cell's ability to metabolize MTT, 
which correlates well with the rate of cellular proliferation 
(13). For the MTT experiments, cells were plated at a density 
of 8 X 10^ cells per well in 96-well plates and grown overnight 
in 0.2 ml of DMEM/Ham's F-12 medium (1:1) containing 5% 
.FBS. After replacing the medium with 0.2 ml of the same 
medium contaiiQing 0.5% FBS, cells were incubated for 72 hr 
at 37'C in the absence or presence of ligands, prior to the 
.addition of MTT (5 mg/ml) for 4 hr (13). Reactions were 
stopped with acidified isopropanol (0.04 M), and the absor- 
bance of the supematants was measiu^ at 570 lim with an 
ELISA plate reader (Molecular Devices). Absorbance read- 
ings were converted to the percentage of control growth, 
which was determined in the absence of growth fiictors and 
defined as 1(X)% (13). For the pH]thymidine incorporation 
experiments, cells were i^ated at a density of 3 x 10* cells per 
well in 12-well plates and incubated for 48 hr in 1.5 ml of 
DMEM/Ham's F-12 medium (1:1) containing 5% FBS. Cells 
were then incubated for 40 hr in the absence or presence of 
ligand with seium-free medium containing 5 m of insulin per 
inl, 5 MS of transferrin per ml, 5ti%d[ selenium per mi, and 
0.1% BSA before adding pH]thymidine (1 /iCi/ml) for 1 hr. 
CeUs were collected by rapid filtration on GF/F filters and 
washed sequentially with 10% trichloroacetic acid, 70% 
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ethanol, and 100% ethanol. Radioactivity was measured in a 
liquid scintillation counter using Ecoscint as the scintillant. 
Data were then expressed as percentage change from control 
values. 

EGFR Pho«pborylaliot. Cells were incubated for the spec- 
ified times at yPC in binding medium in the absence or 
presence of each ligand and then solubilized in bujBTer con- 
taining 50 mM Tris HCl (pH 7.7), 150 mM NaQ, 5 mM 
EDTA, 1% deoxycholate, 1% Triton X-100, 6.1% sodium 
dodecyl sulfate, 1 mM phenylmethylsulfonyl fluoride, 2 mM 
benzamidine, and 0.1 mM Na3V04 (14). Cell homogenates or 
immunoprecipitated EGFR (using the 13A9 ahti-EGFR an- 
tibodies) were subjected to SDS/7.5% PAGE and tiansfenred 
to nitrocellulose membranes (14). Membranes were incu- 
bated with FY-20 anti-phosphotyrosine antibodies (1 /ig/ml) 
for 1 hr, washed, and sulgected to chemiluminesoence West- 
em blotting with the ECL blotting kit and Hypetfilm-ECL. 

Northern Biol AndlyBis* Total RN A was isolated by 
guanidium thiocyanate extraction method (15), size- 
fiactionated, and transferred onto OeneScreen membranes 
(IQ. FOters were hybri dized under high-stringehcy condi- 
tions using [a-^ldCTP-labeled cDNAs (IQ. The intensity of 
the radiographic bands was quantified by laser denutomdry . 

Statistical Anal^ais. Statistical analysis of the experimental 
results was obtained by one-tailed Student's / test using the 
STATViEW computer program, with P < 0.05 taken as signif- 
icant. 

RESULTS 

Characterizatioa of Ui^nd Binding. EGF and TGF-a bind- 
ing and dissociation are pH dependent (14). Therefore, to 
determine EGFR affinity and capacity, equilibrium binding 
studies (4*t:) were carried out at pH 7.4 (Fig. 1) and pH 8.0 
(Table 1). Computer-based analysis of ^I-EGF binding data 
(12) demonstrated the presence of two orders €i binding sites 
at the two pH levels in CHO cells expressing either wild-type 
HER497R or variant HER497K. At pH 7.4, there were 
i^proximately 90,000 and 116,000 low-afifinity receptors in 
cells expressing HER497R and HER497K, respectively (Ta- 
ble 1). The number of high-affinity binding sites (3700 and 
4800) was also similar in both cell types (TaUe 1). At pH 8.0, 
there was a decrease in the number of binding sites for EGF 
with both receptors and a concomitant increase in binding 
affinity (Table 1). ^I-TGF-a also exhibited two ohlers of 
binding sites in cells expressing HER497R. As in the case of 
EGF, there was a decrease in the number of TGF-a binding 




Bound, nM 



Flo. 1. Scale hard analysis. CHO cells expressing the indicated 
EGFR were incubated for 5 hr {4^ in six-well ^ates with binding 
medium CpH 7.4) containing increasing concentrations (0.1-80 ng/ 
ml) of either ^I-EGF or ^^si.TGF-a. Data are representative of two 
experiments. 
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Table 1. Binding characteristics of ^^^I-EGF and ^^si-TGF-a 



pH 


Ligand 


Receptor, no. per cell 


Ka, nM 


HER497R 


HER497K 


HER497R HER497K 


7.4 


EOF 


89.900 


116.200 


5.73 13.6 






3.700 


4.800 


0.1 0.28 


7.4 


TGF-a 


123.300 


158.100 


4.57 5.62 






8,200 




0.16 


8.0 


EOF 


47.300 


83,800 


3.39 8.49 






1.500 


2,900 


0.05 0,17 


8.0 


TGF-a 


61.600 


91,400 


2.26 3.75 






1.100 




0.02 



HER497R- and HER497K-expressing cells were incubated for 5 hr 
(4**C) at the indicated pH in six-well plates with increasing concen- 
trations of either "^l-EGF or i"I-TGF-a (0.1-80 ng/ml). Data were 
analyzed with the ugand program as described and are means of 
two separate experiments with each ligand. 

sites at pH 8.0 in coiuuncUon with an increase in binding 
affinity CTable 1). However, irrespective of the pH, only a 
single order of low-affinity binding sites was observed with 
^I-TGF-a in ceUs expressing HER497K aable 1). 

To determine the effect of pH on ligand dissociation, cells 
were initially incubated for 5 hr at 4X (pH 7.4) with either 
"^I-TGF-o or "*I-EGF, washed at 4*C, and incubated for 2 
hr at various pH levels (4**C) in fresh medium devoid of 
radioactivity (Fig. 2). At pH 4, the m^ority of ^1-EGF was 
released from the cells. Raising the pH caused a progressive 
decrease in the percentage of dissociated radioactivity, with 
50% dissociation occurring at pH 4.8 and S.4 with HER497K 
and HER497R, respectively. The least dissociation occurred 
at pH 6.5-7.0, and the amount of ^I-EGF dissociated was 
similar with HER497R and HER497K. In contrast, with both 
receptors, the minority of ^1-TGF-a was still dissociated 
from the cells at pH 5, and 50% dissociation occurred at pH 
»7.0. However, at pH 8 and 8.5, ^^I-TGF-a dissociated more 
readily from HER497K Uian from HER497R. 

Growth Stimulatioa by EGF and TGF-or. As determined by 
the MTT dye reduction assay, EGF and TGF-a modulated 



00-1 
80 - 
70 - 

O ^ H 

:S 50- 

8 

5 

^ 30H 

20 

10 H 




4 



6 



T 
7 



T 
B 



0 



pH 



Fig. 2. Effect of pH on Ugand dissociation. HBR497R-expi«ssing 
(solid symbols) and HER497K-expressing (open symbols) cells were 
incubated for 5 hr (4*^) in six-well plates with binding medium (pH 
7.4) containing 200,000 cpm of "^I-EGF (squares) or "^l-TGF-a 
(circles) and then washed and incubated for 2 hr at 4X in fresh 
medium at the indicated pH. Radioactivity dissociated from the cells 
during the 2-hr incubation was expressed as a percentage of the 
radioactivity that bound to the cells during the initial 5-hr incubation. 
Data are means ± SB of duplicate determinations from three 
experiments. 
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the growth of EGFR-expressing CHO ceils. In the case of 
HER497R, maximal stimulation of 40% occurred at concen- 
trations of 0. 1-0.8 nM with either growth factor (Fig. 3 A and 
C). Higher ligand concentrations resulted in a slight decrease 
in growth stimulation. In the case of HER497K» there was 
only a minimal increase in cell growth with EGF (0. 1 nM) and 
none with TGF-a (Fig. 3 A and C). Furthermore, at high 
concentrations (2-5 nM), TGF-a inhibited ceil proliferation 
(Fig. 3C). When [^H]thymidine incorporation was assayed, 
maximal stimulation of 45-90% occurred at concentrations of 
0.2-2 nM EGF and TGF-a with HER497R (Fig. 3B). In 
contrast, with HER497K, low concentrations of EGF in- 
creased ['H]thymidine incorporation by 45%, whereas 
TGF-a was without efifect. Conversely, high concentrations 
of TGF-a (1.5-5 nM) markedly inhibited [^HJthymidine in- 
corporation, and EGF was without effect (Fig. 3D). 

^ects of EGF and TGF-or on EGFR PiMwplioryUition. To 
determine whether EGF and TGF-a induce tyrosine phos- 
phorylation of HER497K, inununoblotting experiments were 
carried out with anti-phosphotyrosine antibodies. Both EGF 
and TGF-a (34 nM) increased the phosphotyrosine content of 
HER497K (Fig. 4i4). Maximal effects were seen 2-5 min after 
addition of either EGF or TGF-a (Fig. 4A). To confirm that 
the phosphorylated band represented HER497K and that 
HER497R was also phosphorylated, EGFR was immimopre- 
cipitated from CHO ceils expressing HER497K and 
HER497R using an anti-EGFR antibody (13A9), followed by 
immunoblotting with the anti-phosphotyrosine antibodies. 
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Fig. 3. Effects of EOF and TGF-a on cell growth. HER497R- 
expressing (solid symbols) and HER497K-expressiiig (open sym- 
bols) cells were incubated for 72 hr in the absence or presence of the 
indicated concentrations of EGF and TGF-a. Cell proliferation was 
determined by the MTT dye reduction assay 04 and C) and by 
monitoring [^H]thynudine incorporation (jB and D), Control values, 
determined in the absence of growth factor, were defined as 0. Data 
are expressed as percentage change from control values and are 
ro caM ± SB of quintuplicate determinations finom three experiments 
(MTT) or triplicate determinations firom three experiments (PH]thy- 
midine incorporation). 
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Fig. 4. Effects of EGF and TGF-a on phosphotyrosine content 
and cell morphology. (A) HER497K-expressing cells were incubated 
at 3TC with 34 nM EGF or TGF-a for the indicated times. Samples 
were lysed, subjected to SDS/PAGE, and transferred to membranes. 
The membranes were incubated with the PY-20 anU-phosphoty- 
rosine antibodies and treated with ECL Western blotting reagents. 
(B) HER497R- and HER497K-exprcssing cells were incubated for 2 
min at 3TC with 3.4 nM EGF (lanes E) or TGF-a (lanes T). HER497R 
and HER497K were then immunoprecipitated, and the inununopre- 
cipitates were subjected to immunoblotting with PY-20 antibodies as 
described above. Lanes C» control. iC~F) Cell morphology. 
HER497R- (C and D) and HER497K- (£ and f) expressing ceUs were 
grown in six-well plates in medium containing 0.5% FBS in the 
absence (C and E) or presence {O and F) of 3.4 nM EGF. (x320.) 

Both EGF and TGF-a (3.4 nM) enhanced tyrosine phosphor- 
ylation of HER497K and HER497R (Fig, 4B), even at con- 
centrations as low as 250 pM (data not shown). 

Effects of EGF and TGF-a on AfYC, FOS^ and JUN mRNA 
Levels and Cell Morphology. In both HER497K- and 
HER497R-cxprcssing cells, EGF (1.7 nM) caused a time- 
dependent increase in the mRNA levels of the protoonco- 
genes Af KC. F05, and JUN, However, this efifcct was 
consistently more pronoimced in the case of HER497R by 
comparison with HER497K (Fig. 5). Similaiiy, the efifects of 
TGF-a on the expression of these mRNA moieties was more 
pronounced in the HER497R-expressing cells (Fig. 5). When 
two additional HER497R-expressing clones were examined 
for MYC, FOS, and JUN induction, EGF and TGF-a again 
exerted a greater effect on the expression of these mRNA 
moieties by comparison with several HER497K-expressing 
clones (data not shown). 

Parental CHO cells were pleomorphic and flat and did not 
alter their shape after EGF or TGF-a addition (data not 
shown). HER497R-expressing cells had a polygonal shape 
and acquired either a fusiform or a rounded morphology after 
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Fio. 3. Efifects of EGF and TGF-a on MYC, FOS, and JUN 
mRNA levels. HER497R- and HER497K-expressing ceUs were 
incubated with 1.7 nM EGF or TGF-a for the indicated times. RNA 
was isolated and analyzed by Northern blotting. Exposure times: 
MYC, 48 hr; FOS, 96 hr; JUN, 24 hr; 7S, 16 hr. Each autoradiograph 
is representative of at least three experiments. 

addition of EGF for 40 hr (Fig. 4 C and D). HER497K- 
expressing cells had a uniformly narrow and fusiform shape 
in the absence of EGF (Fig. 4E), with many cells acquiring 
a rounded morphology after incubation with EGF for 40 hr 
(Fig. 4F). Similar morphological differences were observed 
with TGF-a as well as in two other HER497R- and 
HER497K-expressing clones (data not shown). 

DISCUSSION 

In the present study, we expressed variant (HER497K) and 
wild-type (HER497R) EGFRs in CHO cells that are devoid of 
endogenous EGFRs and compared the two receptors with 
respect to ligand binding, growth stimulation* tyrosine kinase 
activation, and induction of MYC, FOS, and JUN. There 
were two m^uor differences in the binding characteristics of 
HER497R- and HER497K-expressing CHO cells. First, irre- 
spective of whether binding was performed at pH 7.4 or 8.0, 
there were no high-affinity binding sites for ^^l-TGF-a with 
HER497K. In contrast, simUar high- and low-affinity binding 
sites were obtained with ^^I-EGF and ^"I-TGF-a in the case 
of HER497R and with ^^^i-EGF in the case of HER497K. 
Second, at high pH levels, TGF-a dissociation was more 
pronounced with HER497K by comparison with HER497R. 
Thus, a single amino acid substitution in subdomain IV of 
EGFR selectively alters TGF-a binding and dissociation, 
even though the EGiP/TGF-a binding region is located in a 
47-amino acid sequence (residues 321-367) within subdomain 
III (residues 313-446) of EGFR (6, 7). Inasmuch as EGFR 
dimers appear to have a higher ligand binding affinity than 
EGFR monomers (17), our findings raise the possibility that 
the arginine-to-lysine substitution may result in attenuated 
TGF-a-induced HER497K dimerization. They also imply a 
modulatory role for this EGFR region on the TGF-a binding 
pocket within subdomain III. 

Tyrosine phosphorylation of specific amino acids in the 
intracellular domain of EGFR allows for recruitment of 
important regulatory intracellidar substrates such as She and 
Grb2, activation of p21'**, initiation of a cascade of phos- 
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phorylation reactions that activate cytosolic proteins, and 
activation of nuclear oncoproteins (2, 18). In this study, EGF 
and TGF>a efficiently increased tyrosine phosphorylation of 
HER497R and HER497K, indicating that the intrinsic tyro- 
sine kinase activity of both receptors was activated by both 
iigands. However, in HER497R-expressing cells, EGF and 
TGF-a significantly increased cell growth and caused the 
anticipated increases in MYC» FOS, and JUN mRNA levels. 
In contrast, in HER497K-expressing cells, both factors ex- 
hibited a markedly reduced growth response and a decreased 
ability to raise MYC, FOS, and JUN mRNA levels. These 
dififerences indicate that, in spite of relatively similar intrinsic 
tyrosine kinase autophosphorylating activities, HER497K 
activates a less mitogenic pathway than HER497R. There- 
fore, it is possible that the region in the extracellular domain 
of EGFR that contains the arginine-to-lysine substitution 
somehow modulates the efficiency of recruitment of intra- 
cellular substrates to the activated receptor and/or allows for 
the activation of altematiye signaling pathways that do not 
lead to efficient nuclear protooncogene induction and sub- 
sequent growth stimulation. The hypothesis that this portion 
of EGFR may represent an important regulatory region is 
supported by the observations that the seven residues up- 
stream of codon 497 and the five downstream residues are 
conserved in the mouse, rat, and human EGFRs (1S^21) and 
that the intercy steine spacing is also completely conserved in 
this region of the human, rat, mouse, and chicken EGFRs as 
well as in /er-2i, the homologue of EGFR in Caenorhabditis 
elegans (22). 

EGF smd TGF-a generally exert the same effects on ceil 
proliferation and function in a variety of cell types. However, 
a number of quantitative and qualitative differences have 
been reported between their biological actions (9, 23, 24). In 
the present study, HER497K-expressing cells were markedly 
growth inhibited by high concentrations of TGF-a but not 
EGF, perh^s because of the absence of high-affinity TGF-a 
* binding sites in these cells. Neither factor inhibited the 
growth of HER497R-expressing cells. Therefore, our find- 
ings raise the possibility that some of the differences between 
the actions of EGF and TGF-a may be due to the presence 
of HER497K or other variant EGFRs in these cells. In 
support of this hypothesis, TGF-a exerts a more potent 
growth inhibitory effect than EGF in RL95-2 human endo- 
metrial cancer cells (23), which express both HER497R and 
HER497K (8). 

Recently, several null alleles that result in loss of let'23 
function have been identified (22). Two of these alterations 
are located in subdomain IV and involve mutations of cys- 
teine to either tryptophan or tyrosine at codons 700 and 753, 
respectively (22). It is unlikely that these null mutants are 
analogous to the variant human EGFR described in the 
present study, inasmuch as HER497K kinase activity was 
intact and cell proliferation and nuclear protooncogene in- 
duction were regulated by this variant receptor in response to 
ligand binding. In addition, both HER497K and HER497R 
altered the flattened, nontransformed morphology of the 
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parental CHO cells. Taken together, our data suggest that the 
extracellular juxtamembranous region of human EGFR has 
the potential to modulate both ligand binding and transmem- 
brane signaling to the intracellular domain, which may allow 
the EGF family of Iigands to differentially regulate a variety 
of cellular processes. 

1. Czech, M. P. (1989) Cetf 59, 235-238. 

2. Schlessinger. J. & Ullrich, A. (1992) Neuron 9, 383-391. 

3. Thome, B. A. & Plowman, G. D. (1994) MoL Cell. BioL 14, 
1635-1646. 

4. Sboelson, S. E., White, M. F. & Kahn, C. R. (1988) J. Biol. 
Chem, 263, 4852-4860. 

5. Hsu, C. Y., Mohanunadi, M., Nathan, M., Honegger, A. M.. 
Ullrich, A., Schlessinger, J. & Huiwitz, D. R. (1990) CeU 
Growth mffer, 1, 191-200. 

6. Lax, I., Buigess, W. H., Bellot, F., Ullrich, A., Schlessinger, 
J. & Givol, D. (1988) MoL CeU. BioL 8, 1831-1834. 

7. Wu, D.. Wang, L., Chi, Y.. Sato, G. H. & Sato, J. D. (1990) 
Proc. NatL Acad. ScL USA 87, 3151-3155. 

8. Moriai, T., Kobrin, M. S. &, Korc, M. (1993) Biochem. Bio- 
phys. Res, Commun. 191, 1034-1039. 

9. Winkler. M. E., O'Connor, L., Winget, M. & Fendly. B. (1989) 
Biochemistry 28, 6373-6378. 

10. Chen, C. & Okayama, H. (1987) MoL CelL BioL 7, 2745-2752. 

11. KoTc, M. & Fuunan, J. E. (1989) /. BioL Chem, 264, 14990- 
14999. 

12. Munson, P. J. & Robard, D. (1980) Aiud. Biochem, 107, 
220-239. 

13. Raitano, A. B., Scuderi, P. & Korc, M. (1990) Pancreas 5, 
267-277. 

14. Korc, M., Chandrasekar, B. A Shah. G. N. (1991) Cancer Res, 
51,6243-6249. 

15. Chiigwin, J. M. , Przybyla, A. E. , MacDonald, R. T. A Rutter, 
W. J. (1979) Biochemistry 18, 5294-5299. 

16. Korc, M., Chandrasekar, B., Yamanaka, Y., Friess, H., 
Buchler, M. A Beger, H. G. (1992) J, Clin. Invest. 90, 1352- 
1360. 

17. Yarden, Y. A Schlessinger, J. (1987) Biochemistry 26, 1443- 
1451. 

18. Gotoh, N., Tojo, A., Muroya, K., Hashimoto. Y., Hattori, S., 
Nakamura, S., Takenawa, T., Yazaki, Y. A Shibuya, M. (1994) 
Proc. NatL Acad. ScL USA 91, 167-171. 

19. Avivi, A., Lax, 1. M., Ullrich, A., Schlessinger, J., Givol, D. 
& Morse, B. (1991) Oncogene 6, 673-676. 

20. Lax, I., Johnson, A., Howk, R., Sap, J., Bellot, F., Winkler, 
M., Ullrich, A., Vennstrom, B., Schlessinger, J. A Givol, D. 
(1988) Mol. CelL Biol, 8, 1970-1978. 

21. Fetch, L. A., Harris, J., Raymond, V. W., Blasband, A., Lee, 
D. C. & Earp. H. S. (1990) MoL CelL BioL 10, 2973-2982. 

22. Aroian, R. V. , Lessa, G. M. A Sternberg, P. W. (1994) EMBO 
J. 13, 360-366. 

23. Korc, M Haussler, C. A. A Trookman, N. S. (1987) Cancer 
Res. 47, 4909-4914. 

24. Gan. B. S., Hollenberg, M. D., MacCannell, K. L., Lederis, 
K., Winkler, M. E. A Derynck, R. (1987) J. PharmacoL Exp, 
Ther. 242, 331-337. 

25. Ullrich, A., Coussens, L., Hayflick, J. S., DuU, T. J., Gray A., 
Tam, A. W., Lee, J., Yarden, Y., Libennann, T. A., Schles- 
singer, J., Downward, J., Mayes, E. L. V., Whittle, N., Water- 
field, M. D. A Seebuig, P. H. (1984) Nature (London) 309, 
418-425. 



INHIBITION OF CYTOTOXIC T CELL DEVELOPMENT BY 
TRANSFORMING GROWTH FACTOR AND REVERSAL BY 
RECOMBINANT TUMOR NECROSIS FACTOR a 

By GERALD £. RANGES, IRENE S. FIGARI. TERJE ESPEVIK. and 

MICHAEL A. PALLADINO. jR. 

From the Department of Molecular Immunology, GenentecK Inc., 
South San Francisco, California 94080 

Transforming growth factor p (TGF-/3)/ although first defined for its ability 
to induce nonneoplastic cells to express a transformed phenotype, has now been 
shown to exert multiple actions on both normal and transformed cells (1). Recent 
studies have also defined a variety of immunoreg^latory properties of TGF-/8, 
including inhibition of T and B cell proliferation, IL-2-R induction, IL-1 -induced 
thymocyte proliferation, cytokine production, including IFN-y and TNF-a, 
natural killer cell activity, and class II antigen expression (2-6). The mechanism(s) 
through which TGF-/9 exerts these immunoregulatory effects is at present not 
known. 

TNF-a (also referred to as cachectin) has also been shown to express multi- 
functional immunomodulatory activities besides its direct cytotoxic/cytostatic 
effects on transformed cells (7-10). However, in contrast, to TGF-/8, TNF-a 
enhances IL-2-R expression, class II antigen expression, and IFN-7 production 
by activated lymphocytes (9, 1 0). 

The contrasting immunoregulatory activities of these two proteins prompted 
studies to further define their immunoregulatory activities in vitro. In this report, 
we describe the dose-dependent inhibition of CTL generation by TGF-/3 and the 
reversal of this inhibition by recombinant murine TNF-a (rMuTNF-a). In 
addition, we demonstrate that TNF-a is an important cytokine iiivolved in CTL 
development. 

Materials and Methods 

Animals. 6-12-wk-old female BALB/c and C57BL/6 (B6) mice were obtained from 
Charles River Breeding Laboratories (Wilmington, MA). 

Reagents. Porcine platelet-derived TGF-/3 (R and D Systems, Minneapolis, MN) was 
reconstituted in 4 mM HCl to 1 Mg/ml and stored at 4**C. rMuTNF-a (sp act 7x10' 
U/mg), as determined by a standard cytotoxic bioassay using L-M cells, contained <0*025 
pg of endotoxin per microgram protein by the limulus amoebocyte assay (11, 12). The 
specific activities of recombinant human TNF-a (rHuTNF-a) and -/8 provided by Gcnen- 
tech. Inc., as determined by the L-M bioassay were 5 X 10' and 2x10® U/mg protein, 
respectively (13, 14). 

' Abbreviations used in this paper: CM EM. complete minima) essential medium; NRS, normal 
rabbit serum; TGF-^, transforming growth factor 0. 
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Mixed Lymphocyte Cultures (MLC), CTLs were generated in 5-d MLC by incubation in 
24-weII tissue culture plates (3524; Ck>star. Cambridge. MA) of 5 X 10* or 5 X 10* B6 
responding spleen cells and 5X10^ BALB/c stimulator spleen cells (irradiated 2,000 rad) 
per well in 2 ml of complete minimal essential medium (CMEM) consisting of Eagle's 
minimum essential medium supplemented with 0. 1 mM nonessential amino acids, 2 mM 
L-glutamine^penidllin (100 U/ml), and streptomycin (100 Mg/nil) (Gibco, Grand Island, 
NY) and 10% heat-inactivated PCS (Hyclone Laboratories, Inc., Logan, UT). 

CTL Assay. After 5 d of culture, the cells were harvested, washed three times in 
CMEM and tested for cytotoxic activity in a 4-h *'Cr-releasc assay. P815 (DBA/2 
mastocytoma, H-2'*; American Type Culture Collection, Rockville, MD) or LBRM-33- 
1 A5B(B10.BR lymphoma. H-2*'; A. Zlotnik, DNAX Research Institute for Molecular and 
Cellular Biology, Piilo Alto. CA) were labeled with 150 fiCi Na^CrO* (5 mCi/ml; 
Amersham Corp., Arlington Heights, IL) for 45 min at 37^C, followed by three washes 
in CMEM. 100 id of target cells (10^ cells/ml) and 100 ftl of effector cells at various 
concentrations were added in triplicate in 96-well round-bottom microtiter plates (Costar). 
After 4 h of incubation at 37 ®C and S% CO2 in air, the suf>ematant5 were harvested 
(Skatron, Rockville, MD) and their radioactivity was determined in an automatic gamma 
counter (Micromedic Systems, Horsham, PA). Percent specific cytotoxicity was calculated 
as 100 X [cpm of test supematants of effector cells and target celb incubated together 
(ex]>erimenta] release)] — [cpm of supematants of target cells incubated alone (spontaneous 
release)]/f[cpm after lysis of target cells with 2% NP-40 (maximum release)] — [sponta- 
neous release]). Results given are the mean of triplicate cultures ± SE. Spontaneous 
release of target cells alone was <10% of maximum for all experiments. 

Detection ofMuTNF-a in Supematants of the MLC. The quantity of MuTNF-a produced 
in the MLC was determined using the WEHI-164 clone 13, which is capable of detecting 
levels as lowasT X 10"* U/ml (0.01 pg/ml) of MuTNF-a per milliliter (15). Using WEHI- 
164 clone 13 as indicator cells, the cytotoxicity induced by MuTNF-a was determined by 
an MTT tetrazolium colorimetric assay as detailed previously (15). The amount of 
MuTNF-a in test supematants was calculated on the basis of cytotoxicity obtained in the 
presence of various dilutions of a rMuTNF-a standard. Data are expressed as units of 
TNF-a per milliliter. 

The specificity of this assay for MuTNF-a was demonstrated by inhibiting the cytolytic 
activity of test supematants with anti-rMuTNF-a antiserum produced in New Zesdknd 
rabbits. This antiserum developed by methods similar to those described for production 
of anti-rHuTNF-a antiserum (16), had a neutralization titer of '^lO* U/ml in the L-M 
bioassay (1 2). Normal rabbit serum (NRS) was obtained from control New Zealand rabbits. 

Results 

Inhibition of CTL Generation by TGF-P, When added to MLC. TGY-fi inhibited 
the generation of B6-anti-H-2**-specific CTL in a dose-dependent fashion (Table 
I). The inhibitory effects were most pronounced when TGY-p was added during 
the first 24 h of the MLC (Fig. 1). Less inhibition was observed when the addition 
was delayed for 48 h, and only a minimal effect was seen if the delay was more 
than 72 h. Similarly, at the doses tested TGF-/J showed no inhibitory activity if 
included directly in the CTL assay. These data indicate that TGF-fi inhibits CTL 
generation in a dose-dependent manner, and its mechanism of action ap|)ears to 
involve early stages of the MLC. 

Inhibition of TNF-a Production by TGF-0 during MLC. Although a variety of 
in vitro immunoregulatory activities have been ascribed to TNF-a, studies 
examining the production of TNF-a during an MLC and the effects of altering 
endogenous TNF-a levels during CTL development have not been reported (8- 
1 0). Moreover, our previous studies have shown that TGF-^ can inhibit TNF-a 
production by murine macrophages (6). Therefore, we considered it important 
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Table 1 

Effect ofTGF-P on In Vitro CTL GeneratUm 



(ng/ml) 


Percent specific ^'Cr release (± S£) at^ E/T ratios of: 


50:1 


25:1 


12.5:1 


6.25:1 


None 


89 ± 2.0 


89 ± 4.0 


86 ± 0.4 


75 ± 1.0 


10 


18 ±2.0 


S± 1.0 


7 ± 2.0 


2 ±0.5 


l.O 


30 ± 0,4 


16 ± 2.0 


8 ± 1.0 


3 ± 1.0 


0.10 


82 ± 0,4 


79 ± 2.0 


82 ± 0.2 


63 ± 1.0 



♦ TGF-^ was added at the start of culture. 

* P815 (H-2**) target cells were used. Data arc from one of three repre- 
sentative experiments. Percent lysis against ^'Cr-labeled LBRM-SS- 
1 A5B %yas <10% at 50:1 £/T ratio. 




12 3 4 5 Control 

Day of TGF-p Addition 



Figure 1. Kinetics of TGF-/3 inhibition of CTL gencr* 
ation. 10 ng/ml TGF-i9 were added to MLC <m days 
shown. On day 5, this same concentration of TGFH9 was 
added to the CTL assay. Data presented are the mean ± 
S£ of four independent experiments performed at an 
£/T ratio of 25:1. Similar results were obtained at all 
E/T ratios tested (data not shown). 



T 1 r 




4 24 46 

CultUfB Hours 



FicintE 2. Inhibition of TNF<^ production dur- 
ing MLC by TGF-^. 300 mI were removed from 
replicate MLC at the times indicated, and assayed 
for TNF-a as described in Materials and Methods. 
MLC contained no TGF-^ 0.1 ng TGF*/9/ml 
or 1.0 ng TGF-/?/ml (O). The presence of 10 
TGf-fi in the WEHM64 MuTNF-a bioassay 
did not affect the measurement of MuTNF-a ac- 
tivity, while the presence of anti-rMuTNF-a anti- 
serum reduced the detection of MuTNF-a to back- 
ground levels. Statidard errors for all determina- 
tions were <10%. Results are presented as mean 
of triplicate determinations of one of four repre- 
sentative experiments. No significant amounts of 
MuTNF-a were produced by cultures containing 
B6 cells or irradiated BALB/c cells alone. 



to examine whether the inhibitory effects of TGF-/8 on CTL development could 
in part be due to the inhibition of TNF-a production. The concentration of 
MuTNF-a was measured at various times during the first 48 h of MLC in the 
absence or presence of LO or 0.1 ng/ml TGF-/3. The data (Fig. 2) indicate that 
as early as 4 h after culture initiation, -^7 X 10"* U/ml (0.1 pg/ml) of TNF-a 
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Table II 

Reversal ofTGF-& Inhibition ofCTL Generation by TNF-a 



Percent specific "Cr release (± SE) at* 
Treaiinenf '^"o* 





25:1 


12.5:1 


6.25:1 


Control 


83 ± 3.0 


80 ± 4.0 


69 ± 4.0 


10ng/fnlTGF-/9 
10* U/m» rMuTNF-a 


16 ± 1.0 


12 ± 3.0 


7 ±2.0 


83 ± 2.0 


85 ± 4.0 


78 ± 1.0 


10 ng/ml TCF-^ + rMuTNF-a at 
I0» U/ml 








54 ± 7.0 


35 ± 1.0 


21 ±0.6 


10* U/ml 


62 ± 3.0 


48 ± 1.0 


25 ± 2.0 


I0» U/ml 


49 ± 3.0 


40 ± 3.0 


18 ± 1.0 


10» U/ml 
10' U/ml 


28 ± 1.0 


20 ± 3.0 


1 1 ± 2.0 


16 ±2.0 


8± 0.4 


5 ± 1.0 



* TGF-^ and/or rMuTNF-a were added to MLC on day 0 at concentrations indicated. 

* Data are from one of three representative experiments. 




rMuTNF-a(IOUAnr) - ♦ 

Stimutatof:Res p onder 0.1 :1 
Cet Ratio 



FictntE 3. EfTects of rMuTNF-a on CTL generation 
during suboptimal culture conditions. MLC were estab- 
lished at stimulator/responder ratios of 1 : 1 (Q) and 0.1:1 
(p). 10 U/ml rMuTNF-a %vas added on day 0 to MLC at 
a 0.1:1 ratio and at a 1:1 ratio Results are mean 
± SE of triplicate determinations performed at an E/T 
cell ratio of 12.5:1. Similar results were obtained at all 
E/T cell ratios tested (data not shown). 



1:1 




12.5:1 



6.25:1 3.13:1 

Effector:Target Cell Ratio 



1.56:1 



Figure 4. Inhibition of CTL generation by 
antibodies to rMuTNF-a. Rabbit serum poly- 
clonal antibodies to rMuTNF-a was added at a 
1:100 final dilution on day 0 of MLC. MLC 
contained: no antibodies (Q); polyclonal anti- 
bodies to rMuTNFnx (A); NRS (O). Results are 
mean ± SE of triplicate determinations. CTL 
were washed three times in CMEM to prevent 
carryover of antibodies to rMuTNF-a into the 
^'Cr assay. 
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can be detected and that TGF-^ suppresses the production of TNF-a in a dose^ 
dependent manner. 

To determine if the suppression of MuTNF-a production was critical to the 
inhibitory activity of TGF-^ on CTL generation, exogenous rMuTNF-a was 
added to MLC in the absence and presence of TGF-/8 (Table II). The addition 
of 10* U/ml of rMuTNF-a to TGF-j9-suppressed MLC significantly restored the 
CTL activity. These results are not due to any direct activity of rMuTNF-a on 
the P8 1 5 target cells since the viability of these cells was not affected by rMuTNF- 
a doses as high as 10^ U/ml. When rHuTNF-cr was substituted for rMuTNF-a, 
reversal of TGF-iS suppression was detected only at concentrations of 10* U/ml 
rHuTNF-a, and no activity was observed when rHuTNF-/S was used at this same 
concentration (data not shown). These results may indicate a species preference 
with regard to action of TNF-a in this context » as suggested in previous studies 

(11). 

Enhancement of CTL Development by rMuTNF-a, Under the optimal conditions 
used in developing the CTLs» the addition of rMuTNF-a at the doses tested had 
only minimal enhancing activity. However, at a suboptimal responder/stimulator 
ratio (0.1:1), rMuTNF-a significantly enhanced both the proliferative response 
(dau not shown) and CTL generation to H-2'' targets (Fig. 3). 

InhUndon of CTL Generation by Antibodies to rMuTNF-ix. As our earlier studies 
demonstrated that rMuTNF-a can significantly enhance CTL development if 
added to MLC established at suboptimal stimulator to res]X>nder ratios, we 
considered it important to investigate whether antibodies to rMuTNF-a could 
inhibit CTL generation. As shown in Fig. 4, the addition of rabbit polyclonal 
antibodies to rMuTNF*a, but not addition of NRS to MLC on day 0 significantly 
inhibited CTL generation, further supporting our data of the critical role of 
TNF-a during CTL development. 

Discussion 

We have investigated the effects of TGF-|9 and rMuTNF-a on CTL generation 
and function. Our investigations have indicated that: (o) TGF-/3 will inhibit, in a 
dose-dependent manner, CTL generation, but only when TGF-/9 is added in the 
early stages of the MLC; {b) TGF-/3 does not inhibit the cytotoxic activity of 
CTL; (c) TNF-a production, which can be detected as early as 4 h after the 
initiation of an MLC, is inhibited by TGF-/5; {d) addition of rMuTNF-a to TGF- 
/3-inhibited MLC significantly reverses the inhibitory activity of TGF-/3; and (e) 
antibodies to rMuTNF-a significantly inhibit CTL development. These results 
show that TGF-/3 does not inhibit CTL generation by nonspecific cytostatic or 
cytotoxic processes, since the addition of TGF-/9 to MLC for the final 72 h does 
not affect the CTL activity. Rather, TGF-/3 appears to inhibit a differentiation 
step in the early development of CTL. In addition, similar results have been 
obtained with rHuTGF-/3 (the polypeptide sequence of mature human and 
jK>rcine TGF-/9 is identical) indicating that the effects observed with the natural 
p>orcine preparation are mediated by TGF-0 alone, and not a contaminant (data 
not shown) (17). 

Previous studies have indicated that CTL generation involves the production 
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of two or more lymphokines, including IL-2 (18-21). TGF-fi has been shown to 
affect T cell activation by downregulating IL-2-R expression (2) and thereby 
inhibiting T cell proliferation. Scheurich et al. (10) have reported that rHuTNF- 
a can enhance the response to IL-2 by upregulating the expression of IL-2-R. 
Our data indicate that TNF-a plays an important role in the generation of CTL, 
and that TGF-fi may inhibit the development of such cells, at least in part, by 
inhibiting TNF-a production. Our data also demonstrate that, while the reversal 
of TGF-/S suppression is significant, it is not complete, regardless of the rMuTNF- 
a dose used. This may point to multiple effects of TGF-^ on CTL generation, 
only some of which are reversible by TNF-of, and which require the presence of 
additional cytokines. In recent experiments, we have observed that IL-2, which 
induces TNF-a production, can also reverse TGF-/? inhibition of CTL generation 
(our unpublished observations) (22). Thus, TGF-fi may inhibit CTL generation 
directly by downregulating IL-2-R expression, or indirectly .by preventing up- 
regulation of this receptor by blocking TNF-a production. Studies are now 
under way to further investigate the role of TNF-a in CTL generation, and the 
suppression of that process by TGF-fi, 

Summary 

The immunoregulatory effects of transforming growth factor (TGF-/9) and 
recombinant murine tumor necrosis factor a (rMuTNF-o) on CTL generation 
and activity were examined. The results demonstrate that TGF-/3, in a dose- 
dependent manner, inhibited CTL generation but not CTL activity. The inhib- 
itory effects were detected only when TGF-fi was added within the first 48 h of 
the MLC. Little activity was seen when it was added thereafter, including the 
addition of TGF-fi to the cytotoxicity assay. The production of TNF-a, which 
occurs during early phases of the MLC and which is inhibited in the presence of 
TGF-/8, appears to have an important regulatory role, as altering the levels of 
TNF-a in an MLC can significantly influence CTL development. The inhibitory 
effects of TGF'0 on the MLC can be significantly reversed by the addition of 
rMuTNF-a to the cultures. These results demonstrate that TGF-fi can inhibit 
MLC and subsequent CTL generation at early stages of the reaction, and such 
inhibition may involve the suppression of TNF-a production. 

We thank Ms. Stephanie Shipley for excellent technical assistance, Mr. Chris Nelson for 
preparing the rabbit anti-rMuTNF-a antisera, and Dr. £. Rinderknecht (Genentech, Inc.) 
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P00761 (TRYP_PIG) Reviewed, UniProtKB/Swiss-Prot 

Last modified April 5, 2011. Version 100. 



Names and origin 



Protein names 



Recommended name: 
Trypsin 
EC=3.4.21.4 



± 



Organism 

Taxonomic identifier 



Sus scrofa (Pig) 



— 



9823 [NCBI] 



Taxonomic lineage 



Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > 
Euteleostomi > Mammalia > Eutheria > Laurasiatheria > 
Cetartiodactyla > Suina > Suidae > Sus 



Protein attributes 



Sequence length 
Sequence status 
Sequence processing 

Protein existence 



! 231 AA. 

1 

i Complete. 



; The displayed sequence is further processed into a mature 
i form. 



I Evidence at protein level. 



General annotation (Comments) 



Catalytic activity 
Cofactor 

Subcellular location 



Preferential cleavage: Arg-|-Xaa, Lys-j-Xaa. 
Binds 1 calcium ion per subunit. 
Secreted > extracellular space. 



Sequence similarities 



i Belongs to the peptidase S1 family. 
; Contains 1 peptidase SI domain. 



Ontologies 



Keywords 

' Biological process 



Cellular component 
Ligand 



Digestion 



Secreted 



Calcium 
Metal-binding 



Molecular function 



j Hydrolase 

! Protease 

i Serine protease 



PTM 



Technical term 



j Disulfide bond 
j Zymogen 

i 3D-structure 
Direct protein sequencing 
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Gene Ontology (GO) 

Biological process 



j digestion 

I Inferred from electronic annotation. Source: UniProtKB-KW 





proteolysis 

Inferred from electronic annotation. Source: InterPro 


Cellular component 


extracellular space 

Inferred from electronic annotation. Source: UniProtKB-SubCell 


Molecular function 


metal ion binding 

Inferred from electronic annotation. Source: UniProtKB-KW 

serine-type endopeptidase activity 

Inferred from electronic annotation. Source: InterPro 



Complete GO annotation. 



Sequence annotation (Features) 



Feature key i Position(s) Length Description I Graphical view 



Molecule processing 



Feature identifier 





; Propeptide 


i 1-8 

\ 

i 

i 
i 


8 


Activation 
peptide 


1 1 

i 




1 Chain 


f 9-231 

r 


223 


Trypsin 


1 1 


Regions 










m 


Domain 


I 9-229 


221 


Peptidase 
SI 


; 1 


Sites 










□ 


Active site 


48 


1 


Charge 

relay 

system 


1 


□ 


Active site 


92 


1 


Charge 

relay 

system 


1 


□ 


Active site 


185 


1 


Charge 
relay 

system 






Metal binding 


60 


1 


Calcium 


— 

1 




Metal binding 


62 


1 




Calcium; 
via 

carbonyl 
oxygen 


1 


□ 


Metal binding 

• 

1 


65 

_ J 


1 

.J 


Calcium; 
via 

carbonyl 
oxygen 


1 



PRO 0000028205 



PRO 0000028206 
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Amino acid modifications 



Secondary structure 
1 

I Helix I Strand | Turn 

Details... 

Sequences 



LJ 


Metal oinding 




1 


oaicium 


1 
1 


□ 


Site 


179 


1 


Required 
for 

specificity 

} By similarity | 





□ 


Disulfide bond 


1 5 ^— > 


145 






1 


□ 


Disulfide bond 


33 <^ 


^49 






11 


□ 


Disulfide bond 


117 ^ 


218 








□ 


Disulfide bond 


124^ 


191 








□ 


Disulfide bond 


156*-^ 


170 








□ 


Disulfide bond 


181 ^ 


205 








Natural variations 








□ 


Natural variant 




20 


1 


1 — V 


1 



II 



i I 



* « * * • 



23-^ 



Sequence 



Length iVIass (Da) 



P00761 [UniParc]. 

Last modified July 21, 1986. Version 1. 
Checksum: A0A125CF7FC138C2 



FASTA 



231 



24.409 
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r 

« Hide 
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3D structure databases 
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Proteomic databases 
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Genome annotation databases 










EnsembI 


ENSSSCT0000001 7947; ENSSSCP0000001 7465; 




ENSSSCG00000016483. 






Phylogenomic databases 










GeneTree 


ENSGT00580000081 320. 






HOVERGEN 


HBG013304. 
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(jrinouD 


t\Jo*fOJOr V. 


Enzyme and pathway databases 




BRENDA 


3.4.21.4. 249. 


Family and domain databases 




InterPro 


i IPR009003. Pept cys/ser Trypsin-like. 
IPR018114. Peptidase S1/S6 AS. 
IPR001254. Peptidase_S1_S6. 
IPR001314. Peptidase_S1A. 
[Graphical view] 


Pfeim 


PF00089. Trypsin. 1 hit. 
[Graphical view] 


PRINTS 


PR00722. CHYMOTRYPSIN. 


SMART 


SM00020. Tryp^SPc. 1 hit. 
[Graphical view] 


SUPFAM 


SSF50494. Pept_Ser_Cys. 1 hit. 


PROSITE 


PS50240 TRYPSIN DOM 1 hit 
PS001 34. TRYPSIN HIS. 1 hit. 
PS00135. TRYPSIN_SER. 1 hit. 
[Graphical view] 


ProtoNet 


Search... 


Entry information 




Entry name 


TRYP_PIG 


Accession 

1 


Primary (citable) accession number: P00761 


Entry history 


Integrated into July 21 , 1 986 

UniProtKB/Swiss- 

Prot. 

Last sequence July 21 , 1 986 
update: 

Last modified: April 5, 201 1 

This is version 100 of the entry and version 1 of the sequence. 
[Complete history] 


Entry status 


Reviewed (UniProtKB/Swiss-Prot) 


Annotation program 


Chordata Protein Annotation Program 


Relevant documents 




PDB cross-references 

Index of Protein Data Bank (PDB) cross-references 


Peptidase families 

Classification of peptidase femilies and list of entries 


SIMILARITY comments 

Index of protein domains and families 



©2002-2011 UniProt Consortium | License & Disclaimer | Contact 
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P07477 (TRY1_HUMAN)* Reviewed. UniProtKB/Swiss-Prot 

Last modified April 5. 201 1 . Version 1 33. 



Names and origin 



rToiein names 

• 

i 


1 

rXGCornrnenuBu name. 

Trypsin-1 
EC=3.4.21.4 
Alternative name(s): 

Beta-trvDsin 

Cationic trypsinogen 
Serine protease 1 
Trypsin 1 

Cleaved Into the followina 2 chains: 

1. Alpha-trypsin chain 1 

2. Alpha-trypsin chain 2 




Name: PRSS1 
Synonyms:TRP1, TRY1. TRYP1 


" ' " ( 

Organism 


Homo sapiens (Human) [Complete proteome] 


Taxonomic identifier | 


9606 [NCBI] 





Taxonomic lineage | Eukaryota > Metazoa > Chordata > Craniate > Vertebrata > 

I Euteleostomi > Mammalia > Eutheria > Euarchontoglires > 
1 Primates > Haplorrhini > Catarrhini > Hominidae > Homo 



Protein attributes 



Sequence length 


1 247 AA. 


Sequence status 


j Complete. 


Sequence processing 


j The displayed sequence is further processed into a mature 




i form. 

H Ml 1 * 


Protein existence 


1 Evidence at protein level. 

> 


General annotation (Comments) 



Function 


Has activity against the synthetic substrates Boc-Phe-Ser-Arg- 
Mec, Boc-Leu-Thr-Arg-Mec, Boc-Gln-Ala-Arg-Mec and Boc-Val 
-Pro-Arg-Mec. The single-chain form is more active than the 
two-chain form against all of these substrates. 


Catalytic activity 


Preferential cleavage: Arg-|-Xaa, Lys-|-Xaa. 


Cofactor 


Binds 1 calcium ion per subunit. 


Subcellular location 


Secreted > extracellular space. 


Post-translational modification 

. 

_ .. ._ 1 


Occurs in a single-chain form and a two-chain form, produced 
by proteolytic cleavage after Arg-1 22. 


Involvement in disease 

j 


Defects in PRSS1 are a cause of hereditary pancreatitis (HPC) 
[MIM:1 67800]; also known as chronic pancreatitis (CP). HPC is 
an autosomal dominant disease characterized by the presence 
of calculi in pancreatic ducts. It causes severe abdominal pain 
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Sequence similarities 



Caution 



attacks. I Ref.e 1 1 Ref.ll I { Ref.12 1 1 Ref.16 1 1 Ref.17 1 1 Ref.16 1 1 Ref.19 | 
I Ref.20 II Rel21 H Ref.22 H Ref.23l 



Belongs to the peptidase S1 family. 
Contains 1 peptidase S1 domain. 



Ty r-1 54 was propo sed to be phosphory lated (LiiOE]) but it has 
been shown (I Ref.i4T) to be sulfated instead. Phosphate and 
sulfate groups are similar in mass and size, and this can lead to 
erroneous interpretation of the results. 



Mass spectrometry 



Molecular mass is 2 4348±2 Da from positions 24 - 247. 
Determined by ESI. i Reus I 
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Ontologies 


Keywords 




Biological process 


Digestion 


Cellular component 


Secreted 


Coding sequence diversity 


Polymorphism 


Disease 


Disease mutation 


Domain 


Signal 


Ligand 


Calcium 
Metal-binding 


Molecular function 


Hydrolase 
Protease 
Serine protease 


PTM 


Disulfide bond 

Sulfation 

Zymogen 


Technical term 


3D-structure 
Complete proteome 
Direct protein sequencing 


Gene Ontology (GO) 






Inferred from electronic annotation. Source: UniProtKB-KW 
proteolysis 

Inferred from electronic annotation. Source; InterPro 


Cellular component 


extracellular space 

Inferred from electronic annotation. Source: UniProtKB-SubCell 


Molecular function 


metal ion binding 

Inferred from electronic annotation. Source: UniProtKB-KW 
protein binding 

Inferred from physical interaction. Source: IntAct 

serine-type endopeptidase activity 

Traceable author statement 1 ^^^-^^ 1. Source: UniProtKB 


Complete GO annotation... 




Binary interactions 





With Entry #Exp. 

i I 

GRB2 * P62993 I 1 



IntAct 

EBI-1222902.EBI-401755 



Notes 



Sequence annotation (Features) 



Feature key 



Position(s) 



Length 



Description 



Graphical view 



i Feat ij re 
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Molecule processing 

^ 1 Signal peptide 


1 -15 


15 


Ref.9 Ref.10 

' 1 


1 








Propeptide 


16-23 


8 


Activation peptide 


II 


PRO. 






Chain 


24 — 247 


224 
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1 


PRO 


01 


liil 


Chain 
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99 


Alpha-trypsin chain 
1 


1 1 


PRO. 


-01 




Chain 
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2 


i 


PRO, 



Regions 



Site! 
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s 

Active site 
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221 


Peptidase 81 

Charge relay system 


1 1 
1 i 

. 








Active site 
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Charge relay system 


1 1 
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□ 


Active site 
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1 
1 


Charge relay system 
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□ 


Metal binding 


75 


1 


Calcium 


1 ! 




□ ^ 


l^etal binding 


77 


1 


Calcium; via 
carbonvl oxvaen 


1 ! ' 

i 

i 




Metal binding 


80 




1 


Calcium; via 
carbonyl oxygen 


1 1 

i 




H 


Metal binding 


85 




Calcium 


1 ' 






jjMIM 

Ami 

H : 


Site 

no acid modificatio 
Modified residue 


194 

ns 

154 


1 


Required for 
specificity 

1 By similarity | 

Sulfotyrosine LB^^^^ 1 


1 






B i Disulfide bond 


30^ 160 1 




1 i 


■ 




iLi 


Disulfide bond 


48^64 ! 




II 






Disulfide bond 1 139^206 






1 1 


i 




□ 


Disulfide bond | 171 ^ 185 






11 


( 






Disulfide bond | 196^220 






1 1 







Natl 

□ 


iral variations 
Natural variant 


16 


1 


A V In HPC; 
disrupts signal 
sequence cleavage 

fiitp 1 Ref.l8 1 




VAR. 


0' 




Natural variant 


22 


1 


D G in HPC; 
Increased rate of 

artix/fltion 1 Ref.8 I 


1 


VAR. 


P' 
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□ 


Natural variant 


23 


1 


K R in HPC; 
increased rate of 
activation. 1 ^^^-^^ 1 


1 

1 


j VAR 

1 
f 

! 

i 

i „. 


0' 




Natural variant 


29 


1 


\ N lin HPC. 

Ref.12 II Ref.16 | 
! Ref.17 II Ref,23 


1 1 VAR 

I i 

i 

1 i 


□ 


Natural variant 


29 


1 


N T in HPC. 

1 Ref.21 1 


1 1 

r 

1 


i VAR. 

J 

i 
i 


0' 




Natural variant 


54 


1 


1 N S in HPC; 
associated with tle- 
29; the double 
mutant shows 
increased 
autocatalytic 
activation which is 
solely due to the lle- 
29 mutation, 1 23 j 


1 VAR. 

\ 
( 
I 

t 

I 
f 

\ 

s 
i 

i 

1 
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o: 


□ 


Natural variant 

■ 
■ 


79 

: 

■ 
■ 
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1 

■ 

i 

: 

: 
, 
. 


E K in HPC; Lys- 
79 trypsin activates 
anionic trypsinogen 
PRSS2 2-fold while 
the common 
pancreatitis- 
associated mutants 
His-122 orlle-29 
have no such effect. 

[dbSNP:rs28934902] 

j Ref.22 j 


1 1 VAR 
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t 

i 
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Natural variant 
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1 j VAR. 
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i 
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-0' 




Natural variant 


116 ; 


1 


R C in HPC. 

Ref.12 1 


1 


VAR 




Natural variant 
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1 


R C in HPC; 
suppresses an 

autocleavage site. 

1 Ref.21 1 


1 


VAR. 


0' 




Natural variant 

1 


122 


1 i 

J 


R ^ H in HPC; 
suppresses an 
autocleavage site 
which is probably 
part of a fail-safe 
mechanism by which 
trypsin, which is 
activated within the 
pancreas, may be 
inactivated; loss of j 
this cleavage site j 
would permit i 
autodigestion ' 
resulting in | 

panrrMtitis 1 Ref.11 [ i 

i Ref.12 1 Ref.16 1 
1 Ref.18 Ret. 20 ! 


1 

( 

« 

1 
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□ 


Natural variant 


137 


1 


T — > M in a 
colorectal cancer 
sample; somatic 

mutatinn 1 Ref 24 | 


1 


VAR 

1 
1 




□ 


Natural variant 


139 


1 


C F in HPC. 

1 Ref.12 1 


1 


VAR 


.0- 


Experimental info 














□ 


Mutagenesis 


154 


1 


Y -> F: Lack of 

fiulfetion 1 Ref.14 | 
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Sequence databases 



@EMBL 
^ GenBank 
® DDBJ 



IPI 



PIR 



RefSeq 



M27602 mRNA. Translation: AAA61 232.1. 



4 ._ 



IPI00815665. 



; B25852. 



NP 002761.1. NM 002770.2. 



UniGene 



Hs.449281 . 



http://www.uniprot.org/uniprot/P07478 
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3D structure databases 




ProteinModelPortal 


P07478. 


SMR 


P07478. Positions 24-247. 


Mod Base 


Search... 


Protein-protein interaction databases 


STRING 


P07478. 


Protein family/group databases 




MEROPS 


S01.258. 


PTM databases 




PhosphoSite 


P07478. 


Proteomic databases 




PRIDE 


P07478. 


Genome annotation databases 


• 


: 

EnsembI 

■ 


ENST00000438955; ENSP00000414534; ENSG00000204983. 


GenelD 1 5645. 

i 


KEGG j 


hsa:5645. 



UCSC uc003wap.1. human. 



Organism-specific databases 

CTD I 5645. 

1 

GeneCards j GC07P1 42298. 

,__ * 

H-lnvDB ^ I HtX01 75861. . 

HGNC j HGNC:9483. PRSS2. 

HPA i CAB025487. 

i CAB025538. 



MIM 


601564. gene. 


neXtProt 


NX_P07478. 


Orphanet 


676. Hereditary chronic pancreatitis. 


PharmGKB 


PA33833. 


GenAtlas 


Search... 


Phylogenomic databases 




eggNOG 


prNOG14047. 


HOVERGEN 


HBG013304. 



InParanoid P07478. 



http://www.uniprot.org/uniprot/P07478 
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Enzyme and pathway databases 



BRENDA 

Gene expression databases 

ArrayExpress 


3.4.21.4. 247. 
P07478. 


Bgee 


P07478. 


CteanEx 


HS_PRSS2. 


Genevestigator 


P07478. 


GermOnline 

Family and domain databases 

interrro 

* 


ENSG00000204982. Homo sapiens. 

irKUuyuuo. Kept cys/ser i rypsin-iiKe. 
IPR018114. Peptidase S1/S6 AS. 
IPR001254. Peptidase S1 S6. 
IPR001314. Peptidase_S1A. 

[oraphicai viewj 

1^ J 1 ^ 


Pfam 


PF00089. Trypsin. 1 hit. 
[Graphical view] 


PRINTS 


PR00722. CHYMOTRYPSIN. 


SMART 


SM00020. Tryp_SPc. 1 hit, 
[Graphical view] 


SUPFAM 


SSF50494. Pept_Ser_Cys. 1 hit. 


PROSITE 


PS50240. TRYPSIN DOM. 1 hit. 
PS00134. TRYPSIN HIS. 1 hit. 
PS00135. TRYPSIN_SER. 1 hit. 
[Graphical view] 


ProtoNet 

Other Resources 

NextBio 
SOURCE 


Search... 

21930. 
Search... 



Entry information 



Entry name 


TRY2_HUMAN 


Accession 


Primary (citable) accession number: P07478 


Entry history 


Integrated into April 1. 1988 

UniProtKB/SwIss- 

Prot: 

Last sequence April 1, 1988 
update: 

Last modified: • April 5, 201 1 

This is version 1 16 of the entry and version 1 of the sequence. 
[Complete history] 


Entry status 


Reviewed (UniProtKB/Swiss-Prot) 


Annotation program 


Chordata Protein Annotation Program 



Disclaimer 



j Any medical or genetic information present in this entry is 

' provided for research, educational and informational purposes 



http://w\vw.uniprot.org/uniprot/P07478 
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I only. It is not in any way intended to be used as a substitute for 
professional medical advice, diagnosis, treatnnent or care. 



Relevant documents 



Human chromosome 7 

Human chromosome 7: entries, gene names and cross-references to MIM 

Human entries with polymorphisms or disease mutations 

List of human entries with polymorphisms or disease mutations 

Human polymorphisms and disease mutations 

Index of human polymorphisms and disease mutations 

MIM cross-references 

Online Mendelian Inheritance in Man (MIM) cross-references in UniProtKB/Swiss-Prot 
Peptidase families 

Classification of peptidase families and list of entries 

SIMILARITY comments 

Index of protein domains and families 



©2002-2011 UniProt Consortium | License & Disclaimer | Contact 
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P35030 (TRY3_HUMAN)* Reviewed, UniProtKB/Swiss-Prot 

Last modified April 5, 201 1 . Version 1 14. 



Names and origin 



Protein names 



Recommended name: 
Trypsin-3 
EC=3.4.21.4 

Alternative name(s): 
Brain trypsinogen 
Mesotrypsinogen 
Serine protease 3 
Serine protease 4 
Trypsin HI 
Trypsin IV 



Gene names 



Organism 



Name: PRSS3 
Synonyms:PRSS4. TRY3, TRY4 

Homo sapiens (Human) [Complete proteome] 



Taxonomic identifier 



9606 [NCBI] 



Taxonomic lineage 



Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > 
Euteleostomi > Mammalia > Eutheria > Euarchontoglires > 
Primates > Haplorrhini > Catarrhini > Hominidae > Homo 



Protein attributes 



Sequence length 


1 304 AA. 


Sequence status 


1 Complete. 



Sequence processing 


The displayed sequence is further processed into a mature 
form. 


Protein existence 


Evidence at protein level. 


General annotation (Comments) 


Function 


Digestive protease specialized for the degradation of trypsin 

inhibitors Ref.5 | 


Catalytic activity 


■ ■ ■ 

Preferential cleavage: Arg-|-Xaa, Lys-|-Xaa. 


Cofactor 


Binds 1 calcium ion per subunit. 



Subcellular location 
Tissue specificity 



-I- 



Secreted. 



Pancreas and brain. 



Sequence similarities 



Belongs to the peptidase SI family. 
Contains 1 peptidase SI domain. 



Ontologies 



Keywords 



http://www.uniprot.org/uniprot/P35030 
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Biological process 
Cellular component 



I 



Digestion 
Secreted 



Coding sequence diversity 



I Alternative splicing 





Polymorphism 


Domain 


Signal 


Ligand 


Calcium 
Metal-binding 


Molecular function 


Hydrolase 

Serine protease 


PTM 


Disulfide bond 

Sulfation 

Zymogen 


Technical term 


3D-structure 
Complete proteome 


Gene Ontology (GO) 




Dioiogicai process 


digestion 

Traceable author statement -B^iiJ. Source: UniProtKB 
endothelial cell migration 

Inferred from mutant phenotype. Source: UniProtKB 
zymogen activation 

Inferred from direct assay. Source: UniProtKB 


Cellular component 


extracellular space 

Inferred from direct assay. Source: UniProtKB 


Molecular function 


calcium ion binding 

Inferred from direct assay. Source: UniProtKB 
protein binding 

Inferred from physical interaction L5§ti.. Source: UniProtKB 

serine-type endopeptidase activity 

Inferred from direct assay l^^LLi. Source: UniProtKB 

serine-type peptidase activity 

Inferred from direct assay. Source: UniProtKB 



Complete GO annotation. 



Alternative products 



This entry describes 3 isoforms produced by alternative splicing. [Align] [Select] 
Isof orm A (identifier: P35030-1 ) 

This fsoform has been chosen as the 'canonical' sequence. All positional infonnation in this entry 
refers to it. This is also the sequence that appears in the downloadable versions of the entry. 

Isoform B (identifier: P35030-2) 



http://www.uniprot.org/uniprot/P35030 
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The sequence of this isoform differs from the canonical sequence as follows: 
1 -45: MCGPDDRCPARWPGPGRAVKCGKGU\AARPGRVERGGAQRGGAGL 



M 



Isoform C (identifier: P35030-3) 

The sequence of this isoform differs from the canonical sequence as follows: 
1-70: MCGPDDRCPA...DADGGEALGT MNPFLILAFVGAA 



Sequence annotation (Features) 



Feature key 



Molecule processing 



Position(s) 



Length i Description 



Graphical view 



Featu 



□ 


Signal peptide 


1 -? 




1 Potential | 








□ 


Propeptide 


?-80 




Activation peptide 





PR 


3. 




Chain 


81 - 304 


224 


Trypsin-3 


1 


PR 


3. 



Regions 

Domain 



81 - 301 



221 Peptidase 81 



Sites 



Active site 



120 



Active site 



Active site 



Metal binding 



164 



257 



132 



Charge relay 
system 



Charge relay 
system 



Charge relay 
system 



Calcium 



Metal binding 



134 



Calcium; via 
carbonyl oxygen 



Metal binding 



137 



Calcium; via 
carbonyl oxygen 



Metal binding 



142 



Calcium 



Site 



251 



Required for 

specificity 

1 By similarity | 



Amino acid modifications 
Modified residue 



211 



1 I Sulfotyrosln e 

' I By similarity j 



Disulfide bond 



87*-^ 217 



□ 



Disulfide bond 



□ 



Disulfide bond 



□ 



Disulfide bond 



105 
196 



121 
263 



J 



II 



I I 



228 ^ 242 



II 



http ://www. un iprot.org/uniprot/P3 5030 
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Disulfide bond 



Natural variations 



253 ^> 277 I 



I I 



Afternatlve sequence 



Alternative sequence 



Natural variant 



Natural variant 



1-70 



1 -45 



188 



224 



70 MCGPD...EALGT 



MNPFLILAFVGAA 
in isoform C. 



45 



1 



MCGPD...GGAGL 
— ► M in isoform B. 



A. 

[dbSNP:rs855581] 

1 Ref"l irRef.2 || Ref.3 | 



T — * S. 

[dbSNP:rs1 063273] 



VSP 



VSP 



VAR 



VAR 



□ 



Natural variant 



232 



Y — C. 

[dbSN P.TSi 048379] 



VAR 



Experimental info 

D ! Sequence conflict 



89 




□ 



Sequence conflict 



224 - 225 



Sequence conflict 



253 - 254 



1 



Missing in 
CAA50484. 1 Ref.i 



TQ "> RE in 

CAA33527. i 



CQ WK in 



CAA33527. 1 Re^-^ 



Secondary structure 



I Helix I Strand | Turn 



Details. 



Sequences 



Sequence 



Length Mass (Da) 



Isoform A [UniParc]. 

Last modified October 14, 2008. Version 2. 
Checksum: 4C4303C310B7BFFC 



FASTA 



304 



32.529 



r 



10 20 30 40 50 60 

MCGPDDRCPA RWPGPGRAVK CGKGLAAARP GRVERGGAQR GGAGLELHPL LGGRTWRAAR 

70 eO 90 100 110 120 

DADGCEALGT VAVPFDDDDK IVGGYTCEEN SLPYQVSLNS GSHFCGGSLI SEQWWSAAH 

130 140 150 ISO 170 180 

CYKTRIQVRIi GEHNIKVLEG NEQFINAAKI IRHPKYNRDT LDNDIMLIKL SSPAVINARV 

190 200 210 220 230 24£ 

STISLPTTPP AAGTECLISG WGNTLSFGAD YPDELKCLDA PVLTQAECKA SYPGKITNSM 



250 



260 



270 



280 



290 



300 
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FCVGFLBGGK DSCQRDSGGP WCttGQUQGV VSWGHGCAWK NRPGVYTKVY NYVDWIKDTI 



; AANS 



« Hide 



Isoform B. 

Checksum: CD8AA6E8072BCE56 
Show » 



FASTA 



260 



28.161 



Isoform C. 

Checksum: 01563666780A6607 
Show » 



FASTA 



247 



26.727 



References 



« Hide large scale' references 



[1] I "Cloning of the cDNA encoding human brain trypslnogen and characterization of Its 
I product." 

I Wiegand U., Corbach S., Minn A., Kang J., Mueller-Hil! B. 
' Gene 136:167-175(1993) [PubMed: 8294000] [Abstract] 
Cited for : NUCLEOTIDE SEQUENCE [MRNA] (ISOFORMS A AND B), VARIANT ALA-188. 
Tissue: Brain. 



[2] j "Nucleotide sequence of the human pancreatic trypsinogen ill cDNA." 

! Tani T., Kawashima I., Mita K., Takiguchi Y. 

i Nucleic Acids Res. 18:1631-1631(1990) [PubMed: 2326201] [Abstract] 

! Cited for : NUCLEOTIDE SEQUENCE [MRNA] (ISOFORM C). VARIANTS ALA-188 AND CYS- 
' 232. 
Tissue : Pancreas. 

LJ-LJ-.l.l-rj...;.| L II 11 III 

[3] I Fukuoka S. 

Submitted (FEB-1995) to the EMBL/GenBank/DDBJ databases 
Cited for : NUCLEOTIDE SEQUENCE [MRNA] (ISOFORM C), VARIANT ALA-188. 



1 



[4] 



"DNA sequence and analysis of human chromosome 9." 

Humphrey S. J., Oliver K., Hunt A.R., Plumb R.W., Loveland J.E., Howe K.L., Andrews T.D.. 

Searle S., Hunt S.E., Scott C.E,. Jones M.C., Ainscough R., Almeida J. P., Ambrose K.D.. 

Ashwell R.I.S.. Babbage A.K.. Babbage S., Bagguley C.L. lH Dunham I. 

Nature 429:369-374(2004) [PubMed: 15164053] [Abstract] 

Cited for : NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. 



[5] 



"Human mesotrypsin is a unique digestive protease specialized for the degradation of 
trypsin inhibitors." 

Szmola R.. Kukor Z.. Sahin-Toth M. 

J. Biol. Chem. 278:48580-48589(2003) [PubMed: 14507909] [Abstract] 
Cited for: FUNCTION. 



[6] 



"Crystal structure reveals basis for the inhibitor resistance of human brain trypsin. 

Katona G,, Berglund G.I., Hajdu J., Graf L.. Szilagyi L. 

J. Mol. Biol. 315:1209-1218(2002) [PubMed: 11827488] [Abstract] 

Cited for : X-RAY CRYSTALLOGRAPHY (1 .7 ANGSTROMS) (ISOFORM A). 



ft 



t Additional computationally mapped references. 
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Cross-references 


Sequence databases 




@EMBL 
® GenBank 
® DDBJ 


X72781 mRNA. Translation: CAB58178.1. 

X71345 mRNA. Translation: CAA50484.1. 

X15505 mRNA. Translation: CAA33527.1. 

D45417 mRNA. Translation: BAA08257.1. 

AL356489, AL139113, AL358573 Genomic DMA. Translation: 

CAH69873.1. 

AL358573, AL1391 13 Genomic DNA. Translation: CAI39514.1. 
AL358573. AL139113, AL356489 Genomic DNA. Translation: 
CAI3951 5 1 

AL139113. AL358573 Genomic DNA. Translation: CAI39655.1. 
AL139113, AL366489. AL358573 Genomic DNA. Translation: 
CAI39658.1, 


IPI 


IPI00015614. 
IPI00220839. 
1 IPI00843764. 


FIR 


812764. 
S33496. 


RefSeq 


NP_002762.2. NM_002771.3. 


UniGene 


Hs.664513. 


3D structure databases 




® RCSB PDB 
W PDBj 


entry Method Resolution Chain Positions PDosum 

(A) 

in4W A-ray 1.7U A o1-oU4 i»j 
2R9P x-ray 1.40 A/B/C/D 81-304 [»] 
3L33 X-ray 2.48 A/B/C/D 81-304 [»] 


ProteinModelPortal 


P35030. 


SMR 


P35030. Positions 81-304. 


ModBase 


Search... 


Protein-protein interaction databases 


STRING 


P35030. 


Protein family/group databases 




MEROPS 


SOI. 174. 


Proteomic databases 




PRIDE 


P35030. 


Genome annotation databases 




Ensembl 


ENST00000361005; ENSP00000354280; ENSG00000010438. 


GenelD 


5646. 


KEGG 


hsa:5646. 


UCSC 


uc003ztj.2. human. 



http://www.uniprot.org/uniprot/P35030 
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Organism-specific databases 



CTD 


5646. 


GeneCards 


GC09P033708. 




HGNC 


HGNC:9486. PRSS3. 


MIM 


613578. gene. 


neXtProt 


NX_P35030. 


GenAtlas 


Search... 




pnyiogenomic aataDases 






eggNOG 


maNOG08762. 




HOGENOM 


i HBG755338. 


HOVERGEN 

• 


1 HBG013304. 


mnaranoio 


P35030. 




OrthoDB 


EOG4SJ5FV. 


PhylomeDB 


P35030. 




cii£yiTie aiivi painway uaiaDases 






BRENDA 


3.4.21.4. 247. 




Gene expression databases 






ArrayExpress 


P35030. 




Bgee 


P35030. 


CleanEx 


HS_PRSS3. 


Genevestigator 


P35030. 


GemiOnline 


ENSG00000010438. Homo sapiens. 




Family and domain databases 






InterPro 


IPR009003. Pept cys/ser Trypsin-like. 
IPR0181 14. Peptidase S1/S6 AS. 
IPR001254. Peptidase 81 S6. 
IPR001314. Peptidase_S1A. 
[Graphical view] 




Pfam 


PF00089. Trypsin. 1 hit. 
[Graphical view] 


PRINTS 


PR00722. CHYMOTRYPSIN. 


SMART 


SM00020. Tryp^SPc. 1 hit. 
[Graphical view] 


SUPFAM 


SSF50494. Pept_Ser_Cys. 1 hit. 


PROSITE 


PS50240. TRYPSIN DOM. 1 hit. 
PS00134. TRYPSIN HIS. 1 hit, 
PS00135. TRYPSIN_SER. False negative. 
[Graphical view] 


ProtoNet 


Search... 



http://www.uniprot.org/uniprot/P35030 
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Other Resources 




NextBio 


j 21934. 


SOURCE 


1 Search... 



Entry information 


Entry name 


f 

i TRY3JHUMAN 


Accession 


< 

\ Primary (citable) accession number: P35030 

1 Secondary accession number(s): A9Z1 Y4 l^i Q9UQV3 


Entrv hi^torv 

IbI III y iiiwiwiy 


f — . , — ^ — „ — 

integrateci into reoruary i, iyy4 

UniProtKB/Swiss- 

Prot: 

Last sequence October 14, 2008 
update: 

Last modified: April 5, 201 1 

This Is version 1 14 of the entry and version 2 of the sequence. 

[Complete history] 


Entry status 


Reviewed (UniProtKB/Swiss-Prot) 


Annotation program 


Chordata Protein Annotation Program 


Disclaimer 

: 

: 

: 
1 

1 


Any medical or genetic information present in this entry is 
provided for research, educational and informational purposes 
only. It is not in any way intended to be used as a substitute for 
professional medical advice, diagnosis, treatment or care. 



Relevant documents 



I Human chromosome 9 

Human chromosome 9: entries, gene names and cross-references to MIM 

Human entries with polymorphisms or disease mutations 

List of human entries with polymorphisms or disease mutations 

Human polymorphisms and disease mutations 

Index of human polymorphisms and disease mutations 

MIM cross-references 

Online Mendelian Inheritance in Man (MIM) cross-references in UniProtKB/Swiss-Prot 

PDB cross-references 

Index of Protein Data Bank (PDB) cross-references 

Peptidase families 

Classification of peptidase families and list of entries 

SIMILARITY comments 

Index of protein domains and families 



©2002-2011 UniProt Consortium | License & Disclaimer | Contact 



EMBL-'EBI 
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Display Settings: GenPept 



trypsinogen [Homo sapiens] 



GenBanIc AAA61231.1 
FASTA Graphics 



Goto: 
LOCUS 

DEFINITION 

ACCESSION 

VERSION 

DBSOURCB 

KEYWORDS 

SOORCE 

ORGANISM 



REFERENCE 
AUTHORS 

TITLE 

JOURNAL 
PUBMED 
COMMENT 
FEATURES 

source 



AAA61231 247 aa linear PRI 14-JAN-1995 

trypsinogen [Homo sapiens] . 

AAA61231 

AAA61231.1 GI: 521216 

locus KDMTRPSGNA accession M22612 . 1 

Homo sapiens (human) 

Homo sapiena 

Bukaryota,- Metazoa; Chordata; Crania ta; Vertebrata; Euteleostomi; 
Mammalia; Butheria; Buarchontoglires; Primates; Haplorrhini; 
Catarrhini; Hominidae; Homo. 
1 (residues 1 to 247} 

Emi,M., Nakamura.Y., Ogawa,M., Yamamoto.T., Nishide^T., Mori«T. and 
Matsubara, K. 

Cloning, characterization and nucleotide sequences of two cDNAs 
encoding htiman pancreatic trypsinogens 
Gene 41 (2-3), 30S-310 (1986) 

3011602 



Protein 



sig peptide 



mat peptide 



Region 



ORIGIN 



Method: conceptual translation. 
IjOcation/Quali£iers 
1. .247 

/organisin="Homo sapiens" 
/db_xre f = " taxon : 9606 " 
/map- " 7q3 2 - q te r * 
/ t i s s ue_t ype - " pane r ea s " 
1 . . 247 ~ 

/produc t » " t r yps inogen * 
1. . 15 

/gene="TRYl" 
/note="G00-119-620" 
24. .247 
/gene="TRYl" 
/ pr oduc t="trypBinogen" 
/note-"GOO- 119-620" 
24.. 242 
/ re gi on_name = Tryp_SPc " 

/note= "Trypsin- like serine protease; Many of these are 

synthesized as inactive precursor zymogens that are 

cleaved during limited proteolysis to generate their 

active forms. Alignment contains also inactive enzymes 

that have substitutions of the catalytic...; cd00190'' 

/db_xre f = " CDD : 29152 - 
Site 24 

/site__type= "cleavage" 

/db_xre f = " CDD : 29152 " 
Site order (63, 107, 200) 

/ si te_type=" active" 

/db_xre f « " CDD : 29152 ■ 
Site order (194, 215, 217) 

/ si te_type= " other " 

/note- "substrate binding sites'* 

/ db_x re f »" CDD : 29152 " 
CDS 1..247 

/geneo"TRYl" 

/coded_by-''M22612.1:7. .750" 
/db_xre f « "6DB : GO 0 - 1 1 9 - 6 2 O " 

1 mnplliltfv aaalaapfdd ddkivggync eensvpyqvs Insgyhfcgg slineqwws 
61 aghcyksriq vrlgehniev legneqfina akiirhpqyd rktlnndiml iklssravin 
121 arvstislpt appatgtkcl isgwgntass gadypdelqc Idapvlsqak ceasypgkit 
181 snmfcvgfle ggkdscqgds ggpwcngql qgwswgdgc aqknkpgvyt kvynyv)cwik 
241 ntiaans 



http://www.ncbi.nlm.nih.gov/protein/aaA61231 



4/12/201 



cationic trypsinogen [Hom^^piens] - Protein result 




Page 1 of 



Protein 
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Display Settings: GanPepi 

cationic trypsinogen [Homo sapiens] 

GenBank AAG30943.1 



Goto: 
LOCUS 

DEFINITION 

ACCESSION 

VERSION 

DBSOURCE 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 

TITLE 

JOURNAL 
PUBMED 
REFERENCE 
AUTHORS 

TITLE 
JOURNAL 



AF314534_1 53 aa linear PRI 08-NOV-2000 

cationic trypsinogen [Homo sapiens] . 

AAG30943 

AAG30943.1 GI:H120616 
accession AF314S34 . l 

Homo sapiens (hiiman) 

Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; 
Catarrhini; Hominidae; Homo. 
1 (residues 1 to 53) 

Teich«N.« Ockenga,J., Hof fmeister , A. , Manns, M., Mossner.J. and 
Keim, V. 

Chronic pancreatitis associated with an activation peptide mutation 
that facilitates trypsin activation 
Gastroenterology 119 (2), 461-465 (2000) 

10930381 

1 to 53) 

Hof fmeister, A. , Manns, M., Mo8sner,J. and 



COMMENT 
FEATURES 

source 



2 {residues 
Teich,N., Ockenga,J., 

Keim, V. 

Direct Submission 

Submitted (18-OCT-2000) Medizinische Klinik und Poliklinik ii, 
Universitatsklinikum Leipzig, Philipp-Rosenthal-Str. 27, Leipzig 
04103, Germany 

Method: conceptual translation supplied by author. 
Loca t ion /Qua 1 i f ier s 
1. .53 

/organ isra= "Homo sapiens" 

/db_xre f = " taxon : 9606 " 

/chromosome ■ "7 " 

/map=''7q35* 
Protein <1..>53 

/productB'cationic trypsinogen" 

/ EC_number = " 3.4.21.4 " 

/names "peptidase" 
Region 10. . >S3 

/ region_name = " Tryp^SPc " 

/note="Trypsin-like serine protease; Mauiy of these are 
synthesized as inactive precursor zymogens that are 
cleaved during limited proteolysis to generate their 
active forms. Alignment contains also inactive enzymes 
that have substitutions of the catalytic...; cl00149" 
/db_xre£o " CDD : 153545 " 

Site 10 

/si te_type= " cleavage " 
/db_xre f •» " CDD : 29152 " 

CDS 1..S3 

/gene="TRypi" 
/allele="D22G" 
/coded_by- "AF3 14534 . 1 ; 
/codon start "3 



<102. .>261" 



ORIGIN 



// 



1 aapfdddgki vggynceens vpyqvslnsg yhfcggslin eqwwsaghc yks 



http://www.ncbi.nlm.nih.gov/protein/AAG30943 
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Page 1 of 



Protein 

Translations of Life 



Display Settings: GenPept 

cationic trypsinogen [Homo sapiens] 

GenBank: AAC50728.1 
PA$TA Gfaphtes 



Goto: 
LOCUS 

DEFINITION 
ACCESSION 
VERSION 
OBSOURCB 
KEYWORDS 
SOURCE 

ORGANISM 



PRI 08-OCT-1996 



AAC5072 8 84 aa linear 

cationic trypsinogen [Homo sapiens] . 
AAC50728 

AAC50728,X 01:1616766 
locus HSU70137 accession U70137 . 1 . 

Homo sapiens (hiiman) 

Homo sapiens 

Eiikaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; 
Catarrhini; Hominidae; Homo. 

1 (residues 1 to 84} 

Whitcomb, D.C. , Gorry,M.C., Preston, R. A. , Purey,w., 

Sossenheimer,M. J. , Ulrich.C.D., Martin, S. P., Gates, L,, Amann,S.T., 
Toskes.P.P., Liddle,R., McGrath,K.. Uomo,G., Post, J. C. and 
Ehrlich,G.D. 

Hereditary pancreatitis is caused by a mutation in the cationic 
trypsinogen gene 

Nat. Genet. 14 (2), 141-145 (1996) 

B841182 

2 (residues 1 to 84} 

Whitcorab,D.C. , Gorry,M.C., Preston, R. A. , Furey,W,, 
So8senheimer,M. J. , Ulrich,C.D., Martin, S. P., Gates, 
Amann,S.T., TosJces, P. P. , Liddle,R., McGrath,K., Uomo,6., Post, J. C. 
and Ehrlich,G.D. 
Direct Submission 

Submitted (09-SEP-1996) Pathology, University of Pittsburgh, S-792 
Scaife Hall, Pittsburgh, pa 15261, USA 
Location/Qualifiers 
1..84 

/organism^^Homo sapiens" 

/ db_xre f =" t axon : 9606 " 

/chromosomes "7* 

/map«"7q35" 
Protein 1 . . 84 

/product=" cationic trypsinogen" 
Region <1..>84 

/ r e g i on_name = " T ryp_S P c " 

/note*= "Trypsin- like serine protease; Many of these are 
synthesized as inactive precursor zymogens that are 
cleaved during limited proteolysis to generate their 
active forms. Alignment contains also inactive enzymes 
that have substitutions of the catalytic...; cl00149" 
/db_xre f = " CDD : 153S4S " 
1. .84 

/gene="TRYPl" 



REFERENCE 
AUTHORS 



TITLE 

JOURNAL 
PUBMED 
REFERENCE 
AUTHORS 



TITLE 
JOURNAL 

FEATURES 

source 



CDS 



ORIGIN 



/coded_by-«U70137.1:<186. .>439" 
/codon_start"2 

1 riqvrlgehn ievlegneqf inaakiirhp qydrktlnnd imliklssra vinahvstis 
61 Iptappatgt kclisgwgnt assg 



http://www.ncbi.nlm.nih.gov/protein/AAC50728 
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Protein 

Translations of Life 



Display Settings: GenPept 

trypsinogen [Homo sapiens] 



GenBank: AAA61232.1 



ixx:us 

DEFINITION 

ACCESSION 

VERSION 

DBSOURCE 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
ADTHORS 

TITLE 

JOURNAL 
PUBMED 
COMMENT 
FEATURES 

source 



AAA61232 247 aa 

trypsinogen [Homo sapiens] . 
AAA61232 

AAA61232.1 GI:521218 

locus HUMTRPSGNB accessioH K27602.X 



linear 



PRI 14-JAN'1995 



Homo 

Homo 



sapiens 

sapiens 



(hisman) 



Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Butheria; Euarchontoglires; Primates; Haplorrhini; 
Catarrhini; Hominidae; Homo. 
1 (residues 1 to 24 7) 

Bmi,M., Nakamura,y.« Ogawa,M., Yamamoto,T., Nishide,T,, Mori«T. and 
Matsubara, K. 

Cloning, characterization and nucleotide sequences of two cDNAs 
encoding human pancreatic trypsinogens , 
Gene 41 (2-3), 305-310 (1986) 

3011602 



Protein 

sig peptide 
mat p e ptide 

Region 



ORIGIN 



Method; conceptual translation. 
Location/Qualifiers 
1. .247 

/organisms "Homo sapiens" 
/db_xre f = " t axon : 9606 " 
/ 1 i s sue_type « " pancreas " 

1. .247 

/products "trypsinogen" 
1 . . 15 

/gene=»"TRY2" 
24. .247 
/gene=''TRY2" 
/products" trypsinogen" 
24 . .242 

/region_name= "Tryp_SPc " 
/notes "Trypsin- like serine protease; Many of these are 
synthesized as inaccive precursor zymogens that are 
cleaved during limited proteolysis to generate their 
active forms. Alignment contains also inactive enzymes 
that have siibstitutions of the catalytic...; cd00190" 
/db_xre f = " CDD : 29152 " 
Site 24 

/site_types"cleavage» 
/ db_xre f =" CDD : 29152 " 
Site order (63,107,200) 

/ si te_type" "act ive " 
/db xref="CDD; 29152 * 
Site order (194,215, 217) 

/ si te_type smother" 
/note° "substrate binding sites" 
/ db_xre f s " CDD : 29152 " 
CPS 1..247 

/genes"TRy2» 

/coded_by-"M27€02.1:7. .750" 

1 mnllliltfv aaavaapfdd ddkivggyic eensvpygvs Insgyhfcgg sliseqwws 

61 aghcyksriq vrlgehniev legneqf ina akiirhpkyn srtldndill iklsspavin 

121 srvsaislpt appaagtesl isgwgntlss gadypdelqc Idapvlsqae ceasypgkit 

181 nnmfcvgfle ggkdscqgds ggpwsngel qgivswgygc aqknrpgvyt kvynyvdwik 
241 dtiaans 



// 



http://www.ncbi.nlm.nih.gov/protein/AAA6 1 232 
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w 




P00775 (TRYP^STRGR)* Reviewed, UniProtKB/Swiss-Prot 

Last modified January 1 1 , 201 1. Version 80. 



Names and origin 



Protein names 


Recommended name: 

Trypsin 
EC=3.4,21.4 
Altemative name(s): 

SGT 


Gene names 


Name:sprT 


Organism 


Streptomyces griseus 


Taxonomic identifier 


1911 [NCBI] 


Taxonomic lineage 

• 


Bacteria > Actinobacteria > Actinobacteridae > Actinomycetales > 
Streptomycineae > Streptomycetaceae > Streptomyces 



Protein attributes 



Sequence length 


259 AA. 


Sequence status 


Complete. 


Sequence processing 


The displayed sequence is further processed into a mature 




form. 

■ 


Protein existence 


Evidence at protein level. 



General annotation (Comments) 



Catalytic activity 


Preferential cleavage: Arg-|-Xaa, Lys-|-Xaa. 


Sequence similarities 


Belongs to the peptidase S1 family. 
Contains 1 peptidase S1 domain. 



Ontologies 



Keywords 



Domain 


Signal 


Molecular function 


Hydrolase 




Protease 


j 


Serine protease 


PTM 


Disulfide bond 




Zymogen 


Technical temi 


3D-structure 


1 


Direct protein sequencing 



Gene Ontology (GO) 

Biological process . j proteolysis 

Inferred from electronic annotation. Source: InterPro 



http://www.uniprot.org/uniprot/P00775 
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Molecular function 



serine-type endopeptidase activity 

Inferred from electronic annotation. Source: InterPro 



Complete GO annotation. 



Sequence annotation (Features) 



Feature key 



Molecule processing 



Position(s) j Length Description | Graphical view 



Feature identifipr 



El 


Signal peptide 


1 -32 


32 




1 




Propeptide 


33-36 


4 


Activation 
peptide 


II 


11 


Chain 


37 - 259 


223 


Trypsin 


1 



PRO 000002830 



PRO_000002^0 



Regions 

Domain 



37 - 257 



221 



Peptidase 
S1 



Sites 



□ 


Active site 


73 


1 


Charge 

relay 

system 


1 ] 


□ 


Active site 


118 


1 


Charge 

relay 

system 


1 I 




Active site 


208 


1 


Charge 

relay 

system 


4 

t 
1 

i- 


□ 


Site 


202 


1 


Required 
for 

specificity 


1 i 

« 
1 
1 



Amino acid modifications 








□ 


Disulfide bond 


58^ 74 






n 








o 


Disulfide bond 


177 192 












□ 


Disulfide bond 


204 ^233 






1 1 







Experimental info 



Sequence conflict 



95-96 



Missing AA 
seque nce 

Ref.2 I 



1 



http://v^ww.uniprot.org/uniprot/P00775 
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Secondary structure 



1 . 



* * • « 



.. 26'. 



I Helix □ Strand | Turn 



Details... 



Sequences 



Sequence 



Length Mass (Da) 



O P00775 [UniParc]. FASTA 259 

Last modified February 1, 1994. Version 2. 
Checksum: 050233AFF1F64823 



,10 20 30 40 50 60 

' MKHFLRALKR CSVAVATVAI AWGLQPVTA SAAPNPWGG TRAAQGEFPF MVRLSMGCGG 

i 70 80 90 100 110 120 

• AliYAQDIVLT AAHCVSGSGN NTSITATGGV VDIiQSSSAVK VRSTKVLQAP GYNGTGKDWA 



26,776 



130 140 150 160 170 

LIKIiAQPINQ PTLKIATTTA YNQGTFTVAG WGANREGGSQ QRYLLKANVP 



190 

YGNELVANEE 
250 

VYTEVSTFAS 



200 

ICAGYPDTGG 



AIASAARTTi 



210 

VDTCQGDSGG 



220 230 
PMFRKDNADE WIQVGIVSWG 



18£ 

FVSDAACRSA 

240 

YGCARPGYPG 



« Hide 



References 



[1] 



[2] 



"Molecular cloning and nucleotide sequence of Streptomyces griseus trypsin gene. 

Kim J.C., Cha S.H., Jeong ST., Oh S.K., Byun S.M. 

Biochem. Biophys. Res. Commun. 181:707-713(1991) [PubMed: 1755852] [Abstract] 
Cited for : NUCLEOTIDE SEQUENCE [GENOMIC DNA], 
Strain: ATCC 10137 / IFO 3430 / NCIB 8232 / NCTC 6961 . 



[3] 



"Amino acid sequence of Streptomyces griseus trypsin. Cyanogen bromide fragments 
and complete sequence." 

Olafson R.W., Jurasek L., Carpenter M.R., Smillie L.B. 
Biochemistry 14:1168-1177(1975) [PubMed: 804314] [Abstract] 
Cited for. PROTEIN SEQUENCE OF 37-259. 



"Refined crystal structure of Streptomyces griseus trypsin at 1.7-A resolution. 

Read RJ., James M.N.G. 

J. Mol. Biol. 200:523-551(1988) [PubMed: 3135412] [Abstract] 

Cited for: X-RAY CRYSTALLOGRAPHY (1 .8 ANGSTROMS). 

Additional computationally mapped references. 



http://www.uniprot.org/uniprot/P00775 
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Cross-references 


Sequence databases 




@ EMBL 


M64471 Genomic DNA. Translation: AAA26820.1. Sequence 


® GenBank 


problems. 


® DDBJ 




PIR 


TRSMG. JQ1302. 



3D structure databases 

@PDBe 

® RCSB PDB 

® PDBj 



Entry Method 



10S8 
10SS 
1SGT 
2FMJ 
3BEU 
3177 
3178 



X-ray 
X-ray 
X-ray 
X-ray 
X-ray 
X-ray 
X-ray 



Resolution 
(A) 

1.55 
1.93 
1.70 
1.65 
1.05 
2.10 
3.00 



Chain Positions PDBsum 



A 
A 
A 
A 

A/B 
A 
A 



37-259 
37-259 
37-259 
37-251 
37-259 
37-259 
37-259 



[»] 
[»] 

[»: 

[»] 
[»] 
[»] 
[»1 



P rotei n Model Portal 


P00775. 


SMR 


P00775. Positions 37-259. 


Mod Base 


Search... 


Protein family/group databases 




MEROPS 


SOI. 101. 


Enzyme and pathway databases 




BRENDA 


3.4.21.4. 1270. 


Family and domain databases 




InterPro 


IPR009003. Pept cys/ser Trypsin-llke. 
IPR0181 14. Peptidase S1/S6 AS. 
IPR001254. Peptidase SI S6. 
IPR001314. Peptidase_S1A. 
[Graphical view] 


Pfam 


PF00089. Trypsin. 1 hit. 

[Graphical view] 


PRINTS 


PR00722. CHYMOTRYPSIN. 


SMART 


SM00020. Tryp^SPc. 1 hit. 

[Graphical view] 


SUPFAM 


SSF50494. Pept_Ser_Gys. 1 hit. 


PROSITE 


PS50240. TRYPSIN DOM. 1 hit. 
PS001 34. TRYPSIN HIS. 1 hit. 
PS00135. TRYPSIN_SER. 1 hit. 
[Graphical view] 


ProtoNet 


Search... 



http://www.uniprot.org/uniprot/P00775 
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Entry information 


Entry name 


TRYP_STRGR 


Accession 


Primary (citable) accession number: P00775 


cniry nisiory 

• 


Integrated into July 21, 1986 

UniProtKB/Swiss- 

Prot: 

Last sequence reoruary 1, 1^^4 
update: 

Last modified: January 1 1 , 201 1 

This is version 80 of the entry and version 2 of the sequence. 
[Complete history] 


Entry status 


Reviewed (UniProtKB/Swiss-Prot) 


Annotation program 


Prokaryotic Protein Annotation Program 



Relevant documents 



PDB cross-references 

Index of Protein Data Bank (PDB) cross-references 

Peptidase families 

Classification of peptidase families and list of entries 

SIMILARITY comments 

Index of protein domains and families 



©2002-2011 UniProt Consortium | License & Disclaimer | Contact 




SiB 



http://www.uniprot.org/uniprot/P00775 
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P35049 (TRYP_FUSOX)* Reviewed. UniProtKB/Swiss-Prot 

Last modified January 1 1 , 201 1 . Version 78. 



Names and origin 


Protein names 

i 


Recommended name: 
Trypsin 
EC=3.4.21.4 


wrganism 


F^iicarSiim /%vucn/^f*tinri /Panskmsi riic^skco fimnii^) 
1 tioariUiII vAyS>|JUrUlll |r^ailoliiicl Uldlfadc iUliyiJ^/ 


Taxonomic laentitier 




Taxonomic lineage 


Eukaryota > Fungi > Dikarya > Ascomycota > Pezizomycotina > 
Sordariomycetes > Hypocreomycetidae > Hypocreales > 
mitosporic Hypocreales > Fusarium > Fusarium oxysporum 
species complex 


Protein attributes 




Sequence length 


248 AA. 


Sequence status 


Complete. 


Sequence processing 


The displayed sequence is further processed into a mature 
form. 


Protein existence 


Evidence at protein level. 


General annotation (Comments) 


Catalytic activity 


Preferential cleavage: Arg-|-Xaa, Lys-|-Xaa. 


Subcellular location 


Secreted. 


Sequence similarities 


Belongs to the peptidase S1 family. 
Contains 1 peptidase S1 domain. 


Ontologies 




Keywords 




Cellular component 


Secreted 


Domain 


Signal 


Molecular function 


Hydrolase 
Protease 
Serine protease 


PTM 


Disulfide bond 
Zymogen 


Technical term 


3D-structure 


Gene Ontology (GO) 




Biological process 


1 proteolysis 



http://www.uniprot.org/uniprot/P35049 
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! Inferred from electronic annotation. Source: InterPro 



Cellular component 



; extracellular region 
Inferred from electronic annotation. Source: UniProtKB-SubCell 



Molecular function 



serine-type endopeptidase activity 

Inferred from electronic annotation. Source: InterPro 



Complete GO annotation... 



Sequence annotation (Features) 



Feature key Position(s) Length 



Description Graphical view 



Feature identifier 



Molecule processing 
Signal peptide 



.1 



1 -17 i 



□ Propeptide 



I 



18-25 



17 
8 



Potential 1 



i 



Activation 
peptide 



II 




! Chain 



26-248 I 223 Trypsin 



PRO 0000028302 



Regions 

Domain 



25 - 248 



224 i Peptidase 
i S1 



Sites 



□ 



Active site 



65 



1 



Charge 

relay 

system 



Active site 



108 



1 j Charge 
I relay 
I system 



Active site 



204 



1 i Charge 
I relay 
system 



Site 



198 



1 I Required 
for 

specificity 



Amino acid modifications 





Disulfide bond 


50 ^ 66 1 




Disulfide bond 


174^ 189 ! 

1 (- 


□ j Disulfide bond 


200 ^ 225 1 

1 



II 



II 



I I 



Secondary structure 
1 



248 



http://www.uniprot.org/uniprot/P35049 



4/12/2011 



Trypsin precursor - FusariuB^xysporum (Panama disease fungus) Page 3 of 5 





I Helix I Strand | Turn 

Details... 

Sequences 



Sequence Length Mass (Da) 

P35049 [UniParc], FASTA 248 24.576 

Last modified February 1, 1994, Version 1. 
Checksum: 1A0EBA88C3E70294 



j 1£ 20 30 40 50 60 j 

MVKFASWAL VAPLAAAAPQ EIPNIV6GTS ASAGDFPFIV SISRN6GPWC GGSLLNANTV ! 

70 80 90 100 110 120 ! 

LTAAHCVSGY AQSGFQIRAG SLSRTSGGIT SSLSSVRVHP SYSGNNNDLA ILKLSTSIPS | 

\ 

130 140 150 160 170 180 1 
GGNIGYARLA. ASGSDPVAGS SATVAGWGAT SEGGSSTPVN LLKVTVPIVS RATCRAQYGT 



i 19^ 200 210 220 230 240^ 

! SAITNQMFCA GVSSGGKDSC QGDSGGPIVD SSNTLIGAVS WGNGCARPNY SGVYASVGAL 

} 

} RSFIDTYA 

« Hide 



References 



[1] i "The sequence and X-ray structure of the trypsin from Fusarium oxysporum." 

Rypniewski W.R.. Hastrup S.. Betzel C, Dauter M., Dauter Z , Papendorf G., Branner S., 
Wilson K.S. 

Protein Eng. 6:341-348(1993) [PubMed: 8332590] [Abstract] 

Cited for : NUCLEOTIDE SEQUENCE [MRNA]. X-RAY CRYSTALLOGRAPHY (1.8 
ANGSTROMS), 



[2] 



"Structure of inhibited trypsin from Fusarium oxysporum at 1.55 A." 

Rypniewski W.R., Dambmann C, von der Osten C, Dauter M., Wilson K.S. 
Acta Crystallogr. D 51:73-84(1995) [PubMed: 15299338] [Abstract] 
Cited for : X-RAY CRYSTALLOGRAPHY (1 .55 ANGSTROMS). 



Additional computationally mapped references. 



Cross-references 



Sequence databases 



® EMBL 
® GenBank 
® DDBJ 



S63827 mRNA. Translation: AAB27568.1 



http://www.uniprot.org/uniprot/P35049 
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3D structure databases 



«) PDBe 

© RCSB PDB 

® PDBj 



Entry Method 



1FN8 

1FY4 

1FY5 

1GDN 

1GDQ 

1GDU 

1PPZ 

1PQ5 

1PQ7 

1PQ8 

1PQA 

1TRY 

1XVM 

1XVO 

2G51 

2G52 

2VU8 



X-ray 
X-ray 
X-ray 
X-ray 
X-ray 
X-ray 
X-ray 
X-ray 
X-ray 
X-ray 
X-ray 
X-ray 
X-ray 
X-ray 
X-ray 
X-ray 
X-ray 



Resolution 
(A) 
0.81 
0,81 
0.81 
0.81 
0.93 
1.07 
1.23 
0.85 
0.80 
1.00 
1.23 
1.55 
1.10 
0.84 
1.84 
1.84 
1.80 



Chain Positions PDBsum 



A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
E 



25-248 
25-248 
25-248 
25-248 
25-248 
25-248 
25-248 
25-248 
25-248 
25-248 
25-248 
25-248 
25-248 
25-248 
25-248 
25-248 
25-248 



[»] 
»] 

:»] 

[»] 

[»] 

» 

» 
» 

[»] 

[»] 
[»] 
[»] 

» 
» 



ProteinModelPortal 


P35049. 


O h /ID 

oMR 


; P35049. Positions 25-248. 


ModBase 


Search... 


Protein family/group databases 




MEROPS 


S01.103. 


Enzyme and pathway databases 




BRENDA 


3.4.21.4. 15244. 


Family and domain databases 




InterPro 


IPR009003. Pept cys/ser Trypsin-like. 
IPR01 81 14. Peptidase S1/S6 AS. 
IPR001254. Peptidase SI SB. 
IPR001314, Peptidase_S1A. 
[Graphical view] 


Pfam 


PF00089. Trypsin. 1 hit. 
[Graphical view] 


PRINTS 


PR00722. CHYMOTRYPSIN. 


SMART 


SM00020. Tryp_SPc. 1 hit. 

[Graphical view] 


SUPFAM 

< 


SSF50494. Pept_Ser_Cys. 1 hit. 



PROSITE 



ProtoNet 



Entry information 



PS50240. TRYPSIN_DOM. 1 hit. 
PS00134. TRYPSIN_HIS. 1 hit. 
PS00135. TRYPSIN^SER. 1 hit. 
[Graphical view] 

Search... 



Entry name 



TRYP FUSOX 



http://www.uniprot.org/uniprot/P35049 



4/12/2011 



Trypsin precursor - Fusariuc^xysporum (Panama disease fungus) 



w 




Page 5 of 5 



Accession 


Primary (citable) accession number: P35049 


Entry history 


Integrated into February 1, 1994 

UniProtKB/Swiss- 

Prot: 

Last sequence February 1 , 1 994 
update: 

Last modified: January 1 1 , 201 1 

1 nis IS version /o ot me entry a no version i or me sequence. 
[Complete history] 


Entry status 


Reviewed (UniProtKB/Swiss-Prot) 


Annotation program 


Fungal Protein Annotation Program 


Relevant documents 




PDB cross-references 

Index of Protein Data Bank (PDB) cross-references 


Peptidase families 

Classification of peptidase families and list of entries 


SIMILARITY comments 



Index of protein domains and families 



©2002-2011 UniProt Consortium | License & Disclaimer | Contact 



EMBL-EBI 



•I* •»* * 




SIB 



http://www.uniprot.org/uniprot/P35049 
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Q99895 (CTRC_HUMAN)# Reviewed, UniProtKB/Swiss-Prot 

Last modified April 6, 2011. Version 98. 



Names and origin 


Protein names 


! 

1 Recommended name: 

i 

j Chymotrypsin-C 

1 EC=3.4.21.2 

Alternative name(s): 
1 Caldecrin 

! 


Gene names 


t" . - - 

( 

j Name: CTRC 
1 Synonyms:CLCR 

X 


Organism 


\ Homo sapiens (Human) [Complete proteomel 


Taxonomic identifier 


\ 9606 [NCBI] 


Taxonomic lineage 


1 Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > 
I Euteleostomi > Mammalia > Eutheria > Euarchontoglires > 
Pnmates > Haplorrhinj > Catarrhini > Hominidae > Homo 


Protein attributes 




Sequence length 


268 AA. 


Sequence status 


Complete. 


Sequence processing 


The displayed sequence is further processed into a mature 
form. 


Protein existence 


Evidence at protein level. 


General annotation (Comments) 


Function { Has chymotrypsin-type protease activity and hypocalcemic 

1 activity. 


Catalytic activity 

3 
J 


Preferential cleavage: Leu-|-Xaa, Tyr-|-Xaa, Phe-|-Xaa. Met-|- 
Xaa, Trp-|-Xaa, Gln-|-Xaa. Asn-|-Xaa. 


H „ - ^.....^.....i^.^....^^,..,,^^ 

Tissue specificity 


Pancreas. 


Involvement in disease { 

) 

1 

! 


Variations in CTRC influence susceptibility to chronic 
pancreatitis [MIM:1 67800]. Chronic pancreatitis is a persistent 
inflammatory disorder characterized by permanent destruction 
of the pancreatic parenchyma. 


Sequence similarities 1 


Belongs to the peptidase 81 family. Elastase subfamily. 
Contains 1 peptidase 81 domain. 


Ontologies 




Keywords 




Coding sequence diversity j 


Polymorphism 


Domain i 

f 


Signal 
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P17538 (CTRB1_HUMAN)# Reviewed, UniProtKB/Swiss-Prot 

Last modified April 5, 2011. Version 112. 



Names and origin 



Protein names 


Recommended name: 
Ciiymotrypsinogen B 
EC=3.4.21.1 

Cleaved into the followina 3 chains: 

1 . Chymotrypsin B chain A 

2. Chymotrypsin B chain B 

3. Chymopypsin B chain C 


Gene names 


Name: CTRB1 
Synonyms: CTRB 


Organism 


Homo sapiens (Human) [Complete proteome] 


Taxonomic identifier 


9606 [NCBI] 


Taxonomic lineage 


Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > 
Euteleostomi > Mammalia > Eutheria > Euarchontoglires > 
Primates > Haplorrhini > Catarrhini > Hominidae > Homo 


Protein attributes 




Sequence length 


263 AA. 


Sequence status 


Complete. 


Sequence processing 


The displayed sequence is further processed into a mature 
form. 


Protein existence 


Evidence at transcript level. 



General annotation (Comments) 



Catalytic activity 



Preferential cleavage: Tyr-|-Xaa, Trp-|-Xaa, Phe-|-Xaa. Leu-| 
Xaa. 



Subcellular location 



Secreted > extracellular space. 



Sequence similarities 



Belongs to the peptidase SI family. 
Contains 1 peptidase SI domain. 



Ontologies 



Keywords 

Biological process 



; Digestion 



Cellular component 



Coding sequence diversity 



Secreted 



Polymorphism 



Domain 



; Signal 
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Molecular function 


■ ■ ■ i 

Hydrolase 
Protease 
Serine protease 


PTM 


Disulfide bond 
Zymogen 


Technical temn 

Gene Ontology (GO) 

Biological process 


Complete proteome 

digestion 

Inferred from electronic annotation. Source: UniProtKB-KW 
proteolysis' 

Non-traceable author statement. Source; UniProtKB 

■ 




Cellular component 


extracellular space 

Inferred from electronic annotation. Source: UniProtKB-SubCell 


Molecular function 


serine-type endopeptidase activity 

Non-traceable author statement LR§LU. Source: UniProtKB 



Complete GO annotation... 



Sequence annotation (Features) 



Feature key Position(s) Length I Description 



Molecule processing 



Graphical view 



1 Feature ider 





Signal peptide 


1 - 18 


18 




m 


Chain 


19-263 


245 


Chymotrypsinogen 
B 


m 


Chain 


19-31 


13 


Chymotrypsin B 
chain A 




Chain 


34-164 


131 


Chymotrypsin B 
chain B 

...I,.!,. 


n 


Chain 


167-263 


97 


Chymotrypsin B 
chain C 


Regions 








IS 


Domain 


34 - 261 


228 


Peptidase SI j 



PRO OOOOC 



II 



PRO OOOOC 



PRO OOOOC 



I 



PRO OOOOC 



Sites 



□ 



Active site 



75 



Charge relay 

system t By similarity 1 



□ 



□ 



Active site 



120 



Charge relay 

system I By simi!arit7 



Active site 



213 



1 



Charge relay 



system I By similarity \ 
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Amino acid modifications 



□ 


Disulfide bond 


19 ^ 140 




1 By similarity 


1 1 

» 






Disulfide bond 


60 ^ 76 




1 By similarity 


II 1 






Disulfide bond 


154 ^ 219 




1 By simliarity | 


1 1 1 




□ 


Disulfide bond 


186^200 




By similarity 


II 1 






Disulfide bond 


209 238 




1 By similarity i 


1 1 j 





Natural variations 

Natural variant 

Natural variant 

t 
t 



I 



222 



250 



r 



D-> H. 

[dbSNP:rs8061550] I 



1 i T ^ A. 

[dbSNP:rs4737] 



VAR 05715 



I VAR 01456 



Sequences 



Sequence 



Length Mass (Da) 



P17538 [UniParc]. 

Last modified August 1, 1990. Version 1 
Checksum: 4C1C055A490B8701 



FASTA 



263 



27.870 



: 10 20 30 40 50 60^ 

. MAFLWLLSCW ALLGTTFGCG VPAIHPVLSG LSRIVNGEDA VPGSWPWQVS LQDKTGFHFC 

70 80 90 100 11£ 12£ 

GGSLISEDWV- VTAAHCGVRT SDVWAGEFD QGSDEENIQV LKIAKVFKNP KFSILTVNND 

I 130 140 150 160 170 180 

I ITLLKLATPA RFSQTVSAVC LPSADDDFPA GTLCATTGWG KTKYNANKTP DKIiQQ7\ALPL 

190 200 210 220 230 240 

1 LSNAECKKSW GRRITDVMIC AGASGVSSCM GDSGGPLVCQ KDGAWTLVGI VSWGSDTCST 

250 260 
SSPGVYARVT KLIPWVQKIL AAN 



« Hide 



References 





« Hide 'large scale' references 


[1] 


"IVIoiecular cloning and nucleotide sequence of human pancreatic prechymotrypsinogen 
cDNA." 

Tomita N.. Izumoto Y.. Horii A., Doi S., Yokouchi H., Ogawa IVI,, Mori T., Matsubara K. 
Biochem. Biophys. Res. Commun. 158:569-575(1989) [PubMed: 2917002] [Abstract] 
' Cited for: NUCLEOTIDE SEQUENCE [MRNA]. 
Tissue: Pancreas. 


[2] 


"Cloning of human full-length CDSs in BD Creator(TM) system donor vector." 

Kalnine N., Chen X., Rolfs A.. Halleck A.. Hines L., Eisenstein S.. Koundlnya M.. Raphael J., 
Moreira D., Kelley T., LaBaer J,, Lin Y.. Phelan M., Farmer A. 
Submitted (OCT-2004) to the EMBUGenBank/DDBJ databases 
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA]. 
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[3] 



"The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian 
Gene Collection (MGC)." 

The MGC Project Team 

Genome Res. 14:2121-2127(2004) [PubMed: 16489334] [Abstract] 
Cited for : NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA]. 
Tissue: Pancreas. 



+ i Additional computationally mapped references. 



Web resources 



Wikipedia 

Chymotrypsin entry 




C ross -ref e re nces 




Sequence databases 




@ EMBL 
©GenBank 
m DDBJ 


M24400 mRNA. Translation: AAA52128.1. 
BT007356 mRNA, Translation: AAP36020.1. 
BC005385 mRNA. Translation: AAH05385.1. 


IPI 


IPi00015133. 


PIR 


A31299. 


UniGene 


Hs.610926. 


3D structure databases 




ProteinModelPortal 


P17538. 


SMR 


P17538. Positions 19-263. 


Mod Base 


Search... 


Protein-protein interaction databases 


STRING 


PI 7538. 


PTM databases 




PhosphoSite 


PI 7538. 


Proteomic databases 




PRIDE 


P17538. 


Genome annotation databases 




EnsembI 


ENST00000361017; ENSP00000354294; ENSG000001 68925. 


Organism-specific databases 




GeneCards 


GC16P061006, 


H-lnvDB 


HIX0013242. 


HGNC 


HGNC:2521, CTRB1. 
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MIM 


i 118890 aene 




neXtProt 


1 NX_P17538. 




GenAttas 


[ Search... 




Phylogenomic databases 






eggNOG 


prNOG07094. 




GeneTree 


ENSGT0055000007411 7. 


HOGENOM 


HBG755338. 


HOVERGEN 


HBG013304. . 


InParanoid 


P 17538. 




OrthoDB 


EOG4VMFG1. 




Enzyme and pathway databases 






BRENDA 


3.4.21.1. 247. 




Gene expression databases 






ArrayExpress 


PI 7538. 






HS_CTRB1. 


Genevestigator 


P 17538. 


GermOnline 


ENSG00000168928. Homo sapiens. 




Family and domain databases 






InterPro 


IPR009003. Pept cys/ser Trypsin-like. 
IPR0181 14. Peptidase S1/S6 AS. 
1PR001254. Peptidase SI S6. 
IPR001314. Peptidase_S1A. 
[Graphical view] 




Pfam 


PF00089, Trypsin. 1 hit. 

[Graphical view] 


PRINTS 


PR00722. CHYMOTRYPSIN. 


SMART 


SM00020. Tryp^SPc, 1 hit. 

[Graphical view] 


SUPFAM 


SSF50494. Pept_Ser_Gys. 1 hit. 


PROSITE 


PS50240. TRYPSIN DOM. 1 hit. 
PS00134. TRYPSIN HIS. 1 hit. 
PS00135. TRYPSIN_SER. 1 hit. 
[Graphical view] 


ProtoNet 


Search... 




Otiier Resources 






SOURCE 


Search... 





Entry information 



Entry name 
Accession 



I^CTRB INHUMAN 



jPrimary (citable) accession number: P17538 
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Entry history 



i Integrated into August 1, 1990 



! UniProtKB/Swiss- 
Prot: 

Last sequence August 1 , 1990 
update: 

Last modified: April 5, 201 1 

This is version 1 12 of the entry and version 1. of the sequence. 
[Complete history] 



Entry status 



Reviewed (UniProtKB/Swiss-Prot) 



Annotation program 



Chordata Protein Annotation Program 



Disclaimer 



Any medical or genetic information present in this entry is 
provided for research, educational and informational purposes 
only. It is not in any way intended to be used as a substitute for 
I professional medical advice, diagnosis, treatment or care. 



Relevant documents 



Human chromosome 16 

Human chromosome 16: entries, gene names and cross-references to MIM 

Human entries with polymorphisms or disease mutations 

List of human entries with polymorphisms or disease mutations 

Human polymorphisms and disease mutations 

Index of human polymorphisms and disease mutations 

MIM cross-references 

Online Mendellan Inheritance in Man (MIM) cross-references in UniProtKB/Swiss-Prot 

Peptidase families 

Classification of peptidase families and list of entries 

SIMILARITY comments 

Index of protein domains and families 



© 2002-201 1 UniProt Consortium | License & Disclaimer | Contact 
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Protein 

Translations of Life 

Display Settings: Gerrf^ept 

preprochymotrypsinogen (EC 3.4.21.1) [Homo sapiens] 

GenBank: AAA52128.1 
FASTA Gfaphtes 



Goto: 
LOCOS 

DEFINITION 

ACCESSION 

VERSION 

DBSOURCB 

KEYWORDS 

SOURCE 

ORGANISM 



AAA52128 263 aa 

pr eprochymot ryps inogen (EC 3.4.21.1} 

AAA52128 

AAA52128.1 GI: 181190 

locus HUMCTRP accession M244 00.1 



linear 
[Homo sapiens] 



PRI Ol-NOV-1994 



Homo 

Homo 



sapxens 

sapi ens 



(humcui) 



[1] kindly submitted 



Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Buteleostomi; 
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini ; 
Catarrhini; Hominidae; Homo. 
1 (residues 1 to 263) 

Tomita.N., IzumotOfY. , Horii,A. , Doi,S., Yokouchi.H., Ogawa,M., 
Mor i , T . and Mat subar a , K . 

Molecular cloning and nucleotide sequence of human pancreatic 
prechymotrypsinogen cDNA 

Biochem. Biophys. Res. Commun. 158 (2) « 569-575 (1989) 

2917002 

Draft entry and printed copy of sequence for 
by N.TOmita, 07-AUG-1989. 
Method: conceptual translation. 
Location/Qiialif iers 
1. .263 

/organism" "Homo sapiens** 
/ db_xre f =" t axon : 9606 " 
/map="16q23-q24 .1" 
Protein 1..263 

/names "preprochymotrypsinogen (EC 3.4.21.1)** 
1. . 18 

/note»"chymotrypsinogen signal peptide" 
34. .263 



REFERENCE 
AUTHORS 

TITLE 

JOURNAL 
PUBMED 
COMMENT 



FEATURES 

source 



sig peptide 
mat peptide 

Region 



ORIGIN 



// 



/product =" chymotrypsinogen" 
34. .259 

/region_name = "Tryp_SPc " 

/note= "Trypsin- like serine protease; Many of these are 
synthesized as inactive precursor zymogens that are 
cleaved during limited proteolysis to generate their 
active forms. Alignment contains also inactive enzymes 
that have substitutions of the catalytic...; cd00190" 

/ db_xre f =" CDD : 21152 " 
Site 34 

/ si t e_type = " cl eavage " 

/db_xre f = " CDD : 291S2 " 
Site order (75, 120,213) 

/ si te_tyE>e='' active" 

/db_xre f = " CDD : 29 152 " 
Site order(207,232/23"4) ~ 

/site_type=" other" 

/note= "substrate binding sites" 

/db xref°"CDD; 29152 " 
CDS 1..263 

/gene = "CTRB'' 

/coded_by="M24400.1;17. .808" 
/db_xre f - "GDB : GOO - 1 19 - 82 0 " 

1 maflwllscw allgttfgcg vpaihpvlsg Isrivngeda vpgswpwqvs Iqdktgfhfc 
61 ggslisedwv vtaahcgvrt sdvwagefd qgsdeeniqv Ikiakvfknp kfsiltvnnd 
121 itllklatpa rfsqtvsavc Ipsadddfpa gtlcattgwg ktkynwktp dklqqaalpl 
181 Isnaeckksw grritdvmic agasgvsscm gdsggplvcq kdgawtlvgi vewgsdtcst 
241 sspgvyarvt klipwvqkil aan 



http://www.ncbi.nlni.nih.gov/protein/AAA52 128 
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Protein 

Translations of Life 



Display Settings: GenPept 

chymotrypsinogen B1 [Homo sapiens] 

GenBanK: AAP36020.1 



LOCUS 

DEFINITION 

ACCESSION 

VERSION 

DBSOURCE 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 



TITLE 

JOURNAL 
REFERENCE 
AUTHORS 



TITLE 
JOURNAL 

COMMENT 
FEATURES 

source 



Protein 



Region 



Sice 



Site 



Site 



COS 



ORIGIN 



AAP36020 263 aa linear PRI 13-MAY-2003 

chymotrypsinogen Bl [Homo sapiens) . 

AAP36020 

AAP36020.1 GI:30583SS1 
accession BT007356 . 1 

Homo sapiens (human) 

Homo sapiens 

E\2karyota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; 
Catarrhini; Hominidae; Homo. 

1 (residues 1 to 263) 

Kalnine.N., Chen.X., Rolfs, A., Halleck,A., Hines,L., Eisenstein, S. , 
Koundinya,M. , Raphael^J., Moreira.D., Kelley,T., LetBaer^j., Lin,Y., 
Phelan,M. and Farmer, A. 

Cloning of human full-length CDSs in BD Creator (TM) System Donor 

vector 

Unpublished 

2 (residues l to 263 > 

Kalnine,N., Chen^X., Rolfs, A., Halleck.A., Hines.L., Eisenstein, S. , 
Koundinya , M . , Raphael, J., Moreira,D., KeIley,T., LaBaer^J., Lin,Y., 
Phelan.M. and Fanner, A. 
Direct Submission 

Submitted (13-MAY-2003) BD Biosciences Clontech, 1020 Bast Meadow 

Circle, Palo Alto, CA 94 3 03, USA 

Method: conceptual translation supplied by author, 
liocat ion/Qual i £ iers 

1. ;263 

/organisms "Homo sapiens** 
/db^xre f = " taxon : 9606 " 
/clone="GH00216X1.0'' 

/clone^liba^BD Creator (TM) CDS Library derived from MGC 
collection** 

/lab_host«'*DH5alpha Tl resistant" 
/notea ** Vector : pDNR-Dual" 
1. .263 

/product-** chymotrypsinogen Bl" 
34. .259 

/region_^name = "Tryp_SPc " 

/note= "Trypsin- like serine protease; Many of these are 
synthesized as inactive precursor zymogens that are 
cleaved during limited proteolysis to generate their 
active forms. Alignment contains also inactive enzymes 
that have sxibsti tut ions of the catalytic...; cd00l90" 
/db xref»"CDD; 29152 " 
34 

/site_type= " cleavage" 
/ db_xre f -*• CDD : 29152 " 
order (75, 120, 213) 
/ Si te_type= "active" 
/db_xref-"CDD! 2 9152 " 
order (207, 23 2, 234) 
/si te_typeo " other " 
/noteo" substrate binding sites** 
/db_xref - "CDD : 29152 " 
1. .263 

/coded_by= "8100735 6. 1:1. .792" 



// 



1 maflwllscw allgttfgcg vpaihpvlsg Isrivngeda vpgswpwqvs Iqdktgfhfc 

61 ggslisedw vtaahcgvrt sdvwagefd qgsdeeniqv Ikiakvfknp kfsiltvnnd 

121 itllklatpa rfsqtvsavc Ipsadddfpa gtlcattgwg ktkynanktp dklqqaalpl 

181 Isnaeckksw grritdvmic agasgvsscm gdsggplvcq kdgawtlvgi vswgsdtcst 

241 sspgvyarvt klipwvqkil aan 
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P07338(CTRB1_RAT)# Reviewed, UniProtKB/Swiss-Prot 

Last modified April 5, 201 1 . Version 97. 



Names and origin 



Protein names 


Recommended name: 
Chymotrypsinogen B 

EC=3.4.21.1 

Cleaved into the followlna 3 chains: 

1 . Chymotrypsin B chain A 

2. Chymotrypsin B chain B 

3. Chymotrypsin B chain C 


Gene names 


Name: Ctrbl 
SynonymsiCtrb 


• 

Organism 


Rattus norvegicus (Rat) 


Taxonomic identifier 


10116 [NCBI] 


Taxonomic lineage 


Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > 
Euteleostomi > Mammalia > Eutheria > Euarchontoglires > Glires 
> Rodentia > Sciurognatni > Muroidea > Muridae > Munnae > 
Rattus 


Protein attributes 




Sequence length 


263 AA. 


Sequence status 


Complete. 


Sequence processing 


The displayed sequence is further processed into a mature 

form. 


Protein existence 


Evidence at protein level. 


General annotation (Comments) 


Catalytic activity 


Preferential cleavage: Tyr-|-Xaa, Trp-|-Xaa. Phe-|-Xaa, Leu-|- 
Xaa. 


, , 

Subcellular location 


Secreted > extracellular space. 


Sequence similarities 

i 

1 


Belongs to the peptidase S1 family. 
Contains 1 peptidase S1 domain. 



Ontologies 



Keywords 




Biological process 


Digestion 


Cellular component 


Secreted 


Domain 


Signal 



http://v^ww.uniprot,org/uniprot/P07338 4/1 2/201 1 
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Molecular function 



PTM 



Technical tenn 

Gene Ontology (GO) 

Biological process 



I Hydrolase 



i 



Protease 



i Serine protease 



Disulfide bond 
j Zymogen 



I 3D-structure 



digestion 

Inferred from electronic annotation. Source: UniProtKB-KW 

positive regulation of apoptosis 

Inferred from mutant phenotype. Source: ROD 

protein catabolic process 

Inferred from direct assay. Source: RGD 

response to cytokine stimulus 

Inferred from direct assay. Source: RGD 

response to food 

Inferred from expression pattern. Source: RGD 
response to nutrient 

Inferred from expression pattern. Source: RGD 

response to peptide hormone stimulus 
Inferred from expression pattern. Source: RGD 

response to toxin 

Inferred from expression pattern. Source: RGD 



Cellular component 


extracellular space 




Inferred from electronic annotation. Source: UniProtKB-SubCell 




lysosome 




Inferred from direct assay. Source: RGD 


Molecular function 


protein binding 




Inferred from physical interaction. Source: RGD 




serine-type endopeptidase activity 




Inferred from direct assay. Source: RGD 



Complete GO annotation. 



Sequence annotation (Features) 



I 



[ Feature key Position(s) j Length Description 



Molecule processing 



Graphical view 



Feature id^nt 



Signal peptide 
Chain 




I 



245 i Chymotrypsinogen i I 



B 



13 


Chymotrypsin B 




chain A 



II 



PRO 000002 



PRO 000002 



http://www,uniprot.org/uniprot/P073 3 8 
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Chain 


34-164 


131 


Chymotrypsin B 
chain B 


1 1 


PRO_000(|02 


H 


Chain 


167-263 


97 


Chymotrypsin B 
chain C 


1 


PRO_000002 



Regions 

Domain 



34 - 261 



228 



Peptidase SI 



il 



Sites 



Active site 



75 



1 



Charge relay 

system 

By simiiarity | 



I 



Active site 



120 



1 



Charge relay 
system 



By similarity 



□ 



Active site 



213 



1 



Charge relay 
system 



By similarity 



Amino acid modifications 

Disulfide bond I 19 ^ 140 



□ 



Disulfide bond 1 60 76 



Disulfide bond [ 154 219 \ 



Disulfide bond I 186 ^ 200 



Disulfide bond \ 209 ^ 238 

Secondary structure 

1 . 

I Helix I Strand | Turn 

Details... 

Sequences 



By similarity 



By similarity 



By simiiarity 



II 



I I 



II 



I I 



Sequence 



Length Mass (Da) 



P07338 [UniParc]. FASTA 263 27.849 

Last modified April 1, 1988. Version 1. 
Checksum: ACAFDBACF8C4DA6D 



10 20 30 40 50 60^ 

MAFLWLVSCF ALVGATFGCG VPTIQPVLTG LSRIVNGEDA IPGSWPWQVS LQDKTGFHFC 

70 80 90 100 110 120 

GGSLISEDWV VTAAHCGVKT SDVWAGEFD QGSDEENIQV LKIAQVFKNP KFNMFTVRND 

130 140 150 160 170 180 

ITLLKUVTPA QFSETVSAVC LPNVDDDFPP GTVCATTGWG KTKYNALKTP EKIjQQAAIjPI 



190 



200 



210 



220 



230 



240 
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VSEADCKKSW GSKITDVMTC AGASGVSSCM GDSGGPLVCQ KDGVWTLAGI VSWGSGVCST 

250 260 
STPAVYSRVT ALMPWVQQIL BAN 

« Hide 



References 
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"Isolation and sequence of a rat chymotrypsin B gene." 
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Additional computationally mapped references. 



C ros s -ref e re n ces 



Sequence databases 

@EMBL 
GenBank 
DDBJ 



IPI 



PIR 

RefSeq 
UniGene 



3D structure databases 
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Entry information 


Entry name 


CTRB1 RAT 


Accession 


Primary (citable) accession number: P07338 


Pntrv hiQtrtrx/ 
III y 1 iioivi y 


iniegraiea into Apni 1, isoo 

UniProtKB/Swiss- 

Prot: 

Lasi sequence Mpni i , i s70d 
update: 

Last modified: April 5, 201 1 

This is version 97 of the entry and version 1 of the sequence. 
[Complete history] 


Entry status 


Reviewed (UniProtKB/Swiss-Prot) 


Annotation program 


Chordata Protein Annotation Program 



Relevant documents 



PDB cross-references 

Index of Protein Data Bank (PDB) cross-references 

Peptidase families 

Classification of peptidase families and list of entries 

SIMILARITY comments 

Index of protein domains and families 
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P80646(CTRB_GADMO)# Reviewed, UniProtKB/Swiss-Prot 

Last modified January 1 1 . 201 1 . Version 61 . 



Names and origin 


Protein names 


Recommended name: 
Chymotrypsin B 
EC=3.4.21.1 

Oleaved into the toiiowina z chains. 

1 . Chymotrypsin B chain A 

2. Chymotrypsin B chain B 


organism 


vjaous mornua (Aiianiic coq) 


Taxonomic identifier 


8049 [NCBI] 


Taxonomic lineage 


Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > 
Euteleostomi > Actinopterygii > Neopterygii > Teleostei > 
Euteleostei > Neoteleostei > Acanthomorpha > 
Paracanthopterygii > Gadiformes > Gadidae > Gadus 


Protein attributes 




Sequence length 


245 AA. 


Sequence status 


Complete. 


Sequence processing 


The displayed sequence is further processed into a mature 
form. 


Protein existence 


Evidence at protein level. 


General annotation (Comments) 


Catalytic activity 


Preferential cleavage: Tyr-|-Xaa, Trp-|-Xaa. Phe-|-Xaa. Leu-|- 
Xaa. 


Subcellular location 


Secreted > extracellular space. 


Sequence similarities 


Belongs to the peptidase SI family. 
Contains 1 peptidase S1 domain. 


Ontologies 




Keywords 




Biological process 


Digestion 


Cellular component 


Secreted 


Molecular function 


Hydrolase 
Protease 
Serine protease 


PTM 


Disulfide bond 
Zymogen 
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Technical term 


Direct protein sequencing 




Gene Ontology (GO) 






Biological process 


digestion 

Inferred from electronic annotation. 


Source: UniProtKB-KW 




proteolysis 

Inferred from electronic annotation. 


Source: InterPro 


Cellular component 


extracellular space 

Inferred from electronic annotation. 


Source: UniProtKB-SubCell 


Molecular function 


serine-type endopeptidase activity 
Inferred from electronic annotation. 


Source: InterPro 



Complete GO annotation. 



Sequence annotation (Features) 



Feature key 



Molecule processing 



Position(s) Length Description 



Graphical view 



Feature iderjtifi 
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1 - 13 


13 


Chymotrypsin | 1 
B chain A j 
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Propeptide 
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Chain 


16-245 


230 
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Charge relay j 
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system j 

1 By similarity I 



Amino acid modifications 



□ 


Disulfide bond 
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1 By similarity 1 : 1 


1 






□ 


Disulfide bond 
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Disulfide bond 



191 ^220 



I By similarity \ 



I I 



Experimental info 





Sequence conflict 


9-11 


3 


QVT^ VIS 
AA sequence 

1 Ref.2 1 


□ 


Sequence conflict 


26 


1 


S^TAA 
sequence 

I Ref.2 1 




Sequence conflict 


28-29 


2 


PW — Y AA . 
sequence 

1 Ref.2 1 




Sequences 



Sequence Length Mass (Da) 



P80646 [UniParc]. FASTA 245 26.260 

Last modified October 1, 1996. Version 1. 
Checksum: 74FE0D425517AB02 



' 10 20 30 40 50 60 

; CGSPAIQPQV TGYARIVNGE EAVPHSWPWQ VSLQQSNGFH FCGGSLINEN WWTAAHCNV i 

t 

i 70 80 90 100 110 120 

! RTYHRVIVGE HDKASDENIQ ILKPSMVFTH PKWDSRTINN DISLIKLASP AVLGTNVSPV 

t ' 

i 

130 140 150 160 170 180 

! CLGESSDVFA PGMKCVTSGW GLTRYNAPGT PNKLQQAALP LMSNEECSQT WGNNMISDVM 

! . I 

( 190 200 210 220 230 240 \ 

I ICAGAAGATS CMGDSGGPLV CQKDhfVWTLV GIVSWGSSRC SVTTPAVYAR VTELRGWVDQ 

. i 

ILAAN ' 
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Cross-references 



3D structure databases 
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ModBase 


Search... 


Protein family/group databases 




MEROPS 


S01.437. 


Phylogenomic databases 




HOVERGEN 


HBG013304. 


Enzyme and pathway databases 




BRENDA 


3.4.21.1. 39336. 


Family and domain databases 




InterPro 


IPR009003, Pept_cys/ser_Trypsin-like. 
IPR01 8114. Peptidase S1/S6 AS. 
IPR001 254. Peptidase SI S6. 
IPR001314. Peptidase_S1A. 

[Graphical view] 


Pfam 


PF00089. Trypsin. 1 hit. 
[Graphical view] 


PRINTS 


PR00722. CHYMOTRYPSIN. 


SMART 


SM00020. Tryp_SPc. 1 hit. 
[Graphical view] 


SUPFAM 
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ProtoNet 


Search... 


Entry information 




Entry name 


CTRB_GADMO 


Accession 


Primary (citable) accession number: P80646 


Entry history 


Integrated into October 1, 1996 
1 UniProtKB/Swiss- 
I Prot: 

Last sequence October 1, 1996 
update: 

Last modified: January 1 1 , 201 1 

This is version 61 of the entry and version 1 of the sequence. 
[Complete history] 


Entry status 


Reviewed (UniProtKB/Swiss-Prot) 
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Annotation program 



Chordata Protein Annotation Program 



Relevant documents 



Peptidase families 

Classification of peptidase families and list of entries 

SIMILARITY comments 

Index of protein domains and families 
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P00768 (CTR2_VESOR) ^ Reviewed, UniProtKB/Swiss-Prot 

Last modified January 1 1 , 201 1 . Version 71 . 



Names and origin 


Protein names 


R&commBndQd na/ne* 

V %^^\^S^W W 99 9 9\^9 f Vf ^^Wf 9 9^99 9 9\^m 

Chymotrypsin-2 

EC=3.4.21.1 
Alternative name(s): 
Chymotrypsin II 


Organism 


Vespa orientalis (Oriental hornet) 


Taxonomic identifier 


7447 [NCBI] 


Taxonomic lineage 


Eukaryota > Metazoa > Arthropoda > Hexapoda > Insecta > 
Pterygota > Neoptera > Endopterygota > Hymenoptera > Apocrita 
> Aculeata > Vespoidea > Vespidae > Vespinae > Vespa 


Protein attributes 




Sequence length 


216 AA. 


Sequence status 


Complete. 


Protein existence 


Evidence at protein level. 


General annotation (Comments) 


Catalytic activity 

, 


Preferential cleavage: Tyr-|-Xaa, Trp-|-Xaa, Phe-|-Xaa. Leu-|- 
Xaa. 


Subcellular location 

„. . 


Secreted > extracellular space. 


Sequence similarities 


Belongs to the peptidase SI family. 
Contains 1 peptidase S1 domain. 


Ontologies 




Keywords 




Cellular component 


Secreted 


Molecular function \ Hydrolase 

j Protease 
1 Serine protease 


PTM 


Disulfide bond 


Technical term 


Direct protein sequencing 


Gene Ontology (GO) 




Biological process 

1 


proteolysis 

Inferred from electronic annotation. Source: InterPro 


Cellular component j 


extracellular space 
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Molecular function 



Inferred from electronic annotation. Source: UniProtKB-SubCell 



serine-type endopeptidase activity 

Inferred from electronic annotation. Source: InterPro 



Complete GO annotation. 



Sequence annotation (Features) 



Feature key Position(s) Length Description ; Graphical view 



1 Feature identifier 



Molecule processing 



11 
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1 -216 


Regions 




IS 


Domain | 1-216 


Sites 






Active site 
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Active site j 1 73 

1 
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1 



216 



Chymotrypsin 
-2 



PRO 0000088677 



216 



Peptidase S1 



1 j Charge relay 
system 



1 I Charge relay 
system 



1 



Ref.2 



Charge relay 
system 



Amino acid modifications 
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Disulfide bond 



25 w40 
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Ref.1 



I I 
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II 



Disulfide bond 



169 193 



Ref.1 



I I 



Sequences 



Sequence 



Length Mass (Da) 



P00768 [UniParc]. 

Last modified July 21 . 1986. Version 1 
Checksum: F235BF992AEFEDE1 



FASTA 



216 



23.471 
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70 80 90 100 110 120 

AVYNAKELIV NKNYNSIRLI NDIGLIRVSK DISFTQLVQP VKL.PVSNTIK AGDPWLTGW 

130 140 150 160 170 180 

GRIYVNGPIP NNLQQXTLSI VNQQTCKSKH WGLTDSQICT FTKRGEGACH GDSGGPLVAN 
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GVQIGIVSYG HPCAIGSPNV FTRVYSFLDW IQKNQL 
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PROSITE 
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1 PS00134, TRYPSIN HIS. 1 hit. 
PS00135. TRYPSIN_SER. 1 hit. 

[Graphical view] 


ProtoNet 


Search... 


Entry information 




Entry name 


CTR2 VESOR 


Accession 


Primary (citable) accession number: P00768 


Entry history 

• 


Integrated into July 21. 1986 

UniProtKB/Swiss- 

Prot: 

Last sequence July 21, 1986 
update: 

Last modified: January 1 1 . 201 1 

This is version 71 of the entry and version 1 of the sequence. 
(Complete history] 


Entry status 


Reviewed (UniProtKB/Swiss-Prot) 



Relevant documents 



Peptidase families 

Classification of peptidase families and list of entries 

SIMILARITY comments 

Index of protein domains and families 
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P00769 (CTR2_VESCR)^ Reviewed, UniProtKB/Swiss-Prot 

Last modified January 11. 2011. Version 70. 



Names and origin 



Protein names 


Recommended name: 
Chymotrypsin-2 
EC=3.4.21.1 

Alternative name(s): 
Chymotrypsin II 


Organism 


Vespa crabro (European hornet) 






Taxonomic identifier 


7445 [NCBI] 



Taxonomic lineage 



Eukaryota > Metazoa > Arthropoda > Hexapoda > Insecta > 
Pterygota > Neoptera > Endopterygota > Hymenoptera > Apocrita 
> Aculeata > Vespoidea > Vespidae > Vespinae > Vespa 



Protein attributes 



Sequence length 


218AA. 


Sequence status 


Complete. 


Protein existence 


r ■ .... J-..- , . . ... 

Evidence at protein level. 



General annotation (Comments) 



Catalytic activity | Preferential cleavage: Tyr-|-Xaa. Trp-|-Xaa, Phe-|-Xaa. Leu-|- 
I Xaa. 



Subcellular location ^ Secreted > extracellular space. 



Miscellaneous | An additional Arg at the carboxyl end was found in some of the 

i molecules. 

-4- 



Sequence similarities i Belongs to the peptidase 81 family. 

1 

Contains 1 peptidase 81 domain. 



Ontologies 



Keywords 



Cellular component 


Secreted 


Molecular function 


Hydrolase 
Protease 
Serine protease 


PTM 


Disulfide bond 


Technical term 


Direct protein sequencing 


Gene Ontology (GO) 




Biological process 


proteolysis 

Inferred from electronic annotation. Source: InterPro 
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Cellular component 



\ extracellular space 
Inferred from electronic annotation. Source: UniProtKB-SubCell 



Molecular function 



serine-type endopeptidase activity 

Inferred from electronic annotation. Source: InterPro 



Complete GO annotation.,. 



Sequence annotation (Features) 



Feature key Position(s) Length Description | Graphical view 
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Molecule processing 
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1 
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Amino acid modifications 



Disulfide bond 



B I Disulfide bond 



□ Disulfide bond 



25^40 
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171 ^ 195 



II 



II 



I I 



Sequences 



Sequence 

P00769 [UniParc]. 

Last modified July 21, 1986. Version 1. 
Checksum: 509AB50DE190EB39 



Length IMass (Da) 

FASTA 218 23,677 



r 
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130 140 150 160 170 180 

6WGRIYVN6P IPNNLQQITL SIVNQQTCKF KHWGLTDSQI CTFTKLGEGA CDGDSGGPLV 



http://www.uniprot.org/uniprot/P00769 



4/12/2011 



Chymotrypsin-2 - Vespa crakj^ (European homet) 



Page 3 of 4 



190 200 210 
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Entry information 


Entry name 


i CTR2 VESCR 


Accession 


Primary (citable) accession number: P00769 




integratea into July 21, l9oo 

UniProtKB/Swiss- 

Prot: 

Last sequence July 21 , 1986 
update: 

Last moaiTied. January ll, 2011 

This is version 70 of the entry and version 1 of the sequence, 
[Complete history] 


Entry status 


Reviewed (UniProtKB/Swiss-Prot) 
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SIMILARITY comments 
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0973&8 (CTR_PHACE)^ Reviewed, UniProtKB/Swiss-Prot 

Last modified January 1 1 , 201 1 . Version 57. 



Names and origin 


Protein names 


Recommended name: 
Chymotrypsin 

j EC=3.4.21.1 


Organism | Phaedon cochleariae (Mustard beetle) 


Taxonomic identifier 


1 80249 [NCBI] 


Taxonomic lineage 

Protein attributes 


Eukaryota > Metazoa > Arthropoda > Hexapoda > Insecta > 
i Pterygota > Neoptera > Endopterygota > Coleoptera > Polyphaga 
> Cucujiformia > Chrysomeloldea > Chrysomelidae > 
Chrysomelinae > Chrysomelini > Phaedon 


Sequence length 


276 AA, 


Sequence status 


Complete. 


Sequence processing 


The displayed sequence is further processed into a mature 
form. 


Protein existence | Evidence at transcript level. 

General annotation (Comments) 


Function 
Catalytic activity 

• 


Serine protease with chymotryptic and collagenolytic activities 

1 By similarity | j UniProtKB Q00871 | 

_ _ 

Preferential cleavage: Tyr-|-Xaa, Trp-|-Xaa, Phe-|-Xaa, Leu-|- 

Xaa 1 UniProtKB Q00871 j 


Subcellular location 


<=;f^rrPtf^H > PYtrpr.<^llij!ar Rp^rP By similarity UniProtKB Q00871 j. 


Tissue specificity 


Expressed in larval carcasses and gut. and adult gut. 1 ^^^-^ \ 


Developmental stage 


Ovarial and mature eggs, larvae and adult. 1 ^^^^ 1 


Sequence similarities 

Ontologies 


Belongs to the peptidase Si family. 
Contains 1 peptidase S1 domain. 


Keywords 

Biological process 


Collagen degradation 
Digestion 


Cellular component ! Secreted 
Domain , Signal 
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Molecular function 


Hydrolase 
Protease 
Serine protease 


PTM 


Disulfide bond 

Glycoprotein 

Zymogen 




Aon A Ontnlnnx/ /GO) 






Biological process 


collagen catabolic process 
Inferred from electronic annotation. 


oource. unirroti\i3-i\vv 




digestion 

Inferred from electronic annotation. 


Source: UniProtKB-KW 




proteolysis 

Inferred from electronic annotation. 


Source: InterPro 




Cellular component 


extracellular space 

Inferred from electronic annotation. 


Source: UniProtKB-SubCell 


Molecular function 


serine-type endopeptidase activity 
Inferred from electronic annotation. 


Source: InterPro 



Complete GO annotation. 



Sequence annotation (Features) 



Feature key Position(s) j Length Description 



Molecule processing 



Graphical view 



Feature identifier 



HI 


Signal peptide 


1-16 


16 


Potentiai I 




., — . ., - — — - ^i^. 


d 


Propeptide 


17-45 


29 


Activation 
peptide 










1 By similarity ! 










1 UniProtKBQ00871i| 




Chain 


46 - 276 


231 


Chymotrypsin | 


Regions 








m 


Domain 


46 - 272 


227 


Peptidase SI 



I I 









PRO. 




PRO 


1 





Sites 





Active site 


89 


1 


r 

Charge relay | 
system 1 

1 By similarity j | 
UniProtKB P00771 il 

< 




Active site 


135 


1 


! 

Charge relay 










system ! 










1 By similarity 










UniProtKB P00771|| 
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Active site 



229 



1 j Charge relay 
system 

I By similarity 1 



I 



UniProtKB PQ0771|1 



Amino acid modifications 



Q Glycosylation 



Glycosylation 



144 



193 



ED I Disulfide bond i 74 ^ 90 



O I Disulfide bond I 202^^215 



1 



1 



N-linked 
(GlcNAc .) 

I Potential | 



N-llnked 
(GlcNAc .) 

Potential 



By similarity 



II 



UniProtKB P00771i 



Disulfide bond I 225 ^ 250 



By similarity 




UniProtKB P00771 i 



By similarity 



I I 



UniProtKB P00771 1 



Sequences 



Sequence 



Length Mass (Da) 



097398 [UniParc]. 

Last modified May 1, 1999. Version 1 
Checksum: FC5FD05DB882A1DE 



FASTA 



276 



29,868 



10 20 30 40 50 60 

MKVALWLAIi FGVSLAASID NIEIPPSKNI YVEPINQPEV DPSLEIVNGQ EWPHSIPYQ 



70 

IFLVASA6ET 



80 

SWTCGGSLIT 



90 

KRYVLTAAHC 



100 

IQGAKSVHVT 



110 

LGAHNLAKHE 



12£ 

ASKVTVNGRS 



130 140 150 

WVIHEKYDST NIDNDIGVIQ LERNLTLTRS 
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IQLARLPSLR DVGINLEGRT ATVSGWGIiTN 



190 200 210 220 230 240 

GIFQTTTDVIi RANNTIISNK ECNDVFKIVQ PTEVCLSIAG GRSACSGDSG GPLVIDNVQH 

250 260 270 

GIVSYGSSYC RSTPSVFTRV SSYLNWLQTH SEWRAQ 
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Cross-references 






@ EMBL 
# GenBank 

w UUdJ 


Y17904 mRNA. Translation: CAA76928.1. 


3D structure databases 




HSSP 


HSSP built from PDB template 1NN6 based on UniProtKB 
P23946 


ProteinModelPortal 


097398. 


SMR 


097398. Positions 46-270. 


ModBase 


Search... 


Enzyme and pathway databases 




BRENDA 


3.4.21.1.290922. 


Fannily and domain databases 




InterPro 


IPR009003. Pept_cys/ser Trypsin-like. 
IPR01 8114. Peptidase S1/S6 AS. 
IPR001254. Peptidase SI S6. 
IPR001314. Peptidase_S1A. 
[Graphical view] 


Pfam 


PF00089. Trypsin. 1 hit. 
[Graphical view] 


PRINTS 


PR00722. CHYMOTRYPSIN. 


SMART 


SM00020. Tryp_SPc. 1 hit. 
[Graphical view] 

• > • 


SUPFAM 


SSF50494. Pept_Ser_Cys. 1 hit. 


PROSITE 1 

t 

.„ — „ .... _ _ _i 


PS50240. TRYPSIN DOM. 1 hit. 
PS001 34. TRYPSIN HIS. 1 hit. 
PS00135. TRYPSIN^SER. 1 hit. 
[Graphical view] 


ProtoNet 


Search... 
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Entry information 


Entry name 


CTR_PHACE 


Accession 


Primary (citable) accession number: 097398 
Secondary accession number(s): P81521 


cmiy nisiory 

I 
1 

i 


Integrated into January 15, 2008 

UniProtKB/Swiss- 

Prot: 

Last sequence May 1, 1999 
update: 

Last modified: January 1 1 , 201 1 

This is version 57 of the entry and version 1 of the sequence. 
[Complete history] 


Entry status | Reviewed (UniProtKB/Swiss-Prot) 



Relevant documents 



Peptidase families 

Classification of peptidase families and list of entries 

SIMILARITY comments 

Index of protein domains and families 
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P55091 {CJRCJRAT)lk Reviewed, UniProtKB/Swiss-Prot 

Last modified March 8, 2011. Version 83. 



Names and origin 


Protein names 


Recommended name: 

Chymotrypsin-C 
EC=3.4.21.2 
Alternative name(s): 

oaiaecnn 

Serum calcium-decreasing factor 


Gene names 


Name:Ctrc 


Oraanism 


Rattus norveaicus ^Rat) 

^ ^ ^ 


1 oAUl lUrillU lUCIIllllCI 


1 \J i Iw ^iNwDIJ 


Taxonomic lineage 


Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > 
Euteleostomi > Mammalta > Euthena > Euarcnontoglires > uiires 
> Rodentia > Sciurognathi > Muroidea > Muridae > Murinae > 
Rattus 


Protein attributes 




Sequence length 


268 AA. 


i 

Sequence status 


Complete. 


Sequence processing 


The displayed sequence is further processed into a mature 
form. 



Protein existence | Evidence at protein level. 



General annotation (Comments) 

Function \ Has chymotrypsin-type protease activity and hypocalcemic 

i activity. 

Catalytic activity i Preferential cleavage: Leu-|-Xaa, Tyr-|-Xaa, Phe-|-Xaa, Met-|- 

I Xaa, Trp-|-Xaa, Gln-|-Xaa, Asn-|-Xaa. 

Tissue specificity | Pancreas. 

Belongs to the peptidase S1 family. Elastase subfamily. 

] Contains 1 peptidase S1 domain. 

i Was originally (EB^I) thought to be elastase IV. 



Ontologies 



Sequence similarities 

I 

Caution 



Keywords 

r 

Domain j Signal 



Molecular function | Hydrolase 

; Protease 
I Serine protease 
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PTM 


Disulfide bond 

Glycoprotein 

Zymogen 


Technical term 


Direct protein sequencing 




Gene Ontology (GO) 






Biological process 


proteolysis 

Inferred from electronic annotation. 


Source: InterPro 


Molecular function 


serine-type endopeptidase activity 
Inferred from electronic annotation. 


Source: InterPro 



Complete GO annotation. 



Sequence annotation (Features) 



Feature key 



Molecule processing 



Position(s) Length Description 



Graphical view 
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1R j Potential 


I ! 
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13 1 Activation peptide 
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1 By similarity I 




1 




□ 


Active site 
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[ By similarity 1 
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1 


Charge relay system 

1 By simtiarity I 
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Anfiino acid modifications 
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N-linked (GlcNAc ) 1 Potential 
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Disulfide bond 
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1 






Disulfide bond 
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1 By similarity 
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Disulfide bond 
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Disulfide bond 
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Experimental info 



□ 



Sequence conflict 



Sequence conflict 
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I I 



Sequences 



Sequence 



Length Mass (Da) 



P55091 [UniParc]. 

Last modified October 1. 1996. Version 1. 
Checksum: 33B67AF34D0F8583 



FASTA 



268 



29.374 
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13£ 140 150 160 170 180 
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©DDBJ 


S80379 nnRNA. Translation: AAB35830.1. 
X59014 mRNA. Translation: CAA41 753.1. 


IPI 


IPI00213599. 


FIR 


JQ1473. 


RefSeq 


NP_001071117.1. NM_001 077649.1. 


UniGene 


Rn.19214. 




3D structure databases 






ProteinModelPortal 


P55091. 




SMR 


P55091. Positions 17-268. 


Mod Base 


Search... 




Protein-protein interaction databases 




STRING 


P55091. 




Protein family/group databases 






MEROPS 
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Proteomic databases 
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Genome annotation databases 






EnsembI 
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GenelD 
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KEGG 
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Enzyme and pathway databases 




BRENDA 


3.4.21.2. 248. 


Gene expression databases 




Array Express 


P55091. 


Genevestigator 


P55091. 


GermOnline 


ENSRNOG00000013745. Rattus norvegicus. 


Family and domain databases 




InterPro 


IPR009003. Pept cys/ser Trypsin-like. 
IPR0181 14. Peptidase S1/S6 AS. 
IPR001254. Peptldase_S1_S6. 
IPR001314. Peptidase_S1A. 
[Graphical view] 


Pfam 


PF00089. Trypsin. 1 hit. 
[Graphical view] 


PRINTS 
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SMART 


SM00020. Tryp_SPc. 1 hit. 
[Graphical view] 


SUPFAM 


SSF50494. Pept_Ser_Cys. 1 hit. 
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PS50240. TRYPSIN DOM. 1 hit. 
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[Graphical view] 
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Search... 
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Entry information 





Entry name 
Accession 



Entry history 



Entry status 



I CTRC_RAT 



I Primary (citable) accession number: P55091 
j Sec ondary accession number(s): Q63188 



Annotation program 



Integrated Into October 1, 1996 

UniProtKB/Swiss- 

Prot: 

Last sequence October 1, 1996 
update: 

Last modified: March 8, 201 1 

This is version 83 of the entry and version 1 of the sequence. 
[Complete history] 
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Chordata Protein Annotation Program 
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P04189 (SUBT_BACSU)'*' Reviewed, UniProtKB/Swiss-Prot 

Last modified February 8, 2011. Version 108. 



Names and origin 


Protein names 


Recommended name: 
Subtilisin E 

EC=3.4.21.62 


Gene names 


Name: aprE 
Synonyms: apr, aprA. sprE 
L/roereo locus iNames.ooLf luouu 


Organism 


Bacillus subtilis [Complete proteome] [HAMAP] 


Taxonomic identifier 


i 1423 [NCBI] 


1 axonomic lineage 


Dactena > rirmicutes > Daciuaies > Daciiiaceae > Daciiius 


Protein attributes 




Sequence length 


381 AA. 


Sequence status 


Complete. 


Sequence processing 


The displayed sequence is further processed into a mature 
form. 


Protein existence 


Evidence at protein level. 


General annotation (Comments) 


Function 


Subtilisin is an extracellular alkaline serine protease, it 
catalyzes the hydrolysis of proteins and peptide amides. 


Catalytic activity 

: 
I 


Hydrolysis of proteins with broad specificity for peptide bonds, 
and a preference for a large uncharged residue in PI . 
Hydrolyzes peptide amides. 


Cofactor > Binds 2 calcium ions per subunit. 1 ^^^ ''^ 1 


Subcellular location 


Secreted. 


Domain 


The propeptide functions as an intramolecular chaperone which 
is essential for the correct folding of the protease domain but is 
not required for enzymatic function of the folded protein. It is 
autoprocessed and degraded after completion of the folding 

prnneR55 1 Ref.1i I 


Miscellaneous 

1 


Secretion of subtilisin is associated with onset of sporulation, 
and many mutations which block sporulation at early stages 
affect expression levels of subtilisin. However, subtilisin is not 
necessary for normal sporulation. 


Sequence similarities | 


Belongs to the peptidase S8 family. 
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Ontologies 


Keywords 




Biological process 


Sporulation 


Cellular component 


Secreted 


Domain 


Signal 


Ligand 


Calcium 
Metal-binding 


Molecular function 


Hydrolase 
Protease 
Serine protease 


PTM 


Autocatalytic cleavage 
Zymogen 


Technical term 1 3D-structure 

1 Complete proteome 

1 Direct protein sequencing 


Gene Ontology (GO) 




Biological process 

; 
1 

] 
\ 

„„„ , i 


negative regulation of catalytic activity 

Inferred from electronic annotation. Source: InterPro 

proteolysis 

Inferred from electronic annotation. Source: InterPro 

sporulation resulting in formation of a cellular spore 
Inferred from electronic annotation. Source: UniProtKB-KW 


Cellular component j 


extracellular region 

Inferred from electronic annotation. Source: UniProtKB-SubCell 



Molecular function 



i identical protein binding 
Inferred from electronic annotation. Source: InterPro 

metal ion binding 

Inferred from electronic annotation. Source: UniProtKB-KW 

serine-type endopeptidase activity 
Inferred from electronic annotation. Source: InterPro 



Complete GO annotation. 



Sequence annotation (Features) 
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Secondary structure 



I -3 



I Helix I Strand | Turn 



Details... 



Sequences 



Sequence 



Length Mass (Da) 



P04189 [UniParc]. 

Last modified June 16, 2009. Version 3. 
Checksum: 5C8A4B0E42FCE83D 



FASTA 
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subtilisin (gtg start codon) [Bacillus subtilis] 

GenBanlc AAA22742.1 
PASTA QffiPh'W 
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DEFINITION 

ACCESSION 

VERSION 

DBSOimCB 

KEYWORDS 

SOURCE 

ORGANISM 

REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
PUBMED 
COMMENT 



AAA22742 381 
subtilisin (gtg start codon) 
AAA22742 

AAA22742.1 GI:143520 

locus BACSBTL accession K01988.1 



aa linear 
[Bacillus subtilis] . 



BCT 26-APR-1993 



Bacillus 
Bacillus 



subtilis 
subtilis 



FEATURES 

source 



Bacteria; Pirmicutes; Bacillales; Bacillaceae; Bacillus. 
1 (residues 1 to 381) 
Stahl,M.L. and Ferrari, E. 

Replacement o£ the Bacillus subtilis subtilisin structural gene 
with an In vitro^derlved deletion mutation 
J. Bacteriol. 158 (2), 411*418 (1984) 

€427178 

{1] reports an in vitro-derived deletion mutant (delta-apr-684) , 
which produces only 10% of wild- type serine protease activity. The 
deletion is from base 178 to 871. 
Method: conceptual translation. 
Location/Qualifiers 
1. .381 

/organisms "Bacillus subtilis" 

/strain="W168" 

/ sub__s t r ain= " PY 7 9 " 

/db_xre f » " taxon ; 1423 " 
Protein 1..381 

/name»"BXJbtili8in (gtg start codon) " 
Region 37 . . 104 

/ region_name = " I nhi bi tor_1 9 " 

/note- "Peptidase inhibitor 19; pfam05922" 

/c3b_xre f »" CDD : 147849 ■ 
Region 131 . . 359 

/ regi on__name = " Pept ida se 8_S 8__Subt i 1 i s in^subse t " 

/notes "Peptidase S8 family domain in Subtilisin proteins; 

cd07477" 

/db xref="CDD: 173803" 



Site 



Site 



CDS 



order(138, 170,213,232,263,327) 
/si te_type= "active" 
/db xref="CDD: 173803 " 
order (138, 170, 327) 
/si t e_t ype ■ " other " 
/notes "catalytic residues" 
/db xrefs"CDD; 173803 " 
1 



.381 

/coded_by="K01988 . 1 : 137 
/transl tableau 



1282" 



ORIGIN 



1 
61 
121 
181 
241 
301 
361 



mrskklwisl 
ekggkvqkqf 
alhsqgytgs 



// 



Ifaltliftm afsnmsaqaa gksstekkyi vgfkqtmsam ssakkkdvis 
kyvnaaaatl dekavkelkk <3^svayveed hiaheyaqsv pygisqikap 
nvkvavidsg idsshpdlnv rggasfvpse tnpyqdgssh gthvagtiaa 
Innsigvlgv spsaslyavk vldstgsgqy swlingiewa isnnrndvinm slggptgsta 
Iktwdkavs sgiwaaaag negssgstst vgypakypst iavgavnssn qrasfssags 
eldvmapgvs iqstlpggty gayngtsmat phvagaaali Iskhptwtna qvrdrlesta 
tylgnsfyyg kglinvqaaa q 
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pre-pro-subtilisin [Bacillus subtilis] . 

AAA22744 

AAA22744.1 GI: 143523 

locus BACSBTLA aCcession M16635. 1 

Bacillus subtilis 

Bacillus subtilis 

Bacteria; Pirmicutes; Bacillales; Bacillaceae; Bacillus. 

1 (residues 1 to 156) 

Ikemura , H . , Takagi , H . and Inouye , M . 

Requirement of pro- sequence for the production of active subtilisin 
E in Escherichia coli 

J. Biol. Chem. 262 (16). 7859-7864 (1987) 

3108260 

Method: conceptual translation. 
Loca t ion/Qiiali f ie r s 
1. . 156 

/organisms "Bacillus subtilis" 
/strain- "Wie 8" 
/ sub_s t rai n- " PY7 9 " 
/db xref o"taxon; 14 23 " 
1. . 156 

/name- "pre-pro-subtilisin" 

1 . . 29 

/note= "pre-pro-subtilisin signal peptide" 
30. . >156 

/product=''pro-sxibtilisin" 
37 . . 104 

/region_name="Inhibitor_I9" 
/note* "Peptidase inhibitor 19; pfam05922* 
/db xref ="CDD; 147849 * 
107. .>156 

/produc t ■ " subt i 1 i s in " 

122, .>156 

/region_name="Peptidases_S8_S53 " 

/notes "Peptidase domain in the S6 and S53 families; 
C110459" 

/db_xre f - " CDD : 175216 " 
1. .156 

/coded_by-"M16639, 1:133. .>600* 
/transl table-11 



ORIGIN 



// 



1 mrsldclwisl Ifaltliftm afsnrosaqaa gksstekkyi vgfkqtmsam ssakkkdvis 
61 ekggkvqkgf kyvnaaaatl dekavkelkk dpsvayveed hiaheyaqsv pygisqi)cap 
121 alhsqgytgs nvkvavidsg idsshpdlnv rggasf 
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P00781 (SUBD_BACLI)^ Reviewed, UniProtKB/Swiss-Prot 

Last modified January 1 1 , 201 1 . Version 76. 



f 

Names and origin 


Protein names 


Recommended nsme: 
Subtilisin DY 

EC=3.4.21.62 


Gene names 


Name:apr 


Organism 


Bacillus licheniformis 


Taxonomic identifier 


1402 [NCBI] 


Taxonomic lineage 

Protein attributes 


Bacteria > Firmicutes > Bacillales > Baciliaceae > Bacillus 


Sequence length j 274 AA. 

' ■■ III M ■ ■ 1 1 ■ 1 


Sequence status | Complete. 


Protein existence | Evidence at protein level. 

General annotation (Comments) 


Function 


Subtilisin is an extracellular alkaline serine protease, it 
catalyzes the hydrolysis of proteins and peptide amides. 


Catalytic activity 


Hydrolysis of proteins with broad specificity for peptide bonds, 
and a preference for a large uncharged residue in PI. 
Hydrolyzes peptide amides. 


-M, 

Cofactor 


Binds 2 calcium ions per subunit. 


Subcellular location 


Secreted. 


Miscellaneous 


Secretion of subtilisin is associated with onset of sporulation, 
and many mutations which block sporulation at early stages 
affect expression levels of subtilisin. However, subtilisin is not 
necessary for normal sporulation. 


Sequence similarities | Belongs to the peptidase S8 family. 
Ontologies 


Keywords 

Biological process 
Cellular component 


Sporulation 
Secreted 


LIgand 


Calcium 
Metal-binding 


Molecular function 

: 

( 
t 

1 


Hydrolase 
Protease 
Serine protease 
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Technical term 


3D-structure 

Direct protein sequencing 


Gene Ontology (GO) 




Biological process 


proteolysis 

Inferred from electronic annotation. Source: InterPro 

sporulatton resulting in formation of a cellular spore 
Inferred from electronic annotation. Source: UniProtKB-KW 


Cellular component 


extracellular region 

Inferred from electronic annotation. Source: UniProtKB-SubCell 


Molecular function 


metal ion binding 

Inferred from electronic annotation. Source: UniProtKB-KW 

serine-type endopeptidase activity 

Inferred from electronic annotation. Source: InterPro 



Complete GO annotation. 



Sequence annotation (Features) 





Feature key 


Position 


Length 






(s) 




Molecule processing 




n 


Chain 


1 - 274 


274 



Description { Graphical view 



DY 



Feature identifier 



PRO 0000076417 



Sites 



□ 


Active site 


32 


1 


Charge 

relay 

system 


1 






Active site 


63 


1 


Charge 

relay 

system 


1 




□ 


Active site 


220 


1 


Charge 

relay 

system 






□ 


Metal binding 




1 


Calcium 1 








Metal binding 


41 


1 


Calcium 1 


1 






Metal binding 


74 


1 


Calcium 1; 
via carbonyl 
oxygen 


1 






Metal binding 


76 


1 


Calcium 1 


' 




□ 


Metal binding 


80 


1 


Calcium 1; 
via carbonyl 
oxygen 


1 i 

i 
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□ 


1 

Metal binding 


168 


1 


Calcium 2; 
via carbonyl 
oxygen 


1 




□ 


Metal binding 


170 


1 


Calcium 2; 
via carbonyl 
oxygen 






n 


Metal binding 


173 


1 


Calcium 2; 
via carbonyl 
oxygen 


1 





Secondary structure 
1 

I Helix I Strand | Turn 

Details... 

Sequences 



* * 



... 274 



Sequence 



Length Mass (Da) 



P00781 [UniParc]. 

Last modified July 21, 1986. Version 1 
Checksum: 0154696E22F46533 



FASTA 



274 



27,436 



10 20 30 40 . 50 60 

\ AQTVPYGIPL IKADKVQAQG YKGANVKVGI IDTGIAASHT DLKWGGASF VSGESYNTDG 

' 70 80 90 100 110 12 0^ 

! NGHGTHVAGT VAALDNTTGV LGVAPNVSLY AIKVLNSSGS GTYSAIVSGI EWATQNGLDV 

» 130 140 150 16^ 170 180 

INMSLGGPSG STALKQAVDK AYASGIVWA AAGNSGSSGS QNTIGYPAKY DSVIAVGAVD 

ISO 20£ 212 220 230 240 

SNKNRASFSS VGAEIiEVMAP GVSVYSTYPS NTYTSU7GTS MASPHVAGAA ALILSKYPTL 

250 26 0 270 

; SASQVRNRLS STATNLGDSF YYGKGLINVK AAAQ 



« Hide 



References 



[1] i "Primary structure of subtilisin DY." 

I Nedkov P., Oberthur W., BraunitzerG. 

I Hoppe-Seyler*s Z. Physiol. Chem. 364:1537-1540(1983) [PubMed: 6420308] [Abstract] 
Cited for : PROTEIN SEQUENCE, 
Strain : DY. 

[2] i "Crystal structure of subtilisin DY, a random mutant of subtilisin Carlsberg." 

I Eschenburg S., Genov N., Peters K., Fittkau S., Stoeva S., Wilson K.S.. Betzel C. 
Eur. J. Biochem. 257:309-318(1998) [PubMed: 9826175] [Abstract] 
Cited for : X-RAY CRYSTALLOGRAPHY (1.75 ANGSTROMS), 
Strain: DY. 



http://www.uniprot.org/uniprot/P0078 1 



4/12/2011 



Subtilisin DY - Bacillus licbiniformis 




Page 4 of 5 



C ros s -ref e re n ces 


3D structure databases 
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PDBsum 
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SMR 


P00781. Positions 1-274. 


ModBase 


Search... 




Protein familv/arouD databases 






MEROPS 






Enzyme and pathway databases 






BRENDA 
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3.4.21.62. 1017. 




Family and domain databases 






InterPro 


IPR000209. Peptidase S8/S53. 
IPR022398. Peptidase_S8/S53_AS. 
IPR01 5500. Peptidase_S8_subtilisin-rel. 
[Graphical view] 






GeneSD 


G3DSA:3.40. 50.200. Pept_S8_S53. 1 hit. 




,-,.„ , . .... - .... 

PANTHER 


PTHR10795. SubtiiSerProt. 1 hit. 




Pfam 


PF00082. Peptidase_S8. 1 hit. 
[Graphical view] 





PRINTS 



PR00723. SUBTILISIN. 



SUPFAM 



4- 



SSF52743. Pept_S8_S63. 1 hit. 



PROSITE 


PS00136. SUBTILASE ASP. 1 hit. 
PS00137. SUBTILASE HIS. 1 hit. 
PS00138. SUBTII-ASE^SER, 1 hit. 
[Graphical view] 


ProtoNet 


Search... 


Entry information 




Entry name 


SUBD_BACLI 


Accession 


Primary (citable) accession number: P00781 


Entry history 


Integrated into July 21,1 986 
UniProtKB/Swiss- 



Prot: 

Last sequence July 21 , 1 986 
update: 

Last modified: January 1 1 , 201 1 

This is version 76 of the entry and version 1 of the sequence. 
[Complete history] 



Entry status 



Reviewed (UniProtKB/Swiss-Prot) 



Annotation program 



Prokaryotic Protein Annotation Program 
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P29599 (SUBB_BACLE) ^ Reviewed, UniProtKB/Swiss-Prot 

Last modified January 11, 2011. Version 72. 



Names and origin 



Protein names 


Recommended name: 
Subtilisin BL 
EC=3.4.21.62 

AlfiV f fCICf 1r C7 f fCIf f fCJ|Oy. 

Alkaline protease 


Organism 


Bacillus ientus 


Taxonomic identifier 


1467 [NCBI] 


Taxonomic lineage 


Bacteria > Firmlcutes > Baciilales > Bacillaceae > Bacillus 


Protein attributes 




Sequence length 


269 AA. 


Sequence status 


Complete. 


Protein existence 


Evidence at protein level. 



• 



General annotation (Comrnents) 



Function | Subtilisin is an extracellular alkaline serine protease, it 

I catalyzes the hydrolysis of proteins and peptide amides. 

j Hydrolysis of proteins with broad specificity for peptide bonds, 
I and a preference for a large uncharged residue in PI. 
j Hydrolyzes peptide amides. 

j Binds 2 calcium ions per subunit. 

j Secreted. 

Miscellaneous I Secretion of subtilisin is associated with onset of sporulafion, 

I and many mutations which block sporulation at early stages 
I affect expression levels of subtilisin. However, subtilisin is not 
I necessary for normal sporulation. 

■ ■■ M,. I I ■ r II , II I n il - .Ill ■■ ■ y * 

Sequence similarities I Belongs to the peptidase S8 family. 



Catalytic activity 



Cofactor 

Subcellular location 



Ontologies 



Keywords 



i 

Biological process I Sporulation 



Cellular component 


Secreted 


Ligand 


Calcium 




Metal-binding 


Molecular function 


Hydrolase 




Protease 




Serine protease 
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Technical term j 3D-structure 

Gene Ontology (GO) 



Biological process 


proteolysis 

Inferred from electronic annotation. Source: InterPro 

sporulation resulting in formation of a cellular spore ^ 
Inferred from electronic annotation. Source: UniProtKB-KW 


Cellular component 


extracellular region 

Inferred from electronic annotation. Source: UniProtKB-SubCell 


Molecular function 

■ 


metal ion binding 

Inferred from electronic annotation. Source: UniProtKB-KW 

serine-type endopeptidase activity 

Inferred from electronic annotation. Source: InterPro 


Complete GO annotation... 





Sequence annotation (Features) 



I Feature key 



It 
Position j Length 

I (s) 



Description i Graphical view 



Feature identifier 



i 



Molecule processing 

Chain 



1-269 



\ 



269 i Subtilisin 
i BL 



Sites 



PRO 0000076416 
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Active site 


32 
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Charge 

relay 

system 
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Active site 

^ 1 


62 
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Charge 

relay 

system 
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Metal binding 
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Secondary structure 
1 

I Helix I Strand | Tum 

Detaiis... 

Sequences 



• ■ « • 



. .. 269 



Sequence 



Length Mass (Da) 



P29599 [UniParc]. 

Last modified April 1 , 1 993. Version 1 . 
Checksum: E8AFF1A6A9E2676B 



FASTA 



269 



26.824 



I 



10 20 30 40 50 60 

i AQSVPWGISR VQAPAAHNRG LTGSGVKVAV LDTGISTHPD LNIRGGASFV PGEPSTQDGN 



1 



t 



70 80 90 100 110 120 

GHGTHVAGTI AALNNSIGVL GVAPSAELYA VKVLGADGRG AISSIAQGLE WAGNNGMHVA 



130 140 150 160 170 180 

i NLSLGSPSPS ATLEQAVNSA TSRGVLWAA SGNSGASSIS YPARYANAMA VGATDQNNNR 

i 190 200 210 220 230 240 

■ ASFSQYGAGL DIVAPGVNVQ STYPGSTYAS LNGTSMATPH VAGAAALVKQ KNPSWSNVQI 



250 260 
RNHLKNTATS IjGSTNLYGSG LVNAEAATR 

« Hide 
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C ross -ref e re n ces 



3D structure databases 



® PDBe 

W RCSB PDB 

® PDBj 


Entry Method Resolution Chain Positions PDBsum 

(A) 

1ST3 X-ray 1.40 A 2-269 [»] 


ProteinModelPortal 


P29599. 


SMR 


P29599. Positions 1-269. 


Mod Base 


Search... 


Protein family/group databases 




MEROPS 


1 S08.003. 


Enzyme and pathway databases 




BRENDA 


3.4.21.62. 18569. 


Family and domain databases 




InterPro ! IPR000209. Peptidase S8/S53. 

j IPR022398. Peptidase_S8/S53_AS. 
1 IPR015500. Peptidase_S8_subtilisin-rel. 
1 [Graphical view] 


GeneSD ; G3DSA:3.40.50.200. Pept_S8_S53. 1 hit. 


PANTHER 


PTHR10795. SubtilSerProt. 1 hit. 


Pfam 


PF00082. Peptidase_S8. 1 hit. 
[Graphical view] 


PRINTS 


PR00723, SUBTILISIN, 


SUPFAM 


SSF52743. Pept_S8_S53. 1 hit. 


PROSITE 


PS00136. SUBTILASE ASP. 1 hit. 
PS00137. SUBTILASE HIS. 1 hit. 
PS00138. SUBTILASE_SER. 1 hit. 
[Graphical view] 


ProtoNet 


Search... 


Entry information 




Entry name 


SUBB BACLE 


Accession 


Primary (citable) accession number: P29599 


Entry history 


Integrated into April 1, 1993 

UniProtKB/Swiss- 

Prot: 

Last sequence April 1, 1993 
update: 

Last modified: January 1 1 . 201 1 

This is version 72 of the entry and version 1 of the sequence. 
[Complete history] 


Entry status 


Reviewed (UniProtKB/Swiss-Prot) 


Annotation program 


Prokaryotic Protein Annotation Program 
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P12544(GRAA_HUMAN)# Reviewed. UniProtKB/Swiss-Prot 

Last modified April 5, 2011. Version 131. 



Names and origin 


Protein names 


Recommended name: 
Granzyme A 

EC=3.4.21.78 

j CTL tryptase 

1 Cytotoxic T-lymphocyte proteinase 1 
j Fragmentin-1 
1 Granzyme-1 
Hanukkah factor 

Short name=H factor 
Short name=HF 


Gene names 


Name: GZMA 
Synonyms:CTLA3. HFSP 


Organism 


1 Homo sapiens (Human) [Complete proteome] 


Taxonomic identifier 


9606 [NCBI] 


Taxonomic lineage 


Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > 
Euteleostomi > Mammalia > Eutheria > Euarchontoglires > 
Pnmates > Haplorrhmi > Catarrhini > Hominidae > Homo 


Protein attributes 




Sequence length 


262 AA. 


Sequence status 


Complete. 


Sequence processing 


The displayed sequence is further processed into a mature 
form. 


Protein existence j 


Evidence at protein level. 


General annotation (Comments) 


Function | 

1 

1 

1 


This enzyme is necessary for target cell lysis in cell-mediated 
immune responses. It cleaves after Lys or Arg. Cleaves APEX1 
after •Lys-31' and destroys its oxidative repair activity. Involved 

in apopto<5i<5 1 Ref.10 i 


Catalytic activity { 

f 

f 


Hydrolysis of proteins, including fibronectin, type IV collagen 
and nucleolin. Preferential cleavage: -Arg-|-Xaa-, -Lys-|-Xaa- 
» -Phe-|-Xaa- in small molecule substrates. 


Subunit structure 


Homodimer; disulfide-linked. Interacts with APEX1 . f^e^ io 


Subcellular location 


Isoform alpha: Secreted. Cytoplasmic granule. 


Induction 1 

1 


Dexamethasone (DEX) induces expression of isoform beta and 
represses expression of isofonn alpha. The alteration in 
expression is mediated by binding of glucocorticoid receptor to 
independent promoters adjacent to the alternative first exons of 
isoform alpha and isoform beta. 1 ^Q^-^ 1 
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Protein 

Translation$ of Life 



Display Settings: GenPept 

Hanukah factor serine protease precursor [Homo sapiens] 

GenBank: AAA52647.1 
PASTA Graphics 



Goto: 
LOCUS 

DEFINITION 

ACCESSION 

VERSION 

DBSOURCB 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AOTHORS 
TITLE 

JOURNAL 
PUBMED 
COMMENT 

» 

FEATURES 

source 



Protein 

sig peptide 
mat, peptide 
Region 



Site 



Site 



Site 



CDS 



ORIGIN 



AAA52647 262 aa linear PRI 08-NOV-1994 

Hanukah factor serine protease precursor [Homo sapiens) . 

AAA52647 

AAA52647.1 01:306845 

locus HUMHFSP accession M18737 . 1 

• , • 

Homo sapiens (human) 

Hocnp sapiens 

Bukaryota; Metazoa; Chordata; Craniata; vertebrata; Buteleostomi; 
Mammalia; Eutheria; Buarchontoglires; Primates; Haplorrhini/ 
Catarrhini; Hominidae; Homo. 
1 (residues 1 to 262) 

Gershenfeld, H.K. , Her8hberger,R. J. , Shows«T.B. and Weissman, I.L. 
Cloning and chromosomal assignment of a human cDNA encoding a T 
cell- and natural killer cell-specific trypsin-like serine protease 
Proc. Natl. Acad. Sci . U.S.A. 85 (4), 1184-1188 (1988) 
32S7S74 

On Jul 26, 1993 this sequence version replaced gi: 184023. 
Method: conceptual translation. 

Location/Qualifiers 

1. .262 

/organisms "Homo sapiens" 
/ db_xr e f = " t axon ; 9606 " 
/map"*S" 
1. .262 

/names "Hanukah factor serine protease precursor" 
1. .28 

/not e= "Hanukah factor serine protease signal peptide" 
29. .262 

/product="Hanukah factor serine protease" 
29. .254 

/ r eg i on_name = " Tryp_SPc " 

/notes "Trypsin- like serine protease; Many of these are 
synthesized as inactive precursor zymogens that are 
cleaved during limited proteolysis to generate their 
active forms . Alignment contains also inactive enzymes 
that have substitutions of the catalytic...; cd00190" 
/ db_xre f »" CDD : 29152 • 
29 

/site_type=" cleavage" 
/ db_xre f CDD : 29152 ■ 
order (69, 114,212) 
/si te_type= "active" 
/db_xref- "CDD: 29152 " 
order (206, 227, 229) 
/site_typeo "other " 
/note= "substrate binding sites" 
/ db_xre f - " CDD : 29X52 " 
1. .262 

/gene="GJAlPl" 
/coded__by-"M18737. 1:1. .789" 
/db_xref-"GDB:G00-125-920" 



// 



1 mrnsyrflas slsvwslll ipedvcekii ggnevtphsr pymvllsldr kticagalia 
61 kdwvltaahc nlnkrsqvil gahsitreep tkqimlvkke fpypcyc^at regdlkllql 
121 tekakinkyv tilhlpkkgd dvkpgtmcqv agwgrthnsa swsdtlrevn itiidrkvcn 
181 drnhynfnpv igmnmvcags Irggrdscng dsgspllceg vfrgvtsfgl enkcgc^rgp 
241 gvyillskkh Inwiimtikg av ' 
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Protein 

Translations of Life 

Display Settings: GenPept 

granzyme A [Homo sapiens] 

GenBank: AAO00009.1 
PASTA Gfaphfcs 



SSLiSl 
LOCUS 

DEFINITION 
ACCESSION 
VERSION 
DBSOURCE 
KEYWORDS 
SOURCE 

ORGANISM 



linear PRI 25-MAR-1999 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 
FEATURES 

source 



Protein 



CDS 



ORIGIN 



// 



AAD00009 23 aa 

granzyme A [Homo sapiens] . 
AAD00009 

AAD00009.1 01:4096786 
locus HSU40006 accession U40006.1 

Homo sapiens (human) 

Homo sapiens 

Bukaryota; Metazoa; Chorciata; Craniata; Vertebrata; Buteleostomi; 
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; 
Catarrhini; Hominidae; Homo. 

1 (residues 1 to 23) 
Gorals)cl,T. J. and Krensky,A.M. 

The upstream region of the human granzyme A locus contains both 
positive and negative transcriptional regulatory elements 
J. Immunol. (1995) In press 

2 (residues 1 to 23) 
Goral8)ci, T. J. and Krensky^A.H. 
Direct Submission 

Submitted (03-NOV- 1995) T.J. Goralski, Pediatrics, Stanford 
CAilversity School of Medicine, Stanford, CA 94305, USA 
Method: conceptual translation. 
Location/Qualifiers 
1. .23 

/organism^ "Homo sapiens" 
/db_xr e f = " taxon : 9606 " 
1. .23 

/products "granzyme A" 
1..23 

/coded_by="U40006. 1:1541. .>1610" 
/note>" serine protease** 

1 mrnsyrflas slsvwslll ipe 
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P1 1032 (GRAA_MOUSE) # Reviewed, UniProtKB/Swiss-Prot 
Last modified IVIarch 8, 2011. Version 119. 



Names and origin 


Protein names 


Recommended name: 

EC=3.4.21.78 
Alternative name(s): 
Autocrine thymic lymphoma granzyme-like serine protease 
CTLA-3 
Fragmentin-1 

T cell-specific serine protease 1 

Short name=TSP-1 


Gene names 


Name: Gzma 
Synonyms:Ct[a-3, CtlaS, Mtsp-1 

,,. ___________ - ■ ■■ - — — 


Organism 


Mus musculus (Mouse) 


Taxonomic identifier 


10090 [NCBI] 


Taxonomic lineage | Eukaryota > Metazoa > Chordata > Craniata > Vertebrate > 

j Euteleostomi > Mammalia > Eutheria > Euarchontoglires > Glires 
i > Rodentia > Sciurognathi > Muroidea > Muridae > Murinae > Mus 
1 > Mus 


Protein attributes 




Sequence length 


260 AA. 


Sequence status 


Complete. 


Sequence processing 

■ 


The displayed sequence is further processed into a mature 
form. 


Protein existence 


Evidence at protein level. 


General annotation (Comments) 


Function 


This enzyme is necessary for target cell lysis in cell-mediated 
immune responses. It cleaves after Lys or Arg. Cleaves APEX1 
after 'Lys-30' and destroys its oxidative repair activity. Involved 

in apoptOSiS 1 similarity 1 


Catalytic activity 

, 1 


Hydrolysis of proteins, including fibronectin, type IV collagen 
and nucleolin. Preferential cleavage: -Arg-|-Xaa-, -Lys-|-Xaa- 
» -Phe-|-Xaa- in small molecule substrates. 


Subunit structure 


Intf^rartRwith APFX1 1 By similarity | Homodimer; disulfide-linked. 


Subcellular location 

t 


Secreted. Cytoplasmic granule. Note: Cytoplasmic granules of 
cytolytic T-lymphocytes. 


Tissue specificity 


Found in cytotoxic lymphocytes and in normal lymphoid tissues 
such as thymus and spleen. 


Sequence similarities 


Belongs to the peptidase S1 family. Granzyme subfamily. 
Contains 1 peptidase S1 domain. 



http://www.uniprot.org/uniprot/Pl 1032 



4/12/2011 



Granzyme A precursor - M^piusculus (Mouse) 



Page 2 of 8 



Caution 



The predicted cleavage site for the activation peptide of HF2 is 
uncertain. It could have either 2 (ER) or 7 (KRGGCER) AA. 



Ontologies 



Keywords 



Biological process 


Apoptosis 
Lrytoiysis 


Cellular component 


Secreted 


Coding sequence diversity 


Alternative splicing 


Domain 


Signal 


IVIolecular function 


Hydrolase 
Protease 
Serine protease 


PTM 


Disulfide bond 

Glycoprotein 

Zymogen 


Technical term 


Direct protein sequencing 


Gene Ontology (GO) 




Biological process 


cleavage of lamin 

Inferred from sequence or structural similarity. Source: 
UniProtKB 

cytolysis 

Inferred from electronic annotation. Source: UniProtKB-KW 

negative regulation of DNA binding 

Inferred from sequence or structural similarity. Source: 

UniProtKB 

negative regulation of endodeoxyribonuclease activity 
Inferred from sequence or structural similarity. Source: 
UniProtKB 

negative regulation of oxidoreductase activity 
inferred from seauence or structural similaritv. Source: 
UniProtKB 

positive regulation of apoptosis 

Inferred from sequence or structural similarity. Source: 
UniProtKB 


Cellular component 


extracellular region 

Inferred from electronic annotation. Source: UniProtKB-SubCell 


Molecular function 


protein homodimerization activity 

Inferred from sequence or structural similarity. Source: 

UniProtKB ' 

serine-type endopeptidase activity 

Inferred from sequence or structural similarity. Source: 



t UniProtKB 



Complete GO annotation. 
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Alternative products 



This entry describes 2 isoforms produced by alternative splicing. [Align] [Select] 
Isoform HF1 (identifier: P1 1032-1) 

This isoform has been chosen as the 'canonical' sequence. All positional infomiation in this entry 
refers to it. This is also the sequence that appears in the downloadable versions of the entry. 

Note: More abundant in lymphoid tissues than isoform HF2. 



Isoform HF2 (identifier: P1 1032-2) 

The sequence of this isoform differs from the canonical sequence as follows: 
1-23: MRNASGPRGPSLATLLFLLLIPE — MSKEMNEILLSWEINLSSKR 



Sequence annotation (Features) 



Feature key 



Molecule processing 



Posttion(s) Length Description 



I Graphical view 





Signal peptide 


1 -26 


26 




1 

u u 


o 


Propeptide 

! 


27 - 28 


2 


Activation peptide 


' ! 


. . — 1 

1^1 


■ ' — 1 

Chain 


29 ~ 260 


232 


Granzyme A | 1 \ 

< 1 



Regions 



m \ Domain 



I 29 - 257 



229 



Peptidase S1 



Sites 

\ Active site 




69 i 



^ I Charge rela y system 

By similarity I 



^ i Charge rela y system 

I By similarity 1 



Active site 



211 



1 j Charge rela y system 

I By stm}iarity~| 



Amino acid modifications 
Glycosylation 



□ 
□ 



Glycosylation 
Disulfide bond 



Disulfide bond 



Disulfide bond 



54 



157 
169 
70 



147.^217 



Disulfide bond 



178 
207 



196 



232 



1 
1 



I N-linked (GlcNAc.) I Potential 

HsJ-linked (GlcNAc.) I Poten'tiai 



J. 



By similarity 



II 



By similarity | 



By similarity I 



By similarity I 



1 1 I 
I I 
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Natural variations 



Alternative sequence 



1 -23 



23 



I 



MRNAS...LLIPE — 
MSKEMNEILLSWEINLSSKR 

in isoform HF2. 



Sequences 



Sequence 



Length 



Mass (Da) 



Isoform HF1 [UniParc]. 

Last modified July 1, 1993. Version 2. 
Checksum: 7705352F08DEED8D 



PASTA 



260 



28.599 



10 

MRNA.SGPRGP 



20 

SLATLLFLLL 



30 

IPEGGCERII 



40 

GGDTWPHSR 



50 

PYMALLKLSS 



SO 

NTICAGAIilE 



70 80 90 100 110 120 

KNWVLTAAHC NVGKRSKFIL GAHSINKEPE QQILTVKKAF PYPCYDEYTR EGDLQLVRLK 



130 

KKATVNRNVA 

190 

EKHYNFHPVI 



140 

ILHLPKKGDD 

200 

GIjNMJCAGDL 



150 160 170 

VKPGTRCRVA GWGRFGNKSA PSETLREVNI 



210 

RGGKDSCNGD 



220 

SGSPLIiCDGI 



230 

LRGITSFGGE 



180 

TVIDRKICND 
240 

KCGDRRWPGV 



250 260 
YTFLSDKHLN WIKKIMKGSV 



« Hide 



Isoform HF2. 

Checksum: 7D67F97E532F7440 
Show » 



FASTA 



267 



28.541 



References 



« Hide large scale' references 
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Cited for : NUCLEOTIDE SEQUENCE [GENOMIC DNA] (ISOFORMS HF1 AND HF2). 



http://www.uniprot.Org/uniprot/P 1 1 032 



4/12/2011 



Granzyme A precursor - Mu^iiusculus (Mouse) 



• 



Page 5 of 8 



[4] } "The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian 
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lymphocytes upon T cell receptor stimulation." 

Jenne D.E.. Tschopp J. 

Immunol. Rev. 103:53-71(1988) [PubMed: 3292396] [Abstract] 
Cited for : NUCLEOTIDE SEQUENCE [MRNA] OF 1-37. 
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[7] 
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"Cloning of a cDNA for a T cell-specific serine protease from a cytotoxic T lymphocyte. 

Gershenfeld H.K., Weissman I.L. 

Science 232:854-858(1986) [PubMed: 2422755] [Abstract] 

Cited for : NUCLEOTIDE SEQUENCE [MRNA] OF 12-260. 

"A family of serine esterases in lytic granules of cytolytic T lymphocytes." 

Masson D., Tschopp J. 

Cell 49:679-685(1987) [PubMed: 3555842] [Abstract] 
Cited for: PROTEIN SEQUENCE OF 29-48, 



— 1- 
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"Identification of granzyme A isolated from cytotoxic T*lymphocyte-granules as one of 
the proteases encoded by CTL-specific genes." 

j Masson D., Zamai M., Tschopp J. 

i FEBS Lett. 208:84-88(1986) [PubMed: 3533635] [Abstract] 

I Cited for : PROTEIN SEQUENCE OF 29-53. 

"Induction of T cell serine proteinase 1 (TSP-1 )-specific mRNA in mouse T 
lymphocytes." 

Simon H.G., Fruth U.. Eckerskorn C, Lottspeich F., Kramer M.D.. Nerz G,. Simon M.M. 
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Sequence databases 




@EMBL 
® GenBank 
©DDBJ 


X14799 mRNA. Translation: CAA32905.1. 
X62542MX60311 Genomic DNA. Translation: CAA44426.1. 
L01429liii L01441 Genomic DNA. Translation: AAA99898.1. 
L01429 W L01441 Genomic DNA. Translation: AAA99897.1. 
BC061146 mRNA. Translation: AAH61 146.1. 
M26183 mRNA. Translation: AAA37735.1. 
Ml 3226 mRNA. Translation: AAA40134.1. 


IPI 


IPI001 11385. 
IPI00229002. 


PIR 


A24807. 
A45061. 
B45061. 


RefSeq 


NP_034500.1. NM_01 0370.2. 


UniGene 

* 


Mm. 15510. 


3D structure databases 




ProteinModelPortal 


P1 1032. 
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SMR 


P11032. Positions 29-258, 


Mod Base 


Search... 


Protein-protein interaction databases 


STRING 


j P11032. 


Protein family/group databases 




MEROPS 


1 S01.135. 






PRIDE 


P11032. 


Genome annotation databases 




EnsembI 


ENSMUST00000023897; ENSMUSP00000023897; 

1 ENolvlUoC3000u00231o2. 
ENSMUST000001 09243; ENSIVIUSP000001 04866; 
^NoivlUoGu0u0002o132. 


GenelD 


14938. 


KEGG 


mmu: 14938. 


UCSC 


1 uc007rxc. 1 . mouse. 


v/ryanisni-speciTic oaiauases 




GTD 


14938. 


MGt 


MGI: 109266. Gzma. 


Knyiogenomic aataDases 




eggNUG 


roNOG1 1241. 


GeneTree 


ENSGT00580000081 367. 


HOGENOM 


HBG765338. 




nbo01oo04. 


InParanoid 


P1 1032. 


DMA 


KEHDIML 


OrthoDB 


EOG4SXNDS. 


PhylomeDB 


P11032. 


cnzyrne ana painway uaiaoases 




BRENDA 


3.4.21.78.244. 


Gene expression databases 




ArrayExpress 


P1 1032. 


Bgee 


P11032. 


CleanEx 


MM_GZMA. 


Genevestigator 


P1 1032. 
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GermOnline { ENSMUSG00000023132. Mus musculus. 



Family and domain databases 



InterPro 


IPR009003 Peot cvs/ser TrvDsin-like 
IPR01 81 14. Peptidase 81/86 AS. 
IPR001254. Peptidase S1 S6. 
IPR001314. Peptidase_S1A. 
[Graphical view] 


Pfam 


PF00089. Trypsin. 1 hit. 
[Graphical view] 


PRINTS 


PR00722. GHYMOTRYPSIN. 


SMART 


SM00020. Tryp^SPc. 1 hit. 
[Graphical view] 


SUPFAM 


SSF50494. Pept_Ser_Gys. 1 hit. 


PROSITE 


PS50240. TRYPSIN DOM. 1 hit. 
PS00134. TRYPSIN HIS. 1 hit. 
PS00135. TRYPSIN^SER. 1 hit. 
[Graphical view] 


ProtoNet 


Search... 



Other Resources 

NextBio I 287259. 

SOURCE I Search... 



Entry information 



Entry name 


GRAA^MOUSE 


Accession 


Primary (citable) accession number: P11032 
Secondary accession number(s): P151 18 


Entry history 


Integrated into July 1. 1989 

UniProtKB/Swiss- 

Prot: 

Last sequence July 1 , 1 993 
update: 

Last modified: March 8. 20 11 

This is version 1 19 of the entry and version 2 of the sequence. 
[Complete history] 


Entry status 


Reviewed (UniProtKB/Swiss-Prot) 


Annotation program 


Chordata Protein Annotation Program 



Relevant documents 



MGD cross-references 

Mouse Genome Database (MGD) cross-references in UniProtKB/Swiss-Prot 
Peptidase families 

Classification of peptidase families and list of entries 

SIMILARITY comments 

Index of protein domains and families 
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P10144(GRAB_HUMAN)* Reviewed. UniProtKB/Swiss-Prot 

Last modified April 5. 201 1 . Version 1 32. 



Names and origin 


Protein names 


Recommended name: 

Granzyme B 
EC=3.4.21.79 
Alternative name(s): 

C11 

CTLA-1 

Cathepsin G-like 1 

Short name=CTSGL1 
oyioioxic 1 -lympnocyie proieinase z 

Short name=Lymphocyte protease 

Fragmentin-2 
Granzyme-2 

Human lymphocyte protein 

Short name=HLP 

SECT 

T-cell serine protease 1 -3E 


Gene names 


Name: GZMB 

Synonyms:CGL1, CSPB. CTLA1, GRB 


Organism 


Homo sapiens (Human) [Complete proteome] 


Taxonomic identifier 


9606 [NCBI] 


Taxonomic lineage 


Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > 

Euteleostomi > Mammalia > Eutheria > Euarchontoglires > 
Primates > Haplorrhini > Catarrhini > Hominidae > Homo 


Protein attributes 




Sequence length i 247 AA. 


Sequence status 


Complete. 


Sequence processing 


The displayed sequence is further processed into a mature 
form. 


Protein existence 


Evidence at protein level. 


General annotation (Comments) 


Function " 


This enzyme is necessary for target cell lysis in cell-mediated 
immune responses. It cleaves after Asp. Seems to be linked to 
an activation cascade of caspases (aspartate-specific cysteine 
proteases) responsible for apoptosis execution. Cleaves 
caspase-3, -7, -9 and 10 to give rise to active enzymes 
mediating apoptosis. 


Catalytic activity 


Preferential cleavage: -Asp-|-Xaa- » -Asn-|-Xaa- > -Met-|-Xaa- 


Enzyme regulation 


Inactivated by the serine protease inhibitor 
diisopropylfluorophosphate. 1 Ref.12 | 
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Subcellular location 



Cytoplasmic granule. Note: Cytopla smic gra nules of cytolytic T- 
lymphocytes and natural killer cells. I | 



Induction 



Sequence similarities 



By staphylo coccal e nterotoxin A (SEA) in peripheral blood 
leukocytes. I i 



Belongs to the peptidase S1 family. Granzyme subfamily, 
j Contains 1 peptidase S1 domain. 



Ontologies 



Biological process 


Apoptosis 

Cytolysis 




Coding sequence diversity 
Domain 


Polymorphism 
Signal 




Molecular function 


Hydrolase 
Protease 
Serine protease 


PTM 


Disulfide bond 

Glycoprotein 
Zymogen 


Technical term 


3D-structure 
Complete proteome 
Direct protein sequencing 



Gene Ontology (GO) 

Biological process 



! 



activation of pro-apoptotic gene products 
Traceable author statement. Source: Reactome 

cleavage of lam in 

Inferred from direct assay. Source: UniProtKB 
cytolysis 

Inferred from electronic annotation. Source: UniProtKB-KW 

induction of apoptosis by intracellular signals 
Traceable author statement. Source: Reactome 



Cellular component 



cytosol 

Inferred from experiment. Source: Reactome 
immunological synapse 

Traceable author statement. Source: UniProtKB 
nucleus 

Traceable author statement. Source: UniProtKB 



Molecular function 



protein binding 

Inferred from physical interaction. Source: UniProtKB 

serine-type endopeptidase activity 

Traceable author statement. Source: UniProtKB 



Complete GO annotation. 
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Binary interactions 



With 


Entry 


#Exp. 


IntAct 


Notes 


PRF1 


P 14222 


2 


EBI-2505785,EBI-724466 




SRGN 


PI 01 24 


1 


EBI-2505785,EBI-74491 5 





Sequence annotation (Features) 



I Feature key 



Position(s) Length Description 



Graphical view 



Featu 



re 



Mol( 


scule processing 

Signal peptide 


1 - 18 


18 


Ref. 1 4 


1 






□ 


Propeptide 


19-20 


2 


Activation peptide 


1 


PRO. 
PRO. 


.0' 




Chain 


21 -247 


227 


Granzyme B 


1 


hoi 



Regions 



! 



Domain 



21 - 245 



225 



Peptidase SI 



Sites 





1 Active site 


64 


1 


Charge relay system 


1 1 




n 


Active site 


108 


1 


Charge relay system 


1 ' 1 




Ami 

o 


Active site 

no acid modificatio 

Glycosylation 


203 

ns 

■ 71 


1 
1 


Charge relay system 
N-linked (GlcNAc.) 


1 

1 








Glycosylatlon 


104 


1 


N-linked (GlcNAc.) 

1 Potential j 


1 




□ 


Disulfide bond 


49*- 65 






1 1 








Disulfide bond 


142^209 






1 1 






□ 


Disulfide bond 


173 188 






II 







Natural variations 



□ 



□ 



Natural variant 



Natural variant 



55 




1 



1 



R-> Q. 

[dbSNP:rs8192917] 

I Ref.l irRefXH Ref.4l 



VAR 



I Ref.5 \ fRef-S \ j Ref.8 



I Ref. 17 



[dbSNP:rs1 1 539752] 

t I Ref.6 i 1 Ref.2 j 




0' 
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Natural variant 



247 



1 



Y-> H. 

[dbSN P:rs223 6338] 

I Ref.sll Ref.17 { 



VAR 



0- 



Experimental info 



□ 



Sequence conflict 



32-33 



Sequence conflict 



1 



72 



Sequence conflict 



212 



RP PR AA 



sequence I Re^ ^^ 



V — Gin 

AAA52118. I 3 I 



V ^ C in 
AAB59528. [M± 



Secondary structure 



I Helix I Strand | Turn 



Details. 



Sequences 



Sequence 



Length Mass (Da) 



P10144 [UniParc]. 

Last modified January 1 1 , 201 1 . Version 2. 
Checksum: C652271918EF24F9 



FASTA 



247 



27.716 



10 20 30 40 50 60 

MQPILIiLIiAF liLIiPRADAGE IIGGHEAKPH SRPYMAYLMI WDQKSLKRCG GFLIRDDFVL 

7£ .8^ 90 10£ 110 120^ 

j TAAHCWGSSI NVTIiGAHNIK EQEPTQQFIP VKRPIPHPAY NPKNFSNDIM LLQLERKAKR 

t 

■ 130 140 150 16£ 170 180^ 

I TRAVQPLRLP SNKAQVKP6Q TCSVAGV«3QT APIiGKHSHTL QEVKMTVQED RKCESDI^RHY 

i 

190 200 210 220 230 24£ 

YDSTIEIiCVG DPEIKKTSFK GDSGGPLVCN KVAQGIVSYG RNNGMPPRAC TKVSSFVHWI 



KKTMKRY 



« Hide 



References 



« Hide 'large scale' references 



[1] "Induction of mRNA for a serine protease and a beta-thromboglobulin-like protein in 
mitogen-stimulated human leukocytes." 
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J. Immunol. 139:250-256(1987) [PubMed: 2953813] [Abstract] 
Cited for : NUCLEOTIDE SEQUENCE [MRNA], VARIANT GLN-55. 



[2] "Structure and differential mechanisms of regulation of expression of a serine esterase 
gene in activated human T lymphocytes." 
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Additional computationally mapped references. 
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Sequence databases 



@EMBL 
® GenBank 
® DDBJ 



M 1701 6 mRNA. Translation: AAA36627.1. 
J03189 mRNA. Translation: AAA36603.1. 
J04071 mRNA. Translation: AAA52118.1. 
J03072 Genomic DNA. Translation: AAB59528.1. 
M38193 Genomic DNA. Translation: AAA67124.1. 
M28879 Genomic DNA. Translation: AAA75490.1. 
AL1 36018 Genomic DNA. No translation available. 
BC030195 mRNA. Translation: AAH30195.1. 




IPI00744785. 



A61021. 



t 

I 



NP 004122.2. NM 004131.4. 



UnlGene 



Hs.1051 
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3D structure databases 



#PDBe 

® RCSB PDB 

® PDBj 


1 Entry Method Resolution Chain Positions PDBsum 

(A) 

1FQ3 X-ray 3,10 A/B 21-247 [»] 
1 1IAU X-ray 2.00 A 21-247 [»] 


ProteinModelPortal 


P10144, 


Mod Base 


Search... 


Protein-protein interaction databases 


IntAct 


j P10144. 5 interactions. 


MINT 


1 MINT-4528791. 


STRING 


1 P10144. 


Protein family/group databases 




MEROPS 


1 S01.010. 


Proteomic databases 




PRIDE 


P10144. 


Genome annotation databases 




EnsembI | ENST00000216341; ENSP00000216341; ENSG000001 00453. 

1 ENST00000382542; ENSP00000371982: ENSG000001 00453. 
1 ENST00000415355; ENSP00000387385; ENSG000001 00453. 


GenelD 


3002. 


UCSC 


ucOOIwps.1. human. 


Organism-specific databases 




GeneCards 


GC14M005214. 


H-lnvDB 


H 1X00 11 578. 


HGNC 


HGNC:4709. GZMB. 


HPA 1 


CAB000376. 
HPA003418. 


MIM 1 


123910. gene. 


neXtProt 


NX_P10144, 


GenAtlas 


Search... 


Phylogenomic databases 




eggNOG 


prNOG07952. 


HOVERGEN 


HBG013304. 


InParanoid 


PI 01 44. 


OrthoDB 


E0G4KWJTW, 
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Enzyme and pathway databases 



BRENDA 


3.4.21.79. 247. 


Pathway J nteraction_DB 


caspase_pathway. Caspase cascade in apoptosis. 
cdStcrdownstreampathway. Downstream signaling in naive 
CD8+ T cells. 

il12_2pathway. IL12«mediated signaling events. 


Reactome 

Gene expression databases 

ArrayExpress 


REACT_578. Apoptosis. 
P10144. 


Bgee 


P10144. 


CleanEx 


i HS G2MB, 


Genevestigator P10144. 


GermOnline 

ramiiy ana aomain aaiaoases 

InterPro 


1 ENSG000001 00453. Homo sapiens. 

• 

IPR009003. Pept cys/ser Trypsin-like. 
IPR0181 14. Peptidase S1/S6 AS. 
IPR001 254. Peptidase SI S6. 
IPR001314. Peptidase_S1A. 
[Graphical view] 


Pfam 


PF00089. Trypsin. 1 hit. 

rCnrsnhicfil \/i^w1 


PRINTS 


PR00722. CHYMOTRYPSIN. 


SMART 


SM00020. Tryp_SPc. 1 hit. 
[Graphical view] 


SUPFAM 


SSF50494. Pept_Ser_Cys. 1 hit. 


PROSITE 


PS50240. TRYPSIN DOM. 1 hit. 
PS001 34. TRYPSIN HIS. 1 hit. 
PS00135. TRYPSIN^SER. 1 hit. 
[Graphical view] 


ProtoNet 

Other Resources 

NextBio ! 


Search... 
11904. 


PMAP-CutDB 


P10144. 


SOURCE 

Entry information 


Search... 


Entry name 


GRAB_HUMAN 


Accession 


Primary (citable) accession number: P10144 
Secondary accession number(s): Q8N1D2, Q9UCC1 


Entry history 


Integrated into July 1, 1989 

UniProtKB/Swiss- 

Prot: 



Last sequence 
update: 



January 11. 2011 
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Entry status 


Last modiTiea: Apnl 5» 201 1 

This is version 132 of the entry and version 2 of the sequence. 

[Complete history] 


Reviewed (UniProtKB/Swiss-Prot) 


Annotation program 


Chordata Protein Annotation Program 


Disclaimer 


Any medical or genetic information present in this entry is 
provided for research, educational and informational purposes 
only. It is not in any way intended to be used as a substitute for 
professional medical advice, diagnosis, treatment or care. 



Relevant documents 



Human chromosome 14 

Human chromosome 14: entries, gene names and cross-references to MIM 



Human entries with polymorphisms or disease mutations 

List of human entries with polymorphisms or disease mutations 

Human polymorphisms and disease mutations 

Index of human polymorphisms and disease mutations 



MIM cross-references 

Online Mendelian Inheritance in Man (MIM) cross-references in UniProtKB/Swiss-Prot 

PDB cross-references 

Index of Protein Data Bank (PDB) cross-references 

Peptidase families 

Classification of peptidase families and list of entries 



SIMILARITY comments 

Index of protein domains and families 
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Display Settings: GenPept 

serine esterase [Homo sapiens] 

GenBanIc AAA52118.1 
FASTA Graphics 



Goto: 
LOCUS 

DEFINITION 

ACCESSION 

VERSION 

DBSOURCB 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
FUBMED 
COMMENT 



FEATURES 

source 



Protein 
sig peptide 
mat peptide 
Region 



Site 



Site 



Site 



CDS 



ORIGIN 



AAA52118 24 7 aa linear PRI Ol-NOV-1994 

serine esterase [Homo sapiens] . 

AAA52118 

AAA52118.1 GI: 181137 

locus HUMCSB accession J04071 . 1 

Homo sapiens (human) 

Homo sapiens 

Bukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Butheria; Euarchontoglires; Primates; Haplorrhini; 
Catarrhini; Hominidae; Homo. 
1 (residues 1 to 247) 

Trapani, J.A. , Klein, J. L., White, P. C. and Dupont,B. 

Molecular cloning of an inducible serine esterase gene from human 

cytotoxic ly!i5>hocytes 

Proc. Natl. Acad. Sci . U.S.A. 85 (16), 6924-6928 (1988) 
3261871 

Draft entry and clean copy of seq^uence for [1] kindly provided by 
J.A.Trapani, 08 -SEP- 1988. 
Method: conceptual translation. 

Location/Qualifiers 

1. .247 

/organisms "Homo sapiens" 
/db_xre f - " t axon : 9606 " 
/map="14qll.2" 
1. .247 

/products«>serine esterase* 
1. .20 

/note-"MHC serine esterase signal peptide" 
21. .247 

/product-"MHC serine esterase" 

21. .243 

/region_name= "Tryp_SPc " 

/notea "Trypsin- like serine protease; Many of these are 
synthesized as inactive precursor zymogens that are 
cleaved during limited proteolysis to generate their 
active forms. Alignment contains also inactive enzymes 

that have substitutions of the catalytic ; cd00190'' 

/ db_xre f CDD : 29152 " 
21 

/si te_type» "Cleavage " 
/ db_xre f = CDD : 29152 " 
order (64, 108, 203) 
/si te_t ype n " ac t i ve " 
/db_xref="CDD: 29152 * 
order (197, 218, 220) 
/si te_type" "other" 
/note» "substrate binding sites" 
/db_xre f = " CDD : 29152 " 
1. ,247 

/gene««"CTLAl" 

/coded_by="J04 071. 1:17. .760" 
/db_xref="GDB: GOO- 120-744" 



// 



1 mqpillllaf lllpradage iiggheakph srpymaylmi wdqkslkrcg gfliqddfvl 
61 taahcwgssi ngtlgahnik eqeptqqfip vkrpiphpay np)cnfsndim llqlerkakr 
121 travqplrlp snkaqvkpgq tcsvagwgqt aplg}chshtl qevkmtvqed rkcesdlrhy 
181 ydstielcvg dtpeikktsfk gdsggplvcn kvaqgivsyg mngn^rac tkvssfvhwi 
241 kktmkry 
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linear VRJ 08-AUG-1995 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
PUBMED 
COMMENT 



FEATURES 

source 



ORIGIN 



AAB59528 247 aa 

serine protease B [Homo sapiens) . 
AAB59528 

AAB59528.1 GI:181159 
locus HUMCSPB accession JQ3072 . 1 

Homo sapiens (human) 

Homo sapiens 

Bukaryot:a; Metazoa; Chordata; Craniata; Vertebrata; Euteleo8t:omi; 
Matnmalia; Butheria; Euarchontoglires; Primates; Haplorrhini; 
Catarrhini; Hominidae; Homo. 
1 (residues l to 247) 

Klein, J. L., Shows, T.B., Dupont,B. and Trapani,J.A. 
Genomic organization and chromosomal assignment for a serine 
protease gene (CSPB) expressed by human cytotoxic lyiqphocytes 
Genomics 5 (1), 110-117 (1989) 

2788607 

Draft entry and computer -readable sequence for [1] Icindly provided 
by J.Klein, 29-MAR-1989. 

Aberrant splicing of the mRNA at position 1143 often takes place 
and produces a nonfunctional protein. 
Method: conceptual translation. 
Location/Qualifiers 
1. .247 

/organism="Homo sapiens" 
/db_xre f = " taxon ; 9606 " 
/map= " 14ql 1.2" 
/clones "lambda-6P2 6 . " 

1 . .247 

/names "serine protease B" 
21. .243 

/ r e g i on_name = " T ryp_S P c " 

/note= "Trypsin- like serine protease; Many of these are 
synthesized as inactive precursor zymogens that are 
cleaved during limited proteolysis to generate their 
active forms. Alignment contains also inactive enzymes 
that have substitutions of the catalytic...; cd00190" 
/db xref»"CDD; 29152 " 
21 

/site_type=" cleavage" 
/db_xre f = "CDD : 29152 " 
order(64, 108,203) 
/ si te_type= "active" 
/ db_xre f CDD ; 29152 " 
order (197, 218, 220) 
/si te_type= "other " 
/notes" Slabs t rate binding sites" 
/db_xre f = " CDD : 29152 " 
l.,247 

/gene»"CTLAl" 

/coded_byo "join ( J03072 . 1 : 233 . . 287 , J03072 . 1 : 1290 . . 1437 , 
J03 072 .1:1904 . . 203 9 , J03072 . 1 : 2247 . .2507, 
J03072 . 1 :3148 . .3291) " 
/db_xref = "GDB : GOO - 120-744 " 

1 mqpillllaf lllpradage iiggheakph srpymaylmi wdgkslkrcg gfliqddfvl 
61 taahcwgssi nvtlgahnik eqeptqqfip vkrpiphpay np)cnfsndim llqlerkakr 
121 travqplrlp snkaqvkpgq tcsvagwgqt aplgkhshtl qevlontvqed rkcesdlrhy 
181 ydstielcvg dpeikktsfk gdsggplvcn kcaqgivsyg rnngmpprac t)cvssfvhwi 
241 kktmkry 



Protein 
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Site 
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CDS 
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Display Settings; GenPept 

serine protease-like protein precursor [l-lomo sapiens] 

GenBank: AAA36627.1 
FASTA Graphics 



Goto: 
LOCUS 

DEFINITION 

ACCESSION 

VERSION 

DBSOURCE 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 



JOURNAL 
PUBMED 
COMMENT 
FEATURES 

source 



Method: 



Protein 
sig peptide 
mat peptide 

Region 



Site 



Site 



Site 



CDS 



ORIGIN 



AAA36627 247 aa linear PRI 03-AU6-1993 

serine protease -like protein precursor [Hotno sapiens] . 

AAA36627 

AAA36e27.1 GI:338296 

locus HUMS PI 3 B accession M17016 . I 

Homo sapiens (human) 

Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; 
Catarrhini; Hominidae; Homo. 
1 (residues 1 to 247) 
Schmid.J. and Weissmann.C. 

Induction o£ mRNA for a serine protease and a 

beta- thromboglobulin- like protein in mitogen- stimulated human 

leukocytes 

J, Immunol. 139 (1), 250-256 (1987) 

2953813 

conceptual translation. 
Location/Qualifiers 
1..247 

/organ isms "Homo sapiens" 
/db_xre f = " t axon : 9606 " 
1. .247 

/name =" serine protease -like protein precursor" 

1. .20 

/notes "serine protease -like protein signal peptide" 
21.. 247 

/products "serine protease -like protein* 
21. .243 

/ r eg i on_name - " Tryp_SPc " 

/note= "Trypsin- like serine protease; Many of these are 
synthesized as inactive precursor zymogens that are 
cleaved during limited proteolysis to generate their 
active forms. Alignment contains also inactive enzymes 
that have substitutions of the catalytic...; cd00190" 
/db_xr e f = " CDD : 291S2 " 
21 

/si te_type=" cleavage" 
/ db_xre f =" CDD : 29152 " 
order (64, 108,203) 
/ si te_type= "active" 
/db_xref=''CDD: 29152 " 
order (197. 218, 220} 
/ si te_type- "other" 
/note= "substrate binding sites" 
/ db_xre f =" CDD : 29152 " 
1. .247 

/coded_by»"M17016. 1:8. .751" 



// 



1 mqpillllaf lllpradage iiggheakph srpymaylmi wdqkslkrcg gfliqddfvl 

61 taahcwgssi nvtlgahnik eqeptqqfip vkrpiphpay npknfsndim llqlerkakr 

121 travqplrlp snkaqvkpgq tcsvagwgqt aplgkhshtl qevkmtvqed rkcesdlrhy 

181 ydstielcvg dpeikktsfk gdsggplvcn kvaqgivsyg rnngmpprac tkvssfvhwi 
241 kktmkry 
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cytotoxic T-lymphocyte^ssociated serine esterase 1 [Homo sapiens] 

GdnBank: AAA67124.1 
LOCUS 

DEFINITION 
ACCESSION 
VERSION 
DBSOtJRCE 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
PUBMED 

COMMENT 
FEATURES 

source 



Method: 



Protein 



Region 



Sit:e 



Site 



Site 



CDS 



ORIGIN 



AAA67124 247 aa linear PRI 23-MAY-1995 

cytotoxic T- lymphocyte -associated serine esterase 1 [Homo sapiens] . 
AAA67124 

AAA67124.1 01:306682 
locus HUMCTLAl accession M38193 . 1 

Homo sapiens (hiiman) 

Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

Matranalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; 

Catarrhini; Hominidae; Homo, 

1 (residues 1 to 247) 

Caputo,A., Sauer,D.E. and Rowe,P.B, 

Nucleotide sequence and genomic organization of a human T 

lymphocyte serine protease gene 

J. Immunol. 145 (2), 737-744 (1990) 

2365998 

conceptual translation. 
Location/Qualifiers 
1, .247 

/organisms "Homo sapiens" 
/ dt>_xre f = " taxon ; 960 6" 
/map="14qll.2" 
1. -247 

/products "cytotoxic T-lyiiQ>hocyte -associated serine 
esterase 1" 
21. .243 

/ region_name = " Tryp_SPc " 

/note= "Trypsin -like serine protease; Many of these are 
synthesized as inactive precursor zymogens that are 
cleaved during limited proteolysis to generate their 
active forms. Alignment contains also inactive enzymes 
that have substitutions of the catalytic...; cd00190" 
/db_xre f - " CDD : 29152 " 
21 

/sit e_t ype = " cl eavage " 
/db_xre f = • CDD : 29152 " 
order (64, 108,203) 
/si te_type»" active" 
/ db__xr e f = " CDD : 29152 " 
order (197,218, 220) 
/site_type= "other" 
/notes "substrate binding sites" 
/db_xre f = * CDD : 29152 
1. .247 

/gene= "CTLAl " 

/ coded__by = "join(M38193. 1:910 
M38193 .1:2611. . 2746 ,M38193 . 1 
M38193. 1:3856. .3999)" 
/db_xref » "GDB : GOO - 120- 744 » 



. 964, M38193 .1:2008 . .2155, 
2952. .3212, 



// 



1 mqpillllaf lllpradage iigghea)cph srpymaylmi wdqkslkrcg gfliqddfvl 

61 taahcwgssi nvtlgahnik eqeptqqfip vkrpiphpay npknfsndira llqlerkakr 

121 travqplrlp snkaqvkpgq tcsvagwgqt aplg)chshtl qevkmtvqed rkcesdlrhy 

181 ydstielcvg ^eikktsfk gdsggplvcn Icvaqgivsyg mngo^rac tkvssfvhwi 
241 kktmkry 
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P04187 (GRAB^MOUSE) # Reviewed. UniProtKB/Swiss-Prot 

Last modified March 8. 2011. Version 119. 



Names and origin 



Protein names 



Recommended name: 
Granzyme B(G,H) 
EC=3.4.21.79 

Alternative name(s): 
CTLA-1 

Cytotoxic cell protease 1 

Short name=CCP1 

Fragmentin-2 



Gene names 



j Name: Gzmb 

I Synonyms: Ctla-1, Ctlal 



Organism 

Taxonomic identifier 



Mus musculus (Mouse) 



t 



i 10090 [NCBI] 



Taxonomic lineage 



Eukaryota > Metazoa > Ghordata > Craniata > Vertebrata > 
Euteleostomi > Mammalia > Eutheria > Euarchontoglires > Glires 

> Rodentia > Sciurognathi > Muroidea > Muridae > Murinae > Mus 

> Mus 



Protein attributes 


Sequence length 


247 AA. 


Sequence status 


Complete. 


Sequence processing 


The displayed sequence is further processed into a mature 




form. 


Protein existence 


Evidence at protein level. 



General annotation (Comments) 



Function 



Catalytic activity 



I This enzyme is necessary for target cell lysis In cell-mediated 
\ immune responses. It cleaves after Asp. Seems to be linked to 
> an activation cascade of caspases (aspartate-specific cysteine 
proteases) responsible for apoptosis execution. Cleaves 
caspase-3, -7, -9 an d 10 to give rise to active enzymes 
mediating apoptosis I simnaruQ 

Preferential cleavage: -Asp-|-Xaa- » -Asn-|-Xaa- > -Met-|-Xaa- 
, -Ser-|-Xaa-. 



Subcellular location 



Sequence similarities 



Cytoplasmic granule. Note: Cytoplasmic granules of cytolytic T- 
lymphocytes and natural killer cells. 



Belongs to the peptidase S1 family. Granzyme subfamily. 
Contains 1 peptidase SI domain. 
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Ontologies 


Keywords 


• 


Biological process 


Apoptosis 
Cytolysis 


Domain 


Signal 


Molecular function 


Hydrolase 
Protease 
Serine protease 


PTM 


Disulfide bond 

Glycoprotein 

Zymogen 


Technical term 


3D-structure 

Direct protein sequencing 


Gene Ontology (GO) 




Biological process 


T cell mediated cytotoxicity 

Inferred from mutant phenotype. Source: MGI 

apoptosis 

Inferred from electronic annotation. Source: UniProtKB-KW 
cytolysis 

Inferred from electronic annotation. Source: UniProtKB-KW 

inHi if^tinn anonto^i^ hv/ nr;5n7\/mp 

Inferred from direct assay. Source: MGI 
proteolysis 

Inferred from electronic annotation. Source: InterPro 




cvtoolasm 

Inferred from direct assay. Source: MGI 


Molecular function 


protein binding 

Inferred from physical interaction. Source: MGI 

serine-type endopeptidase activity 

Inferred from electronic annotation. Source: InterPro 



Complete GO annotation... 



Sequence annotation (Features) 



Feature key i Position(s) ! Length j Description ! Graphical view 



Feature identifier 



MoU 


$cule processlni 
Signal peptide 


9 

1 -18 


18 




1 






Propeptide 


19-20 


2 


Activation 
peptide 


1 


PRO_0000027401 


HI 


Chain 


21 - 247 


227 


Granzyme 
B(G.H) 


1 


PRO_0000027402 
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Regions 




Domain 


Sites 


□ 


Active site 




Active site 


□ 


Active site 



21-245 



225 I Peptidase 
SI 



Amino acid modifications 



Secondary structure 
1 

I Helix I Strand | Tum 

Details... 

Sequences 



64 


1 


Charge 

relay 

system 


1 




j By similarity 


108 


1 


Charge 

relay 

system 

j By similarity j 


1 




j 203 

! 

1 

t 


1 


Charge 
relay 

system 


1 




1 By stmiiarity 





Glycosylation 


71 


* 

1 i N-linked 
1 (GlcNAc.) 

} Potential I 
i ^ ' 


1 1 

! 

r 1 
t i 


□ 


Glycosylation 


182 


1 1 N-linked 
1 (GlcNAc.) 

! j Potential j 

i 


>- — - - ■ ; ■■ — 

1 

f 

N 1 




Disulfide bond 


49 65 


■■■■ ! — ■■ ■ 

i By simiiarity 




Disulfide bond 


142 ^ 209 


1 By similarity | 


"ill 


□ 


Disulfide bond 


1 73 188 


~| By similarity 


II 1 



247 



Sequence 



Length Mass (Da) 



Pa4187[UniParc]. FASTA 247 27.470 

Last modified March 20, 1987. Version 1. 
Checksum: 996BCD1 99965C6D6 
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TAAHCEGSII NVTLGAHNIK EQEKTQQVIP MVKCIPHPDY NPKTFSNDIM LLKLKSKAKR 



http://www.uniprot.org/uniprot/P04 1 87 



4/12/2011 



Granzyme B(G,H) precursoj^Mus musculus (Mouse) Page 4 of 7 



I 130 14 0 150 

TRAVRPLNLP RRNVNVKPGD VCYVAGWGRM 

I 

190 200 21£ 

= YNKTNQICAG DPKTKRASFR GDSGGPLVCK 



160 I'VO 180 

APMGKYSNTL QEVELTVQKD RECESYFKNR 

220 230 24£ 

KVAAGIVSYG YKDGSPPRAF TKVSSFLSWI 



I KKTMKSS 

4 

j 

« Hide 
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« Hide 'large scale' references 


T A 1 
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Science 232:858-861(1986) [PubMed: 3518058] [Abstract] 
Cited for: NUCLEOTIDE SEQUENCE fMRNA]. 


[2] 


"Organization of two genes encoding cytotoxic T lymphocyte-specific serine proteases 
CCPI and CCPIL" 
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R.C. 
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Cited for: NUCLEOTIDE SEQUENCE fMRNAl. 


[4] 


The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian 
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Cited for: NUCLEOTIDE SEQUENCE OF 227-247. 
Strain: C57BL/6J. 
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"A family of serine esterases in lytic granules of cytolytic T lymphocytes." 

Masson D.. Tschopp J. 

Cell 49:679-685(1987) [PubMed: 3555842] [Abstract] 
Cited for: PROTEIN SEQUENCE OF 21-40. 
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"Comparative molecular model building of two serine proteinases from cytotoxic T 
lymphocytes." 

Murphy M.E.P.. Moult J., Bleackley R.C. Gershenfeld H.. Weissman LL, James M.N.G, 
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1 Cited for: 3D-STRUCTURE MODELING, 
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C ross -ref e re n ces 


Sequence databases 




@ EMBL 
® DDBJ 


X04072 mRNA. Translation: CAA27715.1. 
! M 12302 mRNA. Translation: AAA37383.1. 
M22526 Genomic DNA. Translation: AAB61 756.1. 
BC002085 mRNA. Translation: AAH02085.1. 
U05707 Genomic DNA. Translation: AAB60470.1. 


IPI 


IPI001 14066 


FIR 


PRMSCL. A94288. 


RefSeq 


i NP_038570.1. NM_01 3542.2. 


UniGene 


j Mm.14874. 


3D structure databases 




® RCSB PDB 
® PDBj 


tntry Metnoo K6soiution onain Kositions KUDSum 

(A) 

2CP1 model - A 21-247 [»] 


ProteinModelPortal 


P04187. 


SMR 


P04187. Positions 21-245. 


Mod Base 


Search... 


Protein-protem interaction databases 


DIP 


DIP-562N. 


STRING 


P04187. 


Protein family/group databases 




MEROPS 


S01.136. 


Proteomic databases 




PRIDE 


P04187. 


Genome annotation databases 




EnsembI 


ENSMUST0000001 6581 ; ENSMUSP0000001 5581 ; 
ENSMUSG0000001 5437. 


GenelD 


14939. 


KEGG 


mmu: 14939. 


UCSC 


uc007ubv.1. mouse. 


Organism-specific databases 




CTD 1 


14939. 


MGI ' 


MGI: 109267. Gzmb. 


Phylogenomic databases 




eggNOG 


roNOG09047. 
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HOGENOM 


j HBG755338. 


HOVERGEN 


1 HBG013304. 


InParanoid 


P04187. 


OMA 


YDSTVEL. 


OrthoDB 


EOG4KWJTW. 


PhylomeDB 


t 

P041 87. 


Enzyme and pathway databases 




BRENDA 


3.4.21.79. 244. 


Gene expression databases 




ArrayExpress 


P04187. 


Bgee 


P041 87. 


CleanEx 


MM^GZMB. 


Genevestigator 


P04187. 


GermOnline 


ENSMUSG00000015437. Mus musculus. 


Family and domain databases 




InterPro 


IPR009003. Pept_cys/ser_Trypsin-like. 

IPRnift114 Ppntirl;:ic;p AS 

IPR001254. Peptidase S1 S6. 
IPR001314. Peptldase„S1A, 
[Graphical view] 


Pfam 


PF00089. Trypsin. 1 hit. 


PRINTS 


PR00722. CHYMOTRYPSIN. 


SMART 


SM00020. Tryp_SPc. 1 hit. 
[Graphical view] 


SUPFAM 


SSF50494. Pept_Ser_Cys. 1 hit. 


PROSITE 


PS50240. TRYPSIN DOM. 1 hit. 
PS00134. TRYPSIN_HIS. 1 hit. 

[Graphical view] 


ProtoNet 


Search... 


Other Resources 




NextBio 


287263. 


SOURCE 

• 


Search... 


Entry information 




Entry name 


GRAB_MOUSE 


Accession 


Primary (citable) accession number: P04187 


Entry history 


Integrated into March 20, 1987 

UniProtKB/Swiss- 

Prot: 
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Last sequence March 20, 1 987 
update: 

Last modified: March 8, 201 1 

This is version 119 of the entry and version 1 of the sequence. 
[Complete history] 



Entry status 



Reviewed (UniProtKB/Swiss-Prot) 



Annotation program 



Chordata Protein Annotation Program 



Relevant documents 



MGD cross-references 

Mouse Genome Database (MGD) cross-references in UniProtKB/Swiss-Prot 

PDB cross-references 

Index of Protein Data Bank (PDB) cross-references 

Peptidase families 

Classification of peptidase families and list of entries 

SIMILARITY comments 

Index of protein domains and families 



©2002-2011 UniProt Consortium | License & Disclaimer | Contact 
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P51124(GRAM__HUMAN)lfc^ Reviewed, UniProtKB/Swiss-Prot 

Last modified April 5. 201 1. Version 96. 



Names and origin 


I lUiwIII IICIIIIwO 


Granzyme M 

EC=3.4.21.- 
Aitemative name(s): 
Met-1 serine protease 

i 

j Short name=Hu-Met-1 

1 Met-ase 

1 Natural killer cell granular protease 


Gene names 


Name: 6ZMM 
I Synonyms: MET1 


Organism 


1 Homo sapiens (Human) [Complete proteomel 


Taxonomic identifier 


9606 [NCBI] 


Taxonomic lineage 


Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > 
Euteleostomi > Mammalia > Eutheria > Euarchontoglires > 
Primates > Haplorrhini > Catarrhini > Hominidae > Homo 


Protein attributes 




Sequence length 


257 AA. 


Sequence status 


Complete. 


Sequence processing | The displayed sequence is further processed into a mature 

1 form. 


H — 

Protein existence 1 Evidence at protein level. 


General annotation (Comments) 


Function | 

! 
i 


Cleaves peptide substrates after methionine, leucine, and 
norleucine. Physiological substrates include EZR, alpha- 
tubulins and the apoptosis inhibitor BIRC5/Survivin. Promotes 
caspase activation and subsequent apoptosis of target cells. 


Subcellular location 


Secreted. Cytoplasmic granule. Note: Granules of large 
granular lymphocytes. 


Tissue specificity 


Highly and constitutiveiy expressed in activated natural killer 
(NK) cells. 


Sequence similarities 

f 


Belongs to the peptidase S1 family. Granzyme subfamily. 
Contains 1 peptidase S1 domain. 



Ontologies 



Keywords 
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Biotogical process 


, Apoptosis 
Cytolysis 
Immunity 
Innate immunity 


Cellular component 


Secreted 


Coding sequence diversity 


Polymorphism 


Domain 


Signal 


Molecular function 


Hydrolase 
Protease 
Serine protease 


PTM 


Disulfide bond 

Glycoprotein 

Zymogen 


Technical term 


3D-structure 
Complete proteome 


Gene Ontology (GO) 




Biological process 

• 

• 


apoptosis 

Inferred from electronic annotation. Source: UniProtKB-KW 
cytolysis 

liltdiCU 1 1 Ui 1 1 dC^i^LI L;i IIO dl II iL/icf LiDI i. OwUiOc?. wi 1 ir i wirxD'iX V v 

Innate immune response 

Inferred from electronic annotation. Source: UniProtKB-KW 
proteolysis 

Inferred from electronic annotation. Source: InterPro 


Cellular component 


extracellular region 

Inferred from electronic annotation. Source: UniProtKB-SubCell 


Molecular function 


serine-type endopeptidase activity 

Inferred from electronic annotation. Source: InterPro 



Complete GO annotation.,. 



Sequence annotation (Features) 



1 

Feature key ' Position(s) Length Description 



Graphical view 



j Feature ic^er 



Molecule processing 
I Signal peptide 



1-23 



23 i I Potential 



1 < 1 »■>» ■! II ■ 


Propeptide 


24-25 


2 


i Activation peptide 

j 1 Potential I 


t 
t 


1 


- — — - i 




Chain 


26 - 257 
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Sequence 

P51124 [UniParc]. 

Last modified May 18, 2010. Version 2. 
Checksum: B4E815CE455F7371 



Length Mass (Da) 

FASTA 257 27.546 



1^ 20 30 40 50 60 

! MEACVSSLLV LALGALSVGS SFGTQIIGGR EVIPHSRPYM ASLQRNGSHL CGGVLVHPKW 
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Di^>iay Settings: GenPspl 

metase [Homo sapiens] 

GenBank: AAA595d2.1 

PASTA Qaaiiiss. 
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VERSION 

DBSOURCE 

KEYWORDS 

SODRCE 

ORGANISM 



REFERENCE 
ADTHORS 
TITLE 

JOURNAX. 
PUBMED 
COMMENT 

FEATURES 

source 



Protein 
siq peptide 
mat j pep tide 

Region 



Site 



Site 



Site 



CDS 



AAA59582 257 aa linear PRI 07- JAN- 1995 

metase [Homo sapiens] . 

AAA59582 

AAA59582.1 GI:508988 

locus HUMMETIA accession L23134 . 1 

Homo sapiens (humeui) 

Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; vertebrata; Euteleostomi; 
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; 
Catarrhini; Hominidae; Homo. 
1 (residues 1 to 257} 

Smyth, M. J., SayerSfT.J., wiltrout,T., Powers, J. C. and Trapani.J.A. 
Met-ase: cloning and distinct chromosomal location of a serine 
protease preferentially expressed in humfm natural killer cells 
J. Immunol. 151 (11), 6195-6205 (1993) 
8245461 

On Jul 13, 1994 this sequence version replaced gi:438641. 
Method: conceptual translation. 

Location/Qualifiers 

1. .257 

/organisms* "Homo sapiens" 
/db_xre f t axon : 9606 " 

/cell_line-*' Human Lopez natural killer cell leukemia" 
1. .257 

/products "metase" 
1. .25 

/gene= "MET- 1 " 
26. .257 
/gene="MET-l" 
/product- "me tase " 

26 . . 252 

/ region_name = " Tryp_SPc " 

/note- "Trypsin- like serine protease; Many of these are 

synthesized as inactive precursor zymogens that are 
cleaved during limited proteolysis to generate their 
active forms. Alignment contains also inactive enzymes 
that have substitutions of the catalytic...; cd00190" 
/db_xre f = " CDD : 29152 " 
26 

/si te_type = " cl eavage " 
/db_xre f =" CDD ; 29152 • 
order(66,lll,207) 
/site_type=" active" 
/db xref="CDDi 29152 " 
order (201, 224, 226) 
/ s i t e__t ype = " o t he r " 
/notes "substrate binding sites" 
/db xref°"CDD; 29152 " 
1. .257 

/gene- "MET- 1" 

/coded_by=" join (L2 3 134 .1: 1. . 75, L23134 . 1: 76 . .771, 
L23134 .1:785. .787) " 



ORIGIN 



// 



1 meacvssllv lalgalsvgs sfgtqiiggr eviphsrpym aslqmgshl cggvlvhpkw 

61 vltaahclaq rmaqlrlvlg Ihtldspglt fhikaaiqhp rykpvpalen dlallqldgk 
121 vlcpsrtirpl alpskrqwa agtrcsmagw glthqggrls rvlreldlqv Idtrmcnnsr 
181 fwngslspsm vclaadskdq apckgdsggp Ivcgkgrvla gvlsfssrvc tdif)qppvat 
241 avapyvswir kvtgrsa 
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portion of prepropeptide 
AA?^57262 

AAA57262.1 GI:602469 

locus HUMPOP accession L36922 



18 aa 

[Homo sapiens] . 



linear PRI 15 -DEC- 1994 



CDS 



ORIGIN 



Homo sapiens (human) 

Homo Bapiens 

Bukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; 
Catarrhini; Horainidae; Homo. 

1 (residues 1 to 18} 

Smyth, M. J., Sayer s , T . J . , Wiltrout.T,, Powers, J. C. and Trapani,J.A. 
Met-ase: cloning and distinct chromosomal location of a serine 
protease preferentially expressed in human nattiral killer cells 
J. Immunol. 151 (11), 6195-6205 (1993) 

8245461 

2 (residues 1 to 18) 

Pilat,D., Fink,T., Obermaier-Skrobanek, B. , Zimmer,M., Wekerle,H., 
Lichter,P. and Jenne.D.E. 

The human Met-ase gene (GZMM) : structure, sequence, and close 
physical linkage to the serine protease gene cluster on 19pl3.3 
Genomics 24 (3), 445-450 (1994) 

7713495 

Method: conceptual translation. 
Location/Qualif iers 
1. . 16 

/organisms "Homo sapiens" 
/db__xre f » " taxon : 9606 " 
1. .18 

/name>= "portion of prepropeptide" 
1 . . 18 

/coded_by»"L36 922. 1:845. .899" 



1 meacvssllv lalgalsv 
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Q03238 (GRAM_RAT)* Reviewed, UniProtKB/Swiss-Prot 

Last modified April 5, 201 1 . Version 83. 



Names and origin 


Protein names 


! 

Recommended name: 

Granzyme M 
EC=3.4.21- 
Altemative name(s): 

Met-ase 

Natural killer cell granular protease 
RNK-Met-1 


Gene names 


H— ■ ' - • ^ 

NameiGzmm 


(jryanism 


rvaiius norvegK^us \r\ai/ 

— ■—»«-— — - — " ' ■■ 


■ ■ ' ■ ■"" 

Taxonomic identifier 


10116 [NCBI] 


Taxonomic lineage 

Protein attributes 


Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > 
Euteleostomi > Mammalia > Eutheria > Euarchontoglires > Glires 
> Rodentia > Sciurognathi > Muroidea > Muridae > Murlnae > 
Rattus 


Sequence length 


258 AA. 


Sequence status 


Fragment. 


Sequence processing 


The displayed sequence is further processed into a mature 
form. 


Protein existence 

General annotation (Comme 


Evidence at protein level. 

nts) 


Function 


Cleaves peptide substrates after methionine, leucine, and 
norleucine. Physiological substrates include EZR, alpha- 
tubulins and the apoptosis inhibitor BIRCS/Survivin. Promotes 
caspase activation and subsequent apoptosis of target cells 

1 By similarity 


Subcellular location 


Secreted. Cytoplasmic granule. Note: Granules of large 
Granular Ivmohocvtes 


Sequence similarities 

1 

Ontologies 


Belongs to the peptidase SI family. Granzyme subfamily. 
Contains 1 peptidase SI domain. 


Keywords 

Biological process 

1 


Apoptosis 
Cytolysis 
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Molecular function 
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Protease 
Serine protease 


PTM 


Disulfide bond 
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Zymogen 


1 ecnnicai lerm 


Direct protein sequencing 






Gene Ontoloav (GO) 








Biological process 


apoptosis 

Inferred from electronic annotation. 


Source: 


UniProtKB-KW 




cytolysis 

Inferred from electronic annotation. 


Source: 


UniProtKB-KW 




Innate Immune response 

Inferred from electronic annotation. 


Source: 


UniProtKB-KW 




proteolysis 

Inferred from electronic annotation. 


Source: 


InterPro 


Cellular component 

■ 


extracellular region 

Inferred from electronic annotation. 


Source: 


UniProtKB-SubCell 


Molecular function 


serine-type endopeptidase activity 
Inferred from electronic annotation. 


Source: InterPro 


Complete GO annotation... 








Sequence annotation (Featui 


res) 
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Molecule processing 
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Amino acid modifications 



Experimental info 
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N-linked 
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1 Potential t 
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Glycosylation 

; 

; 
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N-linked 
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1 Potential I 
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1 

Disulfide bond 
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1 By similarity | j | | 




■ 

Disulfide bond 


142 4-^210 
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■-1 
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Disulfide bond 
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1 By similarity 1 | 
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Disulfide bond 


200 ^ 226 




j By similarity 1 
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Sequences 



Sequence 



Length IVIass (Da) 



Q03238 [UniParc]. 

Last modified June 1, 1994. Version 1 
Checksum: B89DC10EF54DF495 



FASTA 



258 



28.339 



10 20 3^ 40 SO 60 

liLLLIALKTL WAVGNRFEAQ IIGGRKAVPH SRPYMVSLQN TKSHMCGGVL VHQKWVLTAA 

70 80 90 100 110 12£ 
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190 200 210 220 230 240 

GVIiTDSMLCL KAGAKGQAPC KGDSGGPLVC GKGKVDGXLS FSSKNCTDIF KPTVATAVAP 

250 
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008643 (O08643_MOUSE) * Unreviewed, UniProtKBATrEMBL 

Last modified March 8. 201 1. Version 80. 



Names and origin 


Protein names 


{ Submitted name: 

1 Gran7ymp M EMBL BAA78336.1 

Submitted name: 

GranyymA M (1 ymphnryte mftt-aRft 1)1 EMBL AAI392 18.1 j 

Submitted natne' 
Granzyme M (Lymphocyte met-ase 1), isoform CRA_b 

1 EMBL EDL31677.1 | 

Submitted name: 

Granzyme prerurserl EMBL AAB5 1606.1 j 


Gene names 


Name- G7mm 1 MGI 99549 | 

Synonyms: LMet-1 i emblaab5i606.i | mmft-1 

1 EMBL BAA78336.1 
ORF mnn 1 EMBLEDL31677.1 I 

Names: 


Oraanlsm 


Mus mu<srtilij« /n/iou^p) 1 EMBL AAB51606.1 


1 axonomic lueniiiier 




Taxonomic lineage 

: 

1 

Protein attributes 


Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > 
Euteleostomi > Mammalia > Euthena > Euarchontoghres > Glires 

> Rodentia > Sciurognathi > Muroidea > Muridae > Murinae > Mus 

> Mus 


Sequence length 


264 AA. 


Sequence status 


Complete. 


Sequence processing { 


The displayed sequence is further processed into a mature 
form. 


Protein existence } Evidence at transcript level. 
General annotation (Comments) 


Sequence similarities | 

Ontologies 


RftlongR tn the ppptiHas*^ Pii family 1 RuleBase RU000360V4 


Keywords 

Domain 

Molecular function 
Gene Ontology (GO) 


5^ignall EMBL AAB5 1606.1 

Hydrolase 
Protease 

55#»rin*a proteaR^a 1 RuleBase RU004260V0 I 
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Biotogical process 



Molecular function 



cytolysis 

Inferred from direct assay. Source: MGI 
proteolysis 

Inferred from electronic annotation. Source: InterPro 



protein binding 

Inferred from physical interaction.. Source: MGI 

serine-type endopeptidase activity 
Inferred from electronic annotation. Source: InterPro 



Complete GO annotation. 



Sequence annotation (Features) 



Feature key 



Position 
(s) 



Length Description 



Graphical view 



Feature identifier 



Molecule processing 



Signal peptide 



Chain 



1-26 



26 



27-264 



238 



I Potential | 



I 



EMBL AAB51606.1l 



I Potential 



4 



PRO 5000142980 



EMBL AAB51606.1! 



Sequences 



Sequence 



008643 [UniParc]. 

Last modified July 1, 1997. Version 1 
Checksum: 496E644D2630CD14 



FASTA 



Length 

264 



Mass (Da) 

29,122 



10 20 30 40 50 60 

MEVCWSLLLL LALKTLWAAG NRFETQIIGG REAVPHSRPY M^^LQKAKSH VC6GVLVHRK 

70 80 90 100 110 120 

WVLTAAHCLS EPLQNLKIiVL GLHNLHDIiQD PGLTFYIREA IKHPGYNHKY ENDIiALLKLD 

13^ 140 150 160. 170 180 

RRVQPSKNVK PLALPRKPRS KPAEGTWCST AGWGMTHQGG PRARALQELD LRVLDTQMCN 

190 200 210 220 230 240 

NSRFWNGVLI DSMLCLKAGS KSQAPCKGDS GGPLVCGKGQ VDGILSFSSK TCTDIFKPPV 

250 260 

ATAVAPYSSW IRKVIGRWSP QSLV 
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"Cloning and expression of tlie recombinant mouse natural Iciller cell granzyme IVIet 
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Kelly J.M., O'Connor M.D.. Hulett M.D., Thia K.YT.. Smyth M.J. 
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Immunogenetics 44:340-350(1996) [PubMed: 8781119] [Abstract] 
Cited for : NUC LEOTIDE SEQUE NCE. 
Strain : BALB/c I embl AABsisogT] 
Tissue : Spleen i emblaab51606.i | 



[2] 



[3] 



"A connparison of whole-genome shotgun-derived mouse chromosome 16 and the 
human genome." 

Mural R.J., Adams M.D., Myers E.W., Smith H.O., Miklos G.L, Wides R., Halpern A.. Li P.W., 

Sutton G.G., Nadeau J., Salzberg S.L, Holt R.A., Kodira CD.. Lu F., Chen L. Deng Z.. 

Evangelista C.C.. Gan W. I±±l Stephenson L.D. 

Science 296:1661-1671(2002) [PubMed: 12040188] [Abstract] 

Cited for : NU CLEOTIDE SEQU ENCE. 

Strain: Mixed I embl edl31677.i j 



Mural RJ., Adams M.D., Myers E.W., Smith H.O., Venter J.C. 
Submitted (SEP-2005) to the EMBL/GenBank/DDBJ databases 
; Cited for : NU CLEOTIDE SEQU ENCE. 
' Strain: Mixed 1 embl EPLaisTTTI 



[4] 



"The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian 
Gene Collection (MGC)." 

The MGC Project Team 

Genome Res. 14:2121-2127(2004) [PubMed: 15489334] [Abstract] 
Cited for : NU CLEOTIDE SEQU ENCE [LARGE SCALE MRNA]. 

Tissue: Brain I EMBLAAi392i8n 



"High expression of alternative transcript of granzyme M in the mouse retina. 

Taniguchi M., Tani N., Suemoto T,, Ishimoto I., Shiosaka S., Yoshida S. 
Neurosci. Res. 34:115-123(1999) [PubMed: 10498337] [Abstract] 
Cited for: NUCLEOTIDE SEQUENCE. 



+ Additional computationally mapped references. 



Cross-references 



Sequence databases 



@EMBL 
O GenBank 
©DDBJ 



L76741 mRNA. Translation: AAB51 606.1. 
BC139217 mRNA. Translation: AAI39218.1, 
BC139218 mRNA. Translation: AAI39219.1, 
AB015728 mRNA. Translation: BAA78336.1. 
CH466553 Genomic DNA. Translation: EDL31677.1 



IPI 


IPI001 14906. 


RefSeq 


NP_032530.1. NM_008504.2. 


UnlGene 


Mm,378960. 


3D structure databases 




HSSP 


HSSP built from PDB template 1FI8 based on UniProtKB 
PI 8291. 


ProteinModelPortal 


008643. 


Mod Base 


Search... 


Protein-protein interaction databases 


STRING 


008643. 
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Protein family/group databases 






MEROPS 


S01.139. 




Proteomic databases 






PRIDE 


008643. 




Genome annotation databases 






EnsembI 


ENSMUST00000020549; ENSMUSP00000020549; 
' ENSMUSG00000054206. 




GenelO 


16904. 


KEGG 


mmu; 16904. 


UCSC 


uc007fzj.1, mouse. 




Oraanisin-sDecific databases 






CTD 


16904. 




MGI 


MGI:99549. Gzmm. 




Phylogenomic databases 






eggNOG 


maNOG 18874. 




HOGENOM 


HBG755338. 


HOVERGEN 


HBG013304. 


InParanoid 


008643. 


OMA 


KDQAPCK. 


PhylomeDB 


008643. 




Gene expression databases 






ArrayExpress 


008643. 




Bgee 


008643. 




Genevestigator 


008643. 




Family and domain databases 






InterPro 

III 1 1 


IPR009003. Pept cys/ser Trypsin-like. 
IPR018114. Peptidase S1/S6 AS, 
IPR001254, Peptidase S1 S6. 
IPR001314. Peptidase_S1A. 
[Graphical view] 




Pfam 


PF00089. Trypsin. 1 hit. 
[Graphical view] 




PR00722. CHYMOTRYPSIN. 


SMART 


SM00020. Tryp_SPc. 1 hit. 

[Graphical view] 


SUPFAM 


SSF50494. Pept_Ser_Gys. 1 hit. 


PROSITE 

f 


PS50240. TRYPSIN DOM. 1 hit. 
PS00134. TRYPSIN^HIS. 1 hit. 
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ProtoNet 

Other Resources 

NextBio 


1 PS00135.TRYPSIN„SER. 1 hit. 
1 [Graphical view] 


1 Search... 
290930. 


SOURCE 

Entry information 


Search... 


Entry name 


O08643„MOUSE 


Accession 


Primary (citable) accession number: 008643 


■ 

Entry history 

! 
i 


Integrated into July 1 , 1 997 

UniProtKB/TrEMBL: 

Last sequence July 1, 1997 

update: 

Last modified: March 8, 201 1 

This is version 80 of the entry and version 1 of the sequence. 
[Complete history] 


Entry status 


Unreviewed (UniProtKBiTrEMBL) 
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P32197 (ELAS_GADMO) # Reviewed, UniProtKB/Swiss-Prot 

Last modified March 2. 2010. Version 44. 



Names and origin 


Protein names 


Recommended name: 
Elastase 

EC=3.4.21.- 


Vi./! ydl iloi f 1 


^^sarliio lYmrhiisi /A^tisntir* 
UdUUo illi/rilUa OviU^ 


1 axonomic loentiiier 




1 axonomic lineage 


■ 

cUKaryoia > ivietazoa > onoraaia > oraniaia > verceDraia > 
Euteleostomi > Actlnopterygii > Neopterygii > Teleostei > 
Euteleostei > Neoteleostei > Acanthomorpha > 
Paracanthopterygii > Gadiformes > Gadidae > Gadus 


Protein attributes 




Sequence length 


20 AA. 


Sequence status 


Fragment. 


— — — 

Protein existence 


Evidence at protein level. 



General annotation (Comments) 



Function 



I Digests most rapidly at the C-terminal side of alanine residues, 
I but also cleaves at valine and leucine residues. 



Sequence similarities 



Belongs to the peptidase S1 family. Elastase subfamily. 
Contains 1 peptidase S1 domain. 



Ontologies 



Keywords 




Molecular function 


Hydrolase 
Protease 
Serine protease 


Technical term 


Direct protein sequencing 


Gene Ontology (GO) 




Molecular function 


serine-type peptidase activity 

Inferred from electronic annotation. Source: UniProtKB-KW 


Complete GO annotation... 
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Sequence annotation (Features) 



Feature key 



• 

Position 


Length 


Description 


Graphical view 


(s) 









Feature identifier 



Molecule processing 

Chain 

Regions 

Domain 



1 ->20 



>20 { Elastase 



PRO 000008867 



1 ->20 



>20 Peptidase 
S1 



Experimental Info 

LJ I Non-terminal residue i 



20 



1 



Sequences 



Sequence 



Length Mass (Da) 



P32197 [UniParc]. 

Last modified October 1, 1993. Version 1. 
Checksum: 51D4B08262AC84BC 



PASTA 



20 



2,284 



? 10 20 

1 WGGEVARAH SWPWQISLQY 



« Hide 



References 



[1] 



I M 



Properties of elastase from Atlantic cod, a cold-adapted proteinase. 

Asgeirsson B., Bjarnason J.B. 

Biochim. Biophys. Acta 1164:91-100(1993) [PubMed: 8518301] [Abstract] 
Cited for : PROTEIN SEQUENCE. 
Tissue: Intestine. 
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ModBase 



833787. 
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Protein family/group databases 

MEROPS 

Family and domain databases 



j S01.153. 



PROSITE 


PS50240. TRYPSIN 
PS00134. TRYPSIN 
PS00135. TRYPSIN, 
[Graphical view] 


.DOM. Partial match. 
^HIS. Partial match. 
'SER. Partial match. 


ProtoNet 


Search... 




Entry information 






Entry name 


EU\S_GADMO 



Accession 



Entry history 



Entry status 



Annotation program 



Relevant documents 



Primary (citable) accession number: P32197 



Integrated into October 1 , 1 993 

UniProtKB/Swiss- 

Prot: 

Last sequence October 1 , 1 993 
update: 

Last modified: March 2, 2010 

This is version 44 of the entry and version 1 of the sequence. 
[Complete history] 



Reviewed (UniProtKB/Swiss-Prot) 
Chordata Protein Annotation Program 



Peptidase families 

Classification of peptidase families and list of entries 



SIMILARITY comments 

Index of protein domains and families 
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P23946 (CMA1_HUMAN)* Reviewed, UniProtKB/Swiss-Prot 

Last modified April 5, 201 1 . Version 118. 



r 



Names and origin 


Protein names 


Recommended name: 
Chymase 
EC=3.4.21.39 

Alternative name(s): 
Alpha-chymase 
Mast cell protease 1 


Gene names 


Name: CMA1 
SynonymsiCYH, CYM 


Organism 


Homo sapiens (Human) [Complete proteome] 


Taxonomic identifier 


9606 [NCBI] 


Taxonomic lineage 


Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > 
Euteleostomi > Mammalia > Eutheria > Euarchontoglires > 
Primates > napiorrnini > Lraiarrnini > nominiaae > nomo 


Protein attributes 




Sequence length 


247 AA. 


Sequence status 


Complete. 


Sequence processing 


1 iiiiiiii II II II 

The displayed sequence is further processed into a mature 
form. 


Protein existence 


Evidence at protein level. 


General annotation (Comments) 


Function 


Major secreted protease of mast cells with suspected roles in 
vasoactive peptide generation, extracellular matrix degradation, 
and regulation of gland secretion. 


Catalytic activity 


Preferential cleavage: Phe-|-Xaa > Tyr-|-Xaa > Trp-|-Xaa > Leu 
-l-Xaa. 


Subcellular location 


Secreted. Cytoplasmic granule. Note: Mast cell granules. 


Tissue specificity 


Mast cells in lung, heart, skin and placenta. Expressed in both 
normal skin and in urticaria pigmentosa lesions. L^§LlJ 


Sequence similarities 


Belongs to the peptidase 81 family. Granzyme subfamily. 
Contains 1 peptidase S1 domain. 


Ontologies 




Keywords 




Cellular component 


Secreted 


Coding sequence diversity 


Polymorphism 
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Domain 


Signal 


Molecular function 


Hydrolase 
Protease 
Serine protease 


PTM 


Disulfide bond 
Zymogen 


Technical temi 


3D-structure 
Complete proteome 
Direct protein sequencing 


Gene Ontology (GO) 




Biological process 


interleukin-1 beta biosynthetic process 
Inferred from direct assay. Source: BHF-UCL 

proteolysis 

Inferred from electronic annotation. Source: InterPro 

regulation of inflammatory response 
Inferred by curator. Source: BHF-UCL 


Cellular component 


extracellular region 

Inferred from electronic annotation. Source: UnlProtKB-SubCell 


Molecular function 


serine-type endopeptidase activity 

Non-traceable author statement 1 I Source: UniProtKB 



Complete GO annotation. 



Sequence annotation (Features) 



Feature key 



Position(s) j Length j Description 



Graphical view 



I Feati|i 



re 



Molecule processing 


















Signal peptide 


1 


-19 


19 




1 






m 


Propeptide 


20 


-21 


2 


Activation peptide 


1 


PRO. 


-01 




Chain 


22- 


247 


226 


Chymase 


1 


PRO. 




Regions 


















Domain 


22- 


245 


224 


Peptidase SI 


1 1 







Sites 



□ 

□ 



Active site 



Active site 



Active site 



66 



110 
203 



1 
1 



Charge relay system 



Charge relay system 



Charge relay system 



r 
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Amino acid modifications 

□ Glycosylation 



□ 



□ 



Glycosylation 



Disulfide bond 



Disulfide bond 



80 



103 



51 *-*67 



144 ^ 209 



1 



1 



N-linked (Glc NAc.) 

I Ref.9 1 j Ref.1 | 



N-linke d (GlcNAc.) 

I Ref.9 I 



II 



I I 



□ 



Disulfide bond 



i 175 ^ 188 



II 



Natural variations 



□ 


Natural variant 


46 


1 


G ^ R. 

[dbSNP:rs5246] 


1 

1. .. 




Natural variant 


66 


1 

..... — .. — 1 


H ^ R. 

[dbSNP:rs5247] 
1 Ref.6 1 

[ ' 




□ 


Natural variant 


98 


1 


R-^ H. 
I [dbSNP:rs1 3306252] 




Experimental info 










□ 


Sequence conflict 


28 


1 


C-*Sin 

AAB26828. 1 j 


1 




Sequence conflict 


131 -132 


2 


FP-^AVAA 
sequence 1 ^^^-^ 1 





VARIO 



VAR 1 0- 



VAR I o: 



i I 



Secondary structured 

1 , 

I Helix I Strand | Turn 

Details... 

Sequences 



A m 



Sequence 



Length Mass (Da) 



P23946 [UniParc]. 

Last modified March 1, 1992. Version 1 
Checksum: DC1464A049ED6B00 



FASTA 



247 



27,325 



10 20 30 40 50 60 

MLLLPLPLLL FLLCSRAEAG EIIGGTECKP HSRPYMAYLE IVTSNGPSKF CGGFLIRRNF 

70 80 90 100 110 1 20 

VLTAAHCAGR SITVTLGAHN ITEEEDTWQK LEVIKQFRHP KYNTSTLHHD IMLLKLKEKA 

130 140 150 160 170 180 

SLTIiAVGTLP .FPSQFNFVPP GRMCRVAGWG RTGVLKPGSD TLQEVKLRLM DPQACSHFRD 

ISO 200 210 220 230 240 

FDHNLQIiCVG NPRKTKSAFK GDSGGPLLCA GVAQGIVSYG RSDAKPPAVF TRISHYRPWI 
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PIR 
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3D structure databases 
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PDBsum 


® RCSB PDB 


(A) 












® PDBj 


1KLT X-ray 


1.90 


A 


22-247 




[»] 


1 NN6 X-ray 


1.75 


A 


20-247 




[»] 




1PJP X-ray 


2.20 


A 


22-247 




[»] 




1T31 X-ray 


1.90 


A 


22-247 




► • 

» 






2HVX X-ray 


2.60 


A 


22-247 




» 






3N70 X-ray 


1.80 


A 


22-247 




» 

• < 




ProteinModelPortal 


P23946. 


SMR 


P23946. Positions 24-247. 












Mod Base 


Search... 









http://www.uniprot.org/uniprot/P23946 
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Protein-protein interaction databases 



STRING 


P23946. 


Protein family/group databases 




MEROPS 


801,140. 


PTM databases 




PhosphoSite 


P23946. 


Proteomic databases 




PRIDE 


P23946. 


Genome annotation databases 




EnsembI 


ENST00000206446; ENSP0000Q206446; ENSG00000092009. 
ENST00000250378; ENSP00000250378; ENSG00000092009. 


GenelD 


.1215. 


KEGG 


hsa:1215. 


UCSC 


ucOOIwpp.1. human. 


v^iyaniSiTi'spedTic uaiaDases 




CTD 


1215. 


GeneCards 


GC14M005089. 


H-lnvDB 


HIX0202104. 




noNo.iiUy/. LflvlAl. 


HPA 


CAB000363. 


MIM 


1 1 8938. gene. 


neXtProt 


NX_P23946. 


PharmGKB 


PA26623. 


GenAtlas 


Search... 


r iiy iviydiuiiiiv uciiciucidco 




eggNOG 


prNOG07867. 


GeneTree 


ENSGT00580000081281 . 


^v/OdN^«/lVl 


riDO / OOOOO. 


HOVERGEN 


HBG013304. 


InParanoid 


P23946. 


OMA 

! 


HHDIMLL 

1 : 


OrthoDB 


EOG4R23VK. 


PhylomeDB 


P23946. 


Enzyme and pathway databases 




BRENDA 


3.4.21.39. 247. 



http://www.uniprot.org/uniprot/P23946 
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Gene expression databases 

ArrayExpress 



P23946, 



Bgee 



! P23946, 



CleanEx 
Genevestigator 



t 



GermOnline 



Family and domain databases 



HS_CMA1. 
P23946. 

ENSG00000092009. Homo sapiens. 



InterPro 


IPR009003. Pept cys/ser Trypsin-like. 
IPR0181 14. Peptidase S1/S6 AS. 
IPR001254. Peptidase S1 S6. 
IPR001314. Peptidase_S1A. 
[Graphical view] 


Pfam 


PFOOOoS. Trypsin. 1 nit. 
[Graphical view] 


PRINTS 


PR00722. CHYMOTRYPSIN. 


SMART 


SM00020. Tryp_SPc. 1 hit. 
[Graphical view] 


SUPFAM 


SSr50494. Pept_Ser_Cys. 1 nit. 


PROSITE 


PS50240. TRYPSIN DOM. 1 hit. 
PS001 34. TRYPSIN HIS. 1 hit. 
PS00135. TRYPSIN_SER. 1 hit. 

rGraDhical viewl 


ProtoNet 

Other Resources 

BIndingDB 


Search... 
P23946. 


NextBio 


5003. 


PMAP-CutDB 


P23946. 


SOURCE 


Search... 


Entry information 




Entry name 


CMA1_HUMAN 


Accession 


Primary (citable) accession number: P23946 
Secondary accession number(s): B5BUM8 S Q9UDH5 


Entry history 


Integrated into March 1. 1992 



Entry status 
Annotation program 
Disclaimer 



-J 



UniProtKB/Swiss- 
Prot: 

Last sequence March 1 . 1 992 
update: 

Last modified: April 5» 201 1 

This is version 118 of the entry and version 1 of the sequence. 
[Complete history] 

Reviewed (UniProtKB/Swiss-Prot) 

Chordata Protein Annotation Program 

Any medical or genetic information present in this entry is 
provided for research, educational and informational purposes 



http://v^ww.uniprot.org/uniprot/P23946 
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only. It is not in any way intended to be used as a substitute for 
professional medical advice, diagnosis, treatment or care. 



Relevant documents 



Human chromosome 14 

Human chromosome 14: entries, gene names and cross-references to MIM 

Human entries with polymorphisms or disease mutations 

List of human entries with polymorphisms or disease mutations 

Human polymorphisms and disease mutations 

Index of human polymorphisms and disease mutations 

MIM cross-references 

Online Mendelian Inheritance in Man (MIM) cross-references in UniProtKB/Swiss-Prot 
PDB cross-references 

Index of Protein Data Bank (PDB) cross-references 
Peptidase families 

Classification of peptidase families and list of entries 

SIMILARITY comments 

Index of protein domains and families 



© 2002-201 1 UniProt Consortium | License & Disclaimer | Contact 




http://www.uniprot.org/uniprot/P23946 
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Translations of Life 



Display Settings: GenPept 

chymase [Homo sapiens] 

GenBanlc AAB26828.1 



Goto: 
LOCUS 

DEFINITION 
ACCESSION 
VERSION 
DBSOURCE 
KEYWORDS 
SOURCE 

ORGANISM 



linear PR! 19-OCT-1993 



REFERENCE 
AUTHORS 

TITLE 
JOURNAL 
PUBMED 
REMARK 

COMMENT 
FEATURES 

source 



Protein 



ORIGIN 



AAB26828 226 aa 

chymase [Homo sapiens] . 
AAB26828 

AAB26828.1 GI:409011 
accession S61334 . 1 

• « 

Homo sapiens (human) 

Homo sapiens 

Bukaryota; Metazoa; Chordata; Craniata; Vertebrata; Buteleostomi; 
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhlni; 
Catarrhini; Hominidae; Homo. 
1 (residues l to 226) 

Sukenaga,Y., Kido.H., Ne)ci,A., Enomoto,M., Ishida^K., Takagi,K. and 
Katunuma, N. 

E>uri£ication and molecular cloning of chymase from human tonsils 
FEES Lett. 323 (1-2), 119-122 (1993) 
8495723 

GenBan)c staff at the National Library of Medicine created this 
entry [NCBI gibbsq 132634] from the original journal article. 
Method: conceptual translation supplied by author. 
Loca t ion/Qual i f ier s 
1. . 226 

/organism="Homo sapiens" 
/db__xre f " " taxon : 9606 " 

l.,226 

/products " chymase " 
Region 1 . . 222 

/ re g i on_name = " Tryp_S Pc " 

/note="Trypsin-lilce serine protease; Many of these are 

synthesized as inactive precursor zymogens that are 

cleaved during limited proteolysis to generate their 

active forms. Alignment contains also inactive enzymes 

that have substitutions of the catalytic...; cd00190" 

/db xref»"CDD: 2 915 2 " 
Site 1 

/site_tyi)e=" cleavage" 

/db_xre f - " CDD : 29152 " 
Site order (45,89, 18 2) 

/site_type= "active " 

/ db_xre f =" CDD : 29152 " 
Site order(176, 197, 199) 

/site_type= "other" 

/note»" substrate binding sites" 

/ db_xre f «" CDD : 29152 • 
CDS 1..226 

/gene = "chymase" 

/coded_by="S6l334 . 1:1. .681" 

/notes "Sequence start is N-terminal of purified protein" 

1 iiggteskph srpymaylei vtsngpskfc ggflirrnfv Itaahcagrs itvtlgahni 
61 teeedtwqkl evikqfrhpk yntstlhhdi mllklkekas Itlavgtlpf psqfnfvppg 
121 rmcrvagwgr tgvllcpgsdt Iqevklrlmd pqacshfrdf dhnlqlcvgn prktksafkg 
181 dsggpllcag vaqgivsygr sdalqipavft rishyrpwin qilqan 



http://www.ncbi.nlm.nih.gov/protein/AAB26828 
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Protein 

Translations of Life 



Display Settings: GenPept 



mast cell chymase [Homo sapiens] 



GenBank: AAA52020.1 
FASTA Graphics 



Goto. 
LOCUS 

DEFINITION 

ACCESSION 

VERSION 

DBSOURCE 

KEYWORDS 

SOORCB 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
PUBMED 
COMMENT 
FEATURES 

source 



Protein 



sig peptide 



mat peptide 



Region 



Site 



Site 



Site 



CDS 



AAA52020 247 aa linear PRI Ol-NOV-1994 

mast cell chymase (Homo sapiens] . 

AAA52020 

AAA52020.1 GI: 180542 

locus HUMCHYMASE accession M64269. 1 

Homo sapiens (human) 

Homo sapiens 

BuJcaryota; Metazoa; Chordata; Craniata; Vertebrata; Buteleostomi ; 
Mammalia; Eutheria; Buarchontoglires; Primates; Haplorrhini; 

Catarrhini,- Hominidae; Homo. 
1 (residues 1 to 247) 

Caughey,G.H. , Zerweck^E.H. and Vanderslice, P. 

Structure, chromosomal assignment, and deduced amino acid sequence 

of a human gene for mast cell chymase 

J. Biol. Chem. 266 (20), 12956-12963 (1991) 

20715S2 

Method: conceptual translation. 
Location/Qualifiers 

1. .247 

/organisms "Homo sapiens" 
/db_xre £ = " taxon : 9606 " 
/maps " Unas s i gned " 
/eel l_type= "mast cell" 
/ ti 8 sue_type« "placenta" 
1. .247 

/products "mast cell chymase" 
1. .21 

/gene="CMAl" 
/note="G00-127-603" 
22. .247 

/gene="CMAl" 

/products "mast cell chymase" 
/note-"GOO- 127-603" 
22 . . 243 

/ r e g i on_name = " T ryp_S P c " 

/note="Trypsin-like serine protease; Many of these are 
synthesized as inactive precursor zymogens that are 
cleaved during limited proteolysis to generate their 
active forms. Alignment contains also inactive enzymes 
that have substitutions of the catalytic...; cdO0l90" 
/db_xre f = "CDD : 29152 " 
22 

/site_type=" cleavage" 
/dbxr e f =" CDD : 29152 " 
order (66, 110, 203) 
/8ite_type= "active" 
/db_xref-*CDD; 291S2 " 
order (197,218, 220) 
/ site_typeo" other" 
/notes "substrate binding sites" 
/db_xre f» "CDD : 29152 » 
1 . . 247 

/genes^CMAl" 

/coded_by»" join (M64269 . 1 : 5156 . . 5215 , M64269 . 1 : 5886 . . 6036 , 
M64269 . 1 : 678 6 . . 692 1 , M64269 . 1 : 7108 . .7362, 
M64269. 1;7733. .7876) " 
/db xref»"GDB:GD0-127-603" 



ORIGIN 



// 



1 mlllplplll fllcsraeag eiiggteckp hsrpymayle ivtsngpskf cggflirrnf 

61 vltaahcagr sitvtlgahn iteeedtwqk levikqfrhp kyntstlhhd imllklkeka 

121 sltlavgtlp fpsqfnfvpp grmcrvagwg rtgvlkpgsd tlqevklrlm dpqacshfrd 

181 fdhnlqlcvg nprktksafk gdsggpllca gvaqgivsyg rsdakppavf trishyrpwi 
241 nqilqan 



http://www.ncbi.nlm.nih.gov/protein/AAA52020 
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Display Settings: G«nPept 

chymase [Homo sapiens] 

GenBanlc AAAS2021.1 

FASTA Gfaohtes 



CSoto: 
LOCUS 

DEFINITION 

ACCESSION 

VERSION 

DBSOURCE 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 

TITLE 
JOURNAL 
PUBMED 

COMMENT 

FEATURES 

source 



Protein 



Region 



Sit:e 



Site 



Site 



CDS 



ORIGIN 



AAA52021 247 aa linear PRI Ol-NOV-1994 

chymase {Homo sapiens] . 

AAA52021 

AAA52021.1 01:180544 

locus HUMCHYMB accession M69137. 1 

Homo sapiens (human) 

Homo sapiens 

Eiikaryota; Metazoa; Chordata; Craniata; Vertebrata; Buteleostomi; 
Mammalia; Eutheria; Buarchontoglires; Primates; Haplorrhini; 

Catarrhini; Hominidae; Homo. 
1 (residues 1 to 247) 

Urata.H., Kinoshita,A . , Perez, D.M., Misono<K.S., Biinipus , F . M . < 
Graham, R.M. and Husain^A. 

Cloning of the gene and cDNA for human heart chymase 
J. Biol. Chem. 266 (26), 17173-17179 (1991) 
1894611 

Method: conceptual translation. 
Location/Qualifiers 
1. .247 

/organism="Homo sapiens" 
/db xref g" taxon; 9606 " 
/maps "Unassigned" 
/tis8ue_types "heart # 117" 
1. . 247 

/product =" chymase " 
/ EC_number - " 3 .4 .21.39 * 
22. .243 

/region_name= ''Tryp_SPc" 

/note» "Trypsin- li)ce serine protease; Many of these are 
synthesized as inactive precursor zymogens that are 
cleaved during limited proteolysis to generate their 
active forms. Alignment contains also inactive enzymes 
that have substitutions of the catalytic...; cd00190" 
/db_xref""CDD : 29152 " 
22 ~ 

/ s i t e_type = " cl eavage " 
/db xref="CDD; 29152 " 
order (66, 110, 203) 
/ s i te_ t ype = " ac t i ve " 
/db_xre f =" CDD : 29152 " 
order (197, 218, 220) 
/si t e_t ype = " other " 
/notes"8ubstrate binding sites" 
/db_xref= " CDD : 29152 " 
1. . 247 

/genes "CMAl" 

/8tandard_name> "angiotensin I convertase" 
/coded_by=" join(M69137 . 1 : 259 . . 3 16 , M69137 . 1 : 989 . .1139, 
M69137. 1:1886. . 2021, M69137 . 1 : 2208 . .2462, 
M69137. 1:2835. .2978) " 
/db_xref»"GDB:G0O-127-6O3" 



// 



1 mlllplplll fllcsraeag eiiggteckp hsrpymayle ivtsngpskf cggflirmf 
61 vltaahcagr sitvtlgahn iteeedtwgk levikqfrhp kyntstlhhd imllklkeka 
121 sltlavgtlp fpsqfnfvpp grmcrvagwg rtgvlkpgsd tlqevklrlm dpqacshfrd 
181 fdhnlqlcvg nprktksafk gdsggpllca gvaqgivsyg rsdakppavf trishyrpwi 
241 nqilqan 



http://www.ncbi.nlm.nih.gov/protein/AAA52021 
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G«nBanlc AAA52019.1 
FAgJ^ Graphics 



Goto: 
LOCUS 

DEFINITION 
ACCESSION 
VERSION 
DBSOURCB 
KEYWORDS 
SOORCB 

ORGANISM 



REFERENCE 
AITTHORS 

TITLE 
JOURNAL 
PUBMED 
COMMENT 
FEATURES 

source 



ORIGIN 



AAA52019 247 aa linear PRI 01-NOV>1994 

chymase [Hotno sapiens] . 
AAAS2019 

AAA52019.1 GI; 180540 
locus HUMCHYMA accession M69136 . 1 

* 

Homo sapiens (human) 

Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; 

Catarrhinl; Hominidae; Homo, 
1 (residues 1 to 247) 

Urata,H., Kinoshita, A. , Perez, D.M., MiBono,K.S., Bumpus,P.M., 
Graham, R.H. and Hu8ain,A. 

Cloning of the gene and cDNA for human heart chymase 
J. Biol. Chem. 266 (26) « 17173-17179 (1991) 
1894611 

Method: conceptual translation. 
Location/Qualifiers 
1. .247 

/organisms "Homo sapiens" 

/db_xre f = " taxon ; 9606 " 

/maps " Unas s i gned ** 

/tiBSue_type="heart « 117" 
Protein 1..247 

/product" "chymase " 

/EC^nuraber = " 3.4 .21.39 " 
Region 22. . 243 

/ re g i on^name = " Tryp_S Pc " 

/notes "Trypsin- like serine protease,- Many o£ these are 
synthesized as inactive precursor zymogens that are 
cleaved during limited proteolysis to generate their 
active forms. Alignment contains also inactive enzymes 
that have substitutions of the catalytic...,- cd00190" 
/db_xref- "CDD: 29152" 
22 

/si te_type= " cleavage " 
/db_xre f - "ODD : 29152 " 
order (66, 110, 203) 
/ si t e^type = " ac t ive " 
/db xref-"CDD; 29152 " 
order (197, 218, 220) 
/ si te_type=" other" 
/notes "substrate binding sites" 
/db_xre f ODD : 29152 " 
1. . 247 

/genes "CMAl" 

/standard_^names"angioten8in I convertase" 
/coded_by="M69136 . l: 16 . . 759" 
/db_xref-"GDB:G00-127-603" 

1 mlllplplll fllcsraeag eiiggteckp hsrpymayle ivt8ngps)cf cggflirmf 
61 vltaahcagr sitvtlgahn lteeedtwq)c levikqfrhp kyntstlhhd imllklkeka 
121 sltlavgtlp fpsqfnCvpp grmcrvagwg rtgvlkpgsd tlqevklrlm dpqacshfrd 
181 fdhnlqlcvg nprktksafk gdsggpllca gvaqgivsyg rsdakppavf trishyrpwi 
241 nqilqan 



Site 



Site 



Site 



CDS 
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P21842 (CMA1_CANFA)* Reviewed, UniProtKB/Swiss-Prot 

Last modified March 8, 2011. Version 79. 



Names and origin 



Protein names 


Recommended name: 
Chymase 
EC=3.4,21.39 

Alternative name(s): 
Alpha-chymase 
Mast cell protease 1 


Gene names 


Name:CMA1 


Organism 


Canis famlliaris (Dog) (Canis lupus famiiiaris) 


Taxonomic identifier 


9615 [NCBI] 


Taxonomic lineage 


Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > 
Euteleostomi > Mammalia > Eutheria > Laurasiatheria > 
Carnivora > Caniformia > Canidae > Canis 



Protein attributes 


Sequence length 


249 AA, 


Sequence status 


Complete. 


Sequence processing 


The displayed sequence is further processed into a mature 

form. 


Protein existence 


Evidence at protein level. 



Genera! annotation (Comments) 



Function Major secreted protease of mast cells with suspected roles in 

1 vasoactive peptide generation, extracellular matrix degradation, 
j and regulation of gland secretion. 

Catalytic activity j Preferential cleavage: Phe-|-Xaa > Tyr-|-Xaa > Trp-|-Xaa > Leu 

{ -|-Xaa. 

Subcellular location ™ I f^prrptg>ri I sy simiiarity]^ Cytoplas mic granule I By simiiarityl Note: 

j Mast cell granules I By similarity 1 

Sequence similarities | Belongs to the peptidase SI family. Granzyme subfamily. 

i Contains 1 peptidase SI domain. 



Ontologies 



Keywords 




Cellular component 


1 Secreted 


Domain 


j Signal 



http://www.uniprot.org/uniprot/P2 1 842 
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Molecular function 


Hydrolase 




Protease 




Serine protease 


n 1 IVI 


r^iciilfiHo K\/*tnH 
L/loUIMUC Uv^llU 




Zymogen 


Technical temn 


Direct protein sequencing 


Gene Ontology (GO) 




Biological process 


proteolysis 




Inferred from electronic annotation. Source: InterPro 



Complete GO annotation. 



Sequence annotation (Features) 



Feature key 



Molecule processing 



Position(s) Length Description Graphical view 



Feature identifier 





Signal peptide 


1 - 19 


19 




1 


□ 


Propeptide 


20-21 


2 

228 


Activation 
peptide 


1 




Chain 


22 - 249 


Chymase I 1 



I 

I 

1 

.1- 



PRO_0000027j43 
PRO_0000027|43 



Regions 



B Domain 



22 - 245 



224 



Peptidase 
SI 



Sites 



□ 


Active site 


66 


1 


Charge 
relay 

system 

1 Bysimllan'ty I 


1 






Active site 


110 


1 


Charge 

relay 
system 

1 By similarity 


1 




□ 


Active site 


203 


1 


Charge 

relay 

system 

1 By simHarlty 


1 

■ 


1 

f 

I 



Amino acid modifications 





Disulfide bond 


51 67 


j 1 By similarity | | | ^ 




□ 


Disulfide bond 


144 209 




i By similarity 


1 1 




□ 


Disulfide bond 


175^ 188 




1 By similarity 


II 





http://www.uniprot.org/uniprot/P2 1 842 
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Experimental info 



□ 


Sequence conflict 


29 


1 


K — RAA 
sequence 


1 






□ 


Sequence conflict 


38 


1 


H — QAA 


1 














sequence 
















1 Ref.3 1 










Sequence conflict 


45 


1 


R-^TAA 
sequence 

i Ref.3 1 


1 







Sequences 



Sequence 



Length IMass (Da) 



P21842 [UniParc]. 

Last modified May 1» 1991. Version 1 
Checksum: 3BBD0A6C2855F540 



FASTA 



249 



27,812 



10 

MHCLPLTLLL 
70 



20 

LliLCSRAEAE 
80 



30 

EIIGGTESKP 
90 



40 

HSRPYMAHLE 
100 



50 

IliTLRNHLAS 
110 



60 j 
CXSGFIilRRNF 



_____ 120 
VLTAAHCAGR FIMVTLGAHN IQKKEDTWQK LEVIKQFPHP KYDDLTLRHD IMLLKLKEKA 



130 

NLTLAVGTLP 



14£ 

LSPQFNFVPP 



150 

GRMCRVAGWG 



160 

KRQVNGSGSD 



170 

TLQEVKLRLM 



180 

DPQACRHYMA 



190 200 210 220 230 24£ 

FDHNIiQLCVG NPRKTKSAFK GDSGGPLLCA GVAQGIVSYG QNDAKPPAVF TRISHYRPWI 



NKVLKQNKA 



« Hide 
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C ross -ref e re n ces 



Sequence databases 



® EMBL 
® GenBank 
#DDBJ 


J02904 mRNA, Translation: AAA30835.1. 
U89607 Genomic DISJA. Translation: AAB94641.1. 


PIR 


A35842. 


RefSeq 


NP 001013442,1. NM 001013424.1. 
XP_855610.1. XM_850517.1. 


UniGene 


Cfa. 16336. 


3D structure databases 




ProteinModelPortal 


P21842. 


' SMR 


P21842. Positions 22-247. 


Mod Base 


Search... 


Protein-protein interaction databases 


STRING 


P21842. 


Protein family/group databases 




MEROPS 


O V/ i . i *tV^. 


Genome annotation databases 




Ensembt 


ENSCAFT00000019746; ENSCAFP00000018316; 
ENSCAFG0000001 2443. 


GenelD 


490628. 


KEGG 


cfa:490628. 


Organism-specific databases 




CTD 


490628. 


Piiyiogenomic databases 




eggNOG 


maNOG16150. 


GeneTree 


ENSGT00580000081 281 . 


HOVERGEN 


HBG013304. 


InParanoid 


P21842. 


OrthoDB 


EOG4R23VK. 


PhylomeDB 


P21842. 


Enzyme and pathway databases 




BRENDA 


3.4.21.39. 463. 
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Family and domain databases 



inierrro 


inrsuuyuuo. Kepi cys/ser i rypsm-iiKe. 
IPR01 81 14. Peptidase S1/S6 AS. 
IPR001254. Peptidase SI S6. 
IPR001314. Peptidase_S1A. 
lorapnicai viewj 


Pfam 


PF00089. Trypsin. 1 hit. 

[Graphical view] 


PRINTS 


PR00722. CHYMOTRYPSIN. 


SMART 


SM00020. Tryp_SPc. 1 hit. 

[Graphical view] 


SUPFAM 


SSF50494. Pept_Ser_Cys. 1 hit. 


PROSITE 


PS50240. TRYPSIN DOM. 1 hit. 
PS001 34. TRYPSIN HIS. 1 hit. 
PS00135. TRYPSIN_SER. 1 hit. 
[Graphical view] 


ProtoNet j Search... 
Entry Information 


Entry name | CMA1_CANFA 


Accession j Primary (citable) accession number: P21842 


Entry history 


Integrated into May 1, 1991 

UniProtKB/Swiss- 

Prot: 

Last sequence May 1, 1991 
update: 

Last modified: March 8, 201 1 

This is version 79 of the entry and version 1 of the sequence. 
[Complete history] 


Entry status 


Reviewed (UniProtKB/Swiss-Prot) 


Annotation program 


Chordata Protein Annotation Program 



Relevant documents 



Peptidase families 

Classification of peptidase families and list of entries 

SIMILARITY comments 

Index of protein domains and families 
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P21844(CMA1_MOUSE)# Reviewed, UniProtKB/Swiss-Prot 

Last modified March 8, 2011. Version 102. 



Names and origin 


Protein names 


Recommended name: 

EC=3.4.21.39 
Alternative name(s): 
Alpha-chymase 
Mast cell chymase 1 
Mast cell protease 5 

Short name=mMCP-5 

Mast cell protease 1 


Gene names 


Name: Cmal 
Synonyms: McptS 


L/ryanism 


ivius muscuius \iviouse| 


Taxonomic identifier 


10090 [NCBIJ 


Taxonomic lineage 


Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > 

Euteleostomi > Mammalia > Eutheria > Euarchontoglires > Glires 

> Rodentia > Sciurognathi > Muroidea > Muridae > Murinae > Mus 

> Mus 


Protein attributes 




Sequence length 


247 AA. 


Sequence status 


Complete. 


Sequence processing 


The displayed sequence is further processed into a mature 
form. 


Protein existence 


Evidence at protein level. 


General annotation (Comments) 


Function 


Major secreted protease of mast cells with suspected roles in 
vasoactive peptide generation, extracellular matrix degradation, 
and regulation of gland secretion. 


Catalytic activity 


Preferential cleavage: Phe-|-Xaa > Tyr-|-Xaa > Trp-|-Xaa > Leu 
-l-Xaa. 


Subcellular location 


Secreted. Cytoplasmic granule. Note: Secretory granules. 

1 Ref.6 1 


Tissue specificity 


Mast cells. 


Sequence similarities 


Belongs to the peptidase SI family. Granzyme subfamily. 
Contains 1 peptidase SI domain. 


.. - _ ..... . 

Sequence caution 


The sequence CAA48705.1 differs from that shown. Reason: 
Erroneous initiation. 
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Ontologies 



Keywords 

Cellular component 



J. 



I Secreted 



Domain 



Molecular function 



Signal 



Hydrolase 





Serine protease 


PTM 


.Disulfide bond 




Glycoprotein 




Zymogen 


Technical term 


Direct protein sequencing 



Gene Ontology (GO) 



Biological process 


proteolysis 

Inferred from electronic annotation. Source: InterPro 


Cellular component 


intracellular 

Inferred from direct assay. Source: MGI 



Complete GO annotation. 



Sequence annotation (Features) 



Feature key 



Molecule processing 



I Position(s) Length Description 1 Graphical view 



Feature identifier 



m 


Signal peptide 


1-19 


19 


Potential I 


1 








Propeptide 


20,-21 


2 


Activation 
peptide 


1 


PRO_0000027 


45 


H 


Chain 


22 - 247 


226 


Chymase 


1 


PRO_0000027 





Regions 

Domain 



22 - 245 



224 



Peptidase 
81 



Sites 

□ Active site 



66 



Charge 
relay 
system 

By similarity j ! 



□ j Active site 



.1 



110 



1 j Charge 
' relay 
! system 



L 



I By similarity 
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□ 



Active site 



203 



1 1 Charge 
I relay 
system 

I By similarity 



Amino acid modifications 

Glycosylation 



80 



N-linked 
(GlcNAc .) 

I PotentiaT 



Disulfide bond 



51 ^67 



I By simiiaritv 



II 



Disulfide bond 
Disulfide bond 

Experimental info 



144 

175 



209 
188 



j By stmilarity 



I I 



i I By similarity j 



t 



II 



□ 


Sequence conflict 


5 


1 


T — > A in 
AAA40105, 


1 




1 Ref.l 


□ 


Sequence conflict 


51 


1 


C — RAA 
sequence 

1 Ref.5 1 


1 






Sequence conflict 


224 


1 


A R In 

AAA39492. 

1 Ref.3 \ 


1 





Sequences 



Sequence 



Length iViass (Da) 



m P21844 [UniParc]. 



FASTA 



247 



27.686 



Last modified December 1, 1992. Version 2. 
Checksum: 24C290CF61237DC7 



10 20 30 40 50 60 

MHLLTLHLLL LLIiGSSTKAG EIIGGTECIP HSRPYMAYLE IVTSENYLSA CSGFLIRRNF 

70 80 90 100 110 120 

VliTAAHCAGR SITVLLGAHN KTSKEDTWQK LEVEKQFLHP KYDENLWHD IMLLKLKEKA 

130 140 150 160 170 180 

KLTLGVCTLP LSANFNFIPP GRMCRAVGWG RTNVNEPASD TLQEVKMRLQ EPQACKHFTS 

ISO 200 210 220 230 240 

FRHNSQLCVG NPKKMQNVYK GDSGGPLLCA GIAQGIASYV HRNAKPPAVF TRISHYRPWl 



i NKILREN 



« Hide 
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i 
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Cited for : NUCLEOTIDE SEQUENCE [GENOMIC DNA] OF 1-19. 
Strain: 129/Sv. 
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Reynolds D.S., Stevens R.L., Lane W.S., Carr M.H., Austen K.F., Serafin W.E. 
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I 



Additional computationally mapped references. 



C ros s -ref e re n ces 



Sequence databases 



® EMBL 
® GenBank 
® DDBJ 


M73759 mRNA. Translation: AAA40105.1. 

M73760 mRNA. No translation available. 

X68805 mRNA. Translation: CAA48705.1. Different initiation. 

M68898 mRNA. Translation: AAA39492.1. 

AF119364 Genomic DNA. Translation: AAD43901,1. 


IPI 


IPI00230426. 


PIR 


S23504. 
S26043. 


RefSeq 


NP_034910,1. NM_01 0780.2. 


UniGene 


Mm. 1252. 


3D structure databases 




ProteinModelPortal ! P21844. 


SMR ^ P21844. Positions 22-247. 
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ModBase 


[ Search... 




Protein-protein Interaction databases 




STRING 


1 P21844 




Protein family/group databases 






MEROPS 


S01.150. 




PTM databases 






PhosphoSlte 


P21844. 




Genome annotation databases 






Ensembl 


ENSMUST00000022834; ENSMUSP00000022834; 
ENSMUSG00000022225. 


— 


C3enelD 






KEGG 


mmu: 17228. 




UCSC 


uc007ubf.1. mouse. 




uryanism-speciTic aaia oases 






GTD 


17228. 




MGI 


MGI:96941. Cmal. 


« 


mm • ■ M • 

Phylogenomic databases 






oene i ree 


ENSGT00580000081 281 . 




HOVERGEN 


HBG013304. 


InParanoid 


P21844. 




OrthoDB 


EOG4R23VK. 




enzyme ana pain way aaiaoases 






BRENDA 


3.4.21.39. 244. 




Gene expression databases 






ArrayExpress 


P21844. 




Bgee 


P21844. 

- — ~-~ 




GleanEx 


MM_CMA1 . 




Genevestigator 


P21844. 


GermOnline 


ENSMUSG00000022225. Mus musculus. 




Family and domain databases 






InterPro 


IPR009003. Pept cys/ser Trypsin-like. 
IPR0181 14. Peptidase S1/S6 AS. 
IPR001254. Peptidase SI S6. 
IPR001314. Peptidase_S1A. 
[Graphical view] 
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Pfam 


PF00089. Trypsin. 1 hit. 
TGraohical viewl 


PRINTS 


PR00722. CHYMOTRYPSIN. 


SMART 


SM00020. Tryp_SPc. 1 hit. 
[Graphical view] 


SUPFAM 


SSF50494, Pept_Ser_Cys. 1 hit. 


PROSITE 


PS50240. TRYPSIN DOM. 1 hit. 
PS001 34. TRYPSIN HIS. 1 hit. 
PS00135. TRYPSIN_SER. 1 hit. 
[Graphical view] 


ProtoNet 

Other Resources 
NextBio 


Search... 
291648. 


SOURCE i Search... 

i 

Entry information 


Entry name | GMA1_MOUSE 


Accession 


Primary (citable) accession number: P21844 
Secondary accession number(s): Q9R1F0 


Entry history 


Integrated into May 1, 1991 

UniProtKB/Swiss- 

Prot: 

Last sequence December 1 , 1 992 
update: 

Last modified: March 8. 201 1 

This is version 102 of the entry and version 2 of the sequence. 
[Complete history] 


Entry status 


Reviewed (UniProtKB/Swiss-Prot) 


Annotation program 


Chordata Protein Annotation Program 



Relevant documents 



MGD cross-references 

Mouse Genome Database (MGD) cross-references in UniProtKB/Swiss-Prot 



Peptidase families 

Classification of peptidase families and list of entries 



SIMILARITY comments 

Index of protein domains and families 
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P56435 (CMA1_MACFA)* Reviewed, UniProtKB/Swiss-Prot 

Last modified January 1 1 . 201 1 . Version 60. 

r— ■ ' '■ 



Names and origin 


Protein names 


Recommended name: 

Chymase 
EC=3.4,21.39 
Aitemative name(s): 

Alpha-chymase 


Gene names 


Name:CMA1 


Organism 


Macaca fascicularls (Crab-eating macaque) (Cynomolgus 
rnonKey/ 


Taxonomic identifier 


9541 [NCdIJ 


Taxonomic lineage 


Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > 
Euteleostomi > Mammalia > Eutheria > Euarchontoglires > 
Primates > Haplorrhini > Catarrhini > Cercopithecidae > 
uercopitnecinae > Macaca 


Protein attributes 




Sequence length 


247 AA. 


Sequence status 


complete. 


Sequence processing 


The displayed sequence is further processed into a mature 
form. 


Protein existence 


Evidence at transcript level. 


General annotation (Comments) 


Function 


Major secreted protease of mast cells with suspected roles in 
vasoactive peptide generation, extracellular matrix degradation, 
and regulation of gland secretion 1 By similarity |. 


Catalytic activity 


Preferential cleavage: Phe-|-Xaa > Tyr-|-Xaa > Trp-|-Xaa > Leu 
-l-Xaa. 


Subcellular location 


Sprrfttftri 1 By similarity. Cytoplasmic graniilA 1 By similarity I Nntft- 

Mast cell granules 1 By similarity | 


Sequence similarities 


Belongs to the peptidase 81 family. Granzyme subfamily. 
Contains 1 peptidase 81 domain. 


Ontologies 




Keywords 




Cellular component 


Secreted 


Domain 


Signal 
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Molecular function 


Hydrolase 
Protease 
Serine protease 


PTM 


Disulfide bond 

Glycoprotein 

Zymogen 


Gene Ontology (GO) 




Biological process 


proteolysis 

Inferred from electronic annotation. Source: InterPro 


Cellular component 


extracellular region 

Inferred from electronic annotation. Source: UniProtKB-SubCell 


Molecular function 


serine-type endopeptidase activity 

Inferred from electronic annotation. Source: InterPro 



Complete GO annotation. 



Sequence annotation (Features) 



Feature key { Position(s) j Length | Description | Graphical view 
Molecule processing 



Feature identifier 



J Signal peptide 
□ 1 Propeptide 



1- 



1 -19 1 
20-21 1 



-|9 j i By similarityl j | 



.4......... 



2 j Activation 



I 



peptide 



Chain 



22 - 247 



226 I Chymase 



PRO 0000027435 



PRO 0000027436 



Regions 

Domain 



22 - 245 



224 



Peptidase 
S1 



Sites 



□ 



Active site 



Active site 



Active site 



66 



110 



203 



1 



1 



Charge 

relay 
system 

I By similarity 



Charge 
relay 
system 

i By similarity I 



1 



Charge 

relay 

system 



By similarity 
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Amino acid modifications 



Glycosylation 



80 I 



1 



N-linked 
(GlcNAc ) 

I Potential j 



Glycosylation 



103 



Disulfide bond 



N-linked 
( GlcNAc ) 

Potentiall 



51 ^67 



By similanty | 



Disulfide bond 



144 ^ 209 



By similarity 



I i 



Disulfide bond 



175^ 188 



By similarity j 



II 



Sequences 



Sequence 



Length fVlass (Da) 



P56435 [UniParc]. FASTA 

Last modified July 15, 1998. Version 1. 
Checksum: 6D049B34377FC8B9 



247 



27.400 



, 10 20 30 40 

i MLLLPLPLLL FFLCSRABAG EXIGGTECKP HSRPYMAYLE 



50 

IVTSNGPSKS 



60 

CX3GFLIRRNF 



i 70 80 90 100 110 120 

; VliTAVHCAGR SITVTLGAHN ITEKEDTWQK LEVIKQFRHP KYNTSTLHHD IMLLKLKEKA 

, 13^ 140 15£ 160 170 180 

: SLTLAVGTLP FPSQFNFVPP GRMCRVAGWG RTGVLKPGSD TLQEVKLRLM DPQACSHFRY 

j 

I 19^ 200 210 220 230 240 

! FDHNLQLCVG NPRKTKSAFK GDSGGPLLCA GVAQGIVSYG RLDAKPPAVF TRISHYRPWI 



NKILQAN 



« Hide 
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Takai S., Shiota N., KobayashI S., Matsumura E., Miyazaki M. 
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Cited for : NUCLEOTIDE SEQUENCE [MRNA]. 
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Sequence databases 
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AB000823 mRNA. Translation: BAA22070.1 
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3D structure databases 



ProteinModelPortal 


1 P56435. 

i . .... — ■■ — ' — ■ 


SMR 


i P56435. Positions 22-247. 


Mod Base 


! Search... 


Protein family/group databases 




MEROPS 


S01.140. 


Phylogenomic databases 




HOVERGEN 


I HBG013304. 


Enzyme and pathway databases 




BRENDA 


: 3.4.21.39. 3438. 

» 


Family and domain databases 




InterPro 


1 IPR009003. Pept cys/ser Trypsin-like. 
IPR018114. Peptidase_S1/S6_AS. 

IPR001314. Peptidase_S1A. 
[Graphical view] 


Pfam 


PF00089. Trypsin. 1 hit. 

[Graphical view] 






SMART 


— — ~ — 

SM00020. Tryp_SPc. 1 hit. 

[vsrapnicai viewj 


SUPFAM 


SSF50494. Pept_Ser_Cys. 1 hit. 


PROSITE 


PS50240. TRYPSIN DOM, 1 hit, 
PS00134. TRYPSIN_HIS. False negative. 
PS00135. TRYPSIN_SER. 1 hit. 
[Graphical view] 


ProtoNet 

■ 


Search... 


Entry information 




Entry name 


CMA1_MACFA 


Accession 


Primary (citable) accession number: P56435 


Entry history 


Intea rated into Julv 15 1998 

UniProtKB/Swiss- 

Prot: 

Last sequence July 15. 1998 
update: 

Last modified: January 1 1 , 201 1 

This is version 60 of the entry and version 1 of the sequence. 
[Complete history] 


Entry status 


Reviewed (UniProtKB/Swiss-Prot) 


Annotation program 


Chordata Protein Annotation Program 
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Relevant documents 



Peptidase families 

Classification of peptidase families and list of entries 

SIMILARITY comments 

Index of protein domains and families 
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P52195 (CMA1_PAPHA)^ Reviewed, UniProtKB/Swiss-Prot 

Last modified January 11, 2011. Version 57. 



Names and origin 


Protein names 


Recommended name: 
Chymase 
EC=3.4,21.39 

Alternative name(s): 
Alpha-chymase 
Mast cell chymase 


Gene names 


Name: CMA1 
Synonyms: CHM 


Organism 


Papto hamadryas (Hamadryas baboon) 


Taxonomic identifier j 9557 [NCBI] 


Taxonomic lineage 


; Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > 
' Euteleostomi > Mammalia > Eutheria > Euarchontoglires > 

Primates > Haplorrhini > Catarrhini > Cercopithecidae > 

Cercopithecinae > Papio 


Protein attributes 




Sequence length 


247 AA. 


Sequence status 


Complete. 


Sequence processing 


The displayed sequence is further processed Into a mature 
form. 


Protein existence 


Evidence at transcript level. 


General annotation (Comments) ' 


Function 


Major secreted protease of mast cells with suspected roles in 
vasoactive peptide generation, extracellular matrix degradation, 
and regulation of gland secretion. 


Catalytic activity i 


Preferential cleavage: Phe-|-Xaa > Tyr-|-Xaa > Trp-|-Xaa > Leu 
-l-Xaa. 


Subcellular location 


Secreted. Cytoplasmic granule. Note: Mast cell granules. 


Sequence similarities 

■ 


Belongs to the peptidase SI family. Granzyme subfamily. 
Contains 1 peptidase SI domain. 



Ontologies 
Keywords 

Cellular component j Secreted 

Domain ; Signal 
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Molecular function 


Hydrolase 
Protease 
Serine protease 





PTM 


Disulfide bond 

Glycoprotein 

Zymogen 




Gene Ontology (GO) 
Biological process 


proteolysis 

Inferred from electronic annotation. 


Source: InterPro 


Cellular component 


extracellular region 

Inferred from electronic annotation. Source: UniProtKB-SubCell 


Molecular function 


serine-type endopeptidase activity 
Inferred from electronic annotation. 


Source: InterPro 



Complete GO annotation. 



Sequence annotation (Features) 



Feature key Position(s) | Length j Description | Graphical view 



\ Feature identifier 



Molecule processing 





Signal peptide 


1 -19 


19 


By similarity 


1 






Propeptide 


20-21 


2 


Activation 
peptide 


1 


PRO 


H 


Chain 


22 - 247 


226 


Chymase 


1 


PRO, 



Regions 

Domain 



22 - 245 



224 



Peptidase 1 I 
SI 



Sites 





Active site 


66 


1 


Charge 

relay 

system 

i By similarity I 


1 




□ 


Active site 


110 


1 


Charge 

relay 

system 

1 By similarity 


1 






Active site 


203 


1 


Charge 

relay 

system 

By similarity | 




■ 
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Amino acid modifications 



□ 



□ 



□ 



Glycosylation 



80 



1 



._f__- 



Glycosylation j 



103 



1 



Disulfide bond 



T 



51 ^67 



N-linked 
(GlcNAc ) 

I Potentian 

N-linked 

(GlcNAc ..,) 

Potential 



By similarity | 1 



11 



Disulfide bond 144 209 



By similarity 



I I 



Disulfide bond 175 188 



By similarity I 



II 



Sequences 



Sequence 



Length IVIass (Da) 



P52195[UniParc]. FASTA 247 27.339 

Last modified October 1, 1996. Version 1. 
Checksum: E0EC15E0FA72FD8B 



10 20 30 40 50 60 

MLLLPLPLLIi LFLCSRAEAG EIIGGTECKP HSRPYMAYLE IVTSNGPSKS CGGFLIRRNF 

70 80 90 100 110 120 

VLTAAHCAGR SITVTLGAHN ITEKEDTWQE LEVIKQFRHP KYNTSTLHHD IMLLKLKEKA 

130 140 150 160 170 180 

SLTLAVGTLP FPSQFNFVPP GRMCRVAGWG RTGVLKPGSD TLQEVKLRLM DPQACSHFRY 

190 200 210 220 230 240 

FDHNLQLCVG NPRKTKSAFK GDSGGPLLCA GVAQGIVSYG RLDAKPPAVF TRISHYRPWI 
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© DDBJ 



U38521 mRNA. Translation: AAA91160.1. 
U38463 Genomic DNA. Translation: AAA91 159.1. 
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ProteinModelPortal 
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SMR 


j P52196. Positions 22-247. 


ModBase 


1 Search... 


Protein family/group databases 




MEROPS 


S01.140. 


Phylogenomic databases 




HOVERGEN 


HBG013304, 


Enzyme and pathway databases 




BRENDA 


1 3.4.21.39. 39388. 


Family and domain databases 




InterPro 


IPR009003. Pept_cys/ser_Trypsin-like. 

IPRni8114 Pentidase S1/S6 AS 
IPR001254. Peptidase S1 S6. 
IPR001314. Peptidase_S1A. 
[Graphical view] 


Pfam 


PF00089. Trypsin. 1 hit. 

1 1 i 1 s/Ci i view J 

1 .. .. .-— ....... .-HM- . 


PRINTS 


PR00722. CHYMOTRYPSIN. 


SMART 


SM00020. Tryp_SPc. 1 hit. 
[Graphical view] 


SUPFAM 


SSF50494. Pept_Ser_Cys. 1 hit. 


PRn^iTP 

1 r\ wo 1 1 c 


PS00134. TRYPSIN HIS. 1 hit. 
PS00135. TRYPSIN_SER. 1 hit. 
[Graphical view] 


ProtoNet 


Search... 


Entry information 




Entry name 


CMA1_PAPHA 


Accession 


Primary (citable) accession number: P52195 


Entry history 

• 


Integrated into October 1, 1996 

UniProtKB/Swiss- 

Prot: 

Last seauence October 1 . 1 996 
update: 

Last modified: January 1 1 , 201 1 

This is version 57 of the entry and version 1 of the sequence. 
[Complete history] 


Entry status 


Reviewed (UniProtKB/Swiss-Prot) 


Annotation program 


Chordata Protein Annotation Program 


Relevant documents 





Peptidase families 

Classification of peptidase families and list of entries 
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P50339 (CMA1_RAT)W Reviewed, UniProtKB/Swiss-Prot 

Last modified January 1 1 , 201 1 . Version 75. 



Names and origin 



Protein names 


Recommended name: 
Chymase 
EC=3.4.21.39 

Alternative name(s): 
Alpha-chymase 
Mast cell protease 3 

' Short name=rMCP-3 

Mast cell protease 5 

1 Short name=rMCP-5 

Mast cell protease III 

Short name=rMCP-lll 


Gene names 


i 

Name: Cmal 
Synonyms: Mcpt3 


V^l \JCil 1191 1 1 




1 axononiic laeniiTier 


1 U i lO l^lMUrDlJ 


Taxonomic lineage 


Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > 
Euteleostomi > Mammalia > Euthena > Euarchontoglires > Glires 
> Rodentia > Sciurognathi > Muroidea > Muridae > Murinae > 
r\aiius 


Protein attributes 




Sequence length 


247 AA, 


Sequence status 


Complete, 


Sequence processing 

■ 


The displayed sequence is further processed into a mature 
form. 


Protein existence 


Evidence at transcript level. 

« 


General annotation (Comments) 


r 

Function 


Major secreted protease of mast cells with suspected roles in 
vasoactive peptide generation, extracellular matrix degradation, 

anri regulation nf glanH 9if^.rre>\\nn 1 By similarity | 


Catalytic activity 


Preferential cleavage: Phe-|-Xaa > Tyr-|-Xaa > Trp-|-Xaa > Leu 
-l-Xaa. 


Subcellular location 


Secreted. Cytoplasmic granule. Note: Secretory granules. 


Tissue specificity 


Mast cells. 


Sequence similarities 

1 


Belongs to the peptidase S1 family. Granzyme subfamily. 
Contains 1 peptidase S1 domain. 
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Ontologies 



Keywords 

Cellular component 


Secreted 


Domain 


Signal 


Molecular function 


Hydrolase 
Protease 
Serine protease 


PTM 


Disulfide bond 

Glycoprotein 

Zymogen 


Gene Ontology (GO) 




Biological process 


proteolysis 

Inferred from mutant phenotype. Source: ROD 


Cellular component 


extracellular region 

Inferred from electronic annotation. Source: UniProtKB-SubCell 


Molecular function 


peptide binding 

Inferred from mutant phenotype. Source: RGD 

serine-type endopeptidase activity 

Inferred from mutant phenotype. Source: RGD 



Complete GO annotation... 



Sequence annotation (Features) 



Mok 


Feature key 

^cule processini 
Signal peptide 


Position(s) 

9 

1-19 


Length 
19 


Description 


Graphical view 

1 


Feature identifier 






Potential | 


□ 


Propeptide 


20-21 

^ 1 


2 


Activation 
peptide 


1 


PRO_000002744c 




Reg 

Site) 

□ 


Chain 

ions 

Domain 

Active site 

1 


22 - 247 
22 - 245 

66 


226 
224 

■ 

1 


Chymase 

Peptidase 
SI 

Charge 
relay 

system 

1 By similarity t 


1 

1 1 
1 


PRO 000002744^ 

— 




□ 


Active site 


110 


1 


Charge 
relay 


1 
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□ 



Active site 



203 



1 



system 



{ By similarity 



Charge 

relay 

system 



By similarity I 



Amino acid modifications 

□ 



□ 



Glycosylation 


80 


1 


N-linked 
(GlcNAc.) 

1 Potential I 


1 


Disulfide bond 


51 ^67 




By similarity j 


11 


Disulfide bond 


144 ^ 209 




By similarity j 


1 


Disulfide bond 


175^ 188 




By similarity 





II 



Sequences 



Sequence Length IVlass (Da) 



P50339 [UniParc]. FASTA 247 27,569 

Last modified October 1 . 1 996. Version 1 . 
Checksum: 6525D7BF1BFDF053 



1^ 20 30 40^ 50 60_ 

MNLHALCLLL LLLGSSTKAG EIIGGTECIP HSRPYMAYLE IVTSDNYLSA CSGFLIRRNF 

70 80 90 100 110 120^ 

VLTAAHCAGR SITVLLGAHN KTYKEDTWQK LEVEKQFIHP NYDKRIiVLHD IMLLKLKEKA 

130 14^ 150 160 170 180 

KLTLGVGTLP LSANFNFIPP GRMCRAVGWG RTNVNEPASD TLQEVKMRLQ EPQSCKHFTS 

190 200 21^ 220 230 240 

FQHKSQLCVG NPKKMQNVYK GDSGGPLLCA GIAQGIASYV HPNAKPPAVF TRISHYRPWI 



i NKILREN 

« Hide 
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Sequence databases 



sS^ tllVIDL 

® GenBank 
W DDBJ 


D'^MQ^ mRMA TranQlatinn- RAAOy^inT 1 
I-^oo*tC/*J 1 1 liMN/A. 1 1 di loldllwi 1. ar\rWJ f sJ\J f . i . 

U67908 mRNA. Translation: AAB48261.1. 

i 


IPI 


1 IPI001 97397. 


PIR 


S59135. 


KeTbeq 




UniGene 


Rn.10182, 


3D structure databases 




ProteinModelPortal 


P50339. 


SMR 


P50339. Positions 22-247. 


Mod Base 


Search,.. 


Protein family/group databases 




MEROPS 


S01.150. 


Proteomic databases 




PRIDE 


P50339. 


Odii/lllt? ailllvlaUUii UciulUcl9e9 






ZooZf. 


KEGG 


rno:25627. 


Organism-specific databases 




CTD 


25627. 


RGD 


2365. Cma1. 


Phylogenomic databases 




eggNOG 


roNOG06568. 


HOVERGEN 


HBG013304. 


Enzyme and pathway databases 




BRENDA j 


3.4.21.39.248. 


Gene expression databases 




Genevestigator ^ 


P50339. 



I 

i 



Chymase precursor - Rattu^prvegicus (Rat) 



+ i Additional computationally mapped references. 



C ros s -ref e re n ces 
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Family and domain databases 

InterPro 


1PR009003. Pept cys/ser Trypsin-like. 
IPR018114. Peptidase_S1/S6_AS. 
IPR001254. Peptidase S1 S6. 
IPR001314. Peptidase„S1A. 
[Graphical view] 


Pfam 


PF00089 TrvDsin. 1 hit. 
[Graphical view] 


PRINTS 


PR00722. CHYMOTRYPSIN, 


SMART 


SM00020. Tryp^SPc. 1 hit. 
[Graphical view] 


SUPFAM 


SSF50494. Pept_Ser_Cys. 1 hit. 


PROSITE 


PS50240. TRYPSIN DOM. 1 hit. 
PS00134. TRYPSIN HIS. 1 hit. 
PS00135. TRYPSIN_SER. 1 hit. 
[Graphical view] 


ProtoNet 

Other Resources 
NextBio 

Entry information 


Search,.. 
607417. 


Entry name { CMA1_RAT 


1 " ■ " 

Accession Primary (citable) accession number: P50339 

I Secondary accession number(s): Q9R2C8 


Entry history 

i 

! 
! 


Integrated into October 1, 1996 

UniProtKB/Swiss- 

Prot: 

Last sequence October 1, 1996 
update: 

Last modified: January 1 1 , 201 1 

This is version 75 of the entry and version 1 of the sequence. 
[Complete history] 


Entry status j Reviewed (UniProtKB/Swiss-Prot) 
Annotation program i Chordata Protein Annotation Program 



Relevant documents 



Peptidase families 

Classification of peptidase families and list of entries 

SIMILARITY comments 

Index of protein domains and families 
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P00784 (PAPA1_CARPA) ^ Reviewed, UniProtKB/Swiss-Prot 

Last modified January 1 1 , 201 1 . Version 95. 



Names and origin 



Protein names 



Recommended name: 

Papain 
EC=3.4.22.2 
Altemative name(s): 

Papaya proteinase I 

Short name=PPI 

Allergen=Car p 1 



Organism 



T 



Taxonomic identifier 
Taxonomic lineage 



Carica papaya (Papaya) 

3649 [NCBI] 



T 



i 



Eukaryota > Viridiplantae > Streptophyta > Embryophyta > 
Tracheophyta > Spermatophyta > Magnoliophyta > 
eudicotyledons > core eudicotyledons > rosids > maivids > 
Brassicales > Caricaceae > Carica 



Protein attributes 



Sequence length 


345 AA. 


Sequence status 


Complete. 


Sequence processing 


The displayed sequence is further processed Into a mature 
form. 


Protein existence 


Evidence at protein level. 


General annotation (Comments) 


Catalytic activity 


Hydrolysis of proteins with broad specificity for peptide bonds, 
but preference for an amino acid bearing a large hydrophobic 
side chain at the P2 position. Does not accept Val in P1*. 


Allergenic properties 

•-— — — ■ ■ -•- — —— - — — — ■ ■ ■ — — — 1 ^ - — 


Causes an allergic reaction in human. 



Sequence similarities 



Ontologies 



Belongs to the peptidase C1 family. 



Keywords 

Disease 
Domain 



! Allergen 
1 Signal 



Molecular function 


Hydrolase 




Protease 




Thiol protease 


PTM 


Disulfide bond 




Zymogen 
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Technical term 



{ 



3D-structure 

Direct protein sequencing 



Gene Ontology (GO) 



Biological process 


proteolysis 

Inferred from electronic annotation. Source: InterPro 


Molecular function 


cysteine-type endopeptidase activity 

Inferred from electronic annotation. Source: InterPro 


Complete GO annotation... 




Sequence annotation (Features) 



I Feature key 

Molecule processing 

Signal peptide 



Position(s) Length Description Graphical view 



I Feature identifier 





18 ' I Potential 



Activation j 
peptide j 



r 



PRO 000002640 



Papain 



PRO_000002^0 



Sites 



o 


Active site 


168 


1 


1 Ref.4 


1 








Id 


Active site 


292 


1 




1 






H 


Active site 


308 


1 


Ref.4 


1 







Amino acid modifications 



Disulfide bond 



Disulfide bond 



155 ^ 196 



189^228 



Ref.2 



Ref.2 



I I 



I I 



Disulfide bond 



Experimental info 



286 ^ 333 



I Ref.2 



I I 



□ 


Sequence conflict 


180 

--^ 

219-220 


1 

. . . 

2 


E-^QAA 
sequence 

1 Ref.2 1 


1 

i 


□ 


Sequence conflict 




YP-»PY 
AA 

sequence 

1 Ref.2 1 


□ 


Sequence conflict 


251 


1 


E — Q AA 
sequence 

1 Ref.2 i 


1 
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□ 



Sequence conflict 



268 



1 I E->QAA 
seque nce 

i Ref.2 I 



Secondary structure 



Z4i 



I Helix Q Strand | Turn 



Details... 



Sequences 

Sequence Length Mass (Da) 

□ P00784 [UniParc]. FASTA 345 38.922 

Last modified January 1, 1988. Version 1. 
Checksum: 82D9FB35EDCA12EF 

i 

; 10 20 30 40 50 60 ; 

■ MAMIPSISKIj LFVAICLFVY MGLSFGDFSI VGYSQNDIiTS TERLIQLFES WMLKHNKIYK 

t 

70 80 90 100 110 12£ j 

; NIDEKIYRFE IFKDNLKYID ETNKKNNSYW LGLNVFADMS NDEFKEKYTG SIAGNYTTTE ■ 

■ 130 140 150 160 170 180 | 
; LSYEEVLNDG DVNIPEYVDW RQKGAVTPVK NQGSCGSCWA FSAWTIEGI IKIRTGNLNE 

t 

190 200 210 220 230 240 ^ 

' YSEQELLDCD RRSYGCNGGY PWSAIiQLVAQ YGIHYRNTYP YEGVQRYCRS REKGPYAAKT \ 

[ i 

\ 250 260 270 280 290 300 } 

j DGVRQVQPYN EGALLYSIAN QPVSWLEAA GKDFQLYRGG IFVGPCGNKV DHAVAAVGY6 | 

I I 
310 320 330 340 j 

; PNYILIKNSW GTGWGENGYI RIKRGTGNSY GVCGLYTSSF YPVKN i 

! . i 

« Hide 



References 



[1] 


"Cloning and sequencing of papain-encoding cDNA." 

Cohen L.W., Coghlan V.M., Dihel L.C. 

Gene 48:219-227(1986) [PubMed: 2881845] [Abstract] 

Cited for: NUCLEOTIDE SEQUENCE [MRNA]. 


[2] 


"The complete amino acid sequence of papain. Additions and corrections." 

Mitchel R.E.J., Chaiken I.M., Smith E.L. 

J. Biol. Chem. 245:3485-3492(1970) [PubMed: 5470818] [Abstract] 
Cited for: PROTEIN SEQUENCE OF 134-345. 


13] 


"A reinvestigation of residues 64-68 and 175 in papain. Evidence that residues 64 and 
175 are asparagine." 

Husain S.S., Lowe G. 

Biochem. J. 1 16:689-692(1970) [PublVled: 5435495] [Abstract] 
Cited for: SEQUENCE REVISION TO 197. 


[4] 


"Structure of papain." 

Drenth J., Jansonius J.N., Koekoek R.. Swen H.M.. Wolthers B.G. 



http://www.uniprot.org/uniprot/P00784 



4/12/2011 



Papain precursor - Carica p||^a (Papaya) Page 4 of 6 






Nature 218:929-932(1968) [PubMed: 6681232] [Abstract] 
Cited for: X-RAY CRYSTALLOGRAPHY (2.8 ANGSTROMS). 


[5] 


"Structure of papain refined at 1.65-A resolution." 

Kamphuis LG., Kalk K.H., Swarte M.B.A., Drenth J. 

J. Mol. Biol. 179:233-256(1984) [PubMed: 6502713] [Abstract] 

Cited for; X-RAY CRYSTALLOGRAPHY (1.65 ANGSTROMS). 




"The refined 2.4 A X-ray crystal structure of recombinant human stefin B in complex 
with the cysteine proteinase papain: a novel type of proteinase inhibitor interaction." 

Stubbs M.T., Leber B., Bode W., Huber R., Jerala R., Lenarcic B.. Turk V. 
EMBO J. 9:1939-1947(1990) [PubMed: 2347312] [Abstract] 
Cited for: X-RAY CRYSTALLOGRAPHY (2.4 ANGSTROMS). 


[7] 


"Crystal structure of papain-succinyl-Gln-Val-Val^la-Ala-p-nitroanilide complex at 1.7-A 
resolution: noncovalent bindina mode of a common seauence of endoaenous thiol 
protease inhibitors." 

Yamamoto A., Tomoo K., Doi M., Ohishi H., Inoue M., Ishida T., Yamamoto D., Tsuboi S.. 
Okamoto H., Okada Y. 

Biochemistry 31:11305-11309(1992) [PubMed: 1445868] [Abstract] 
Cited for: X-RAY CRYSTALLOGRAPHY (1.7 ANGSTROMS). 


[8] 


Structure of monoclinic papain at 1.60-A resolution. 

Pickersgill R,W,, Harris G.W., Garman E. 
Acta Crystallogr. 6 48:59-67(1992) 

Cited for: X-RAY CRYSTALLOGRAPHY (1.6 ANGSTROMS). 


+ 


Additional computationaliy mapped references. 



Web resources 



Worthington enzyme manual 



Cross-references 



Sequence databases 




@EMBL 


M15203 mRNA. Translation: AAB02650.1. 


® GenBank 




® DDBJ 




PIR 


PPPA. A26466. 



3D structure databases 



® PDBe 

® RCSB PDB 
©PDBj 



Entry 


Method 


Resolution 




(A) 


1BP4 


X-ray 


2.20 


1BQI 


X-ray 


2.50 


1CVZ 


X-ray 


1.70 


1EFF 


model 




1KHP 


X-ray 


2.00 


1KHQ 


X-ray 


1.60 


1PAD 


X-ray 


2.80 


1PE6 


X-ray 


2.10 


1PIP 


X-ray 


1.70 


1POP 


X-ray 


2.10 


1PPD 


X-ray 


2.00 


1PPN 


X-ray 


1.60 


1PPP 


X-ray 


1.90 



Chain Positions PDBsum 



A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 



1 34-345 
134-345 
1 34-345 
1 34-345 
134-345 
1 34-345 
134-345 
134-345 
134-345 
1 34-345 
134-345 
1 34-345 
1 34-345 



[»1 

» 

» 

[»] 

[»: 
[»: 
[»: 

[»] 
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1STF 
2CIO 
2PAD 
3E1Z 
3IMA 
3LFY 
4PAD 
6PAD 
6PAD 
9PAP 



X-ray 
X-ray 
X-ray 
X-ray 
X-ray 
X-ray 
X-ray 
X-ray 
X-ray 
X-ray 



2.37 
1.50 
2.80 
1.86 
2.03 
2.60 
2.80 
2.80 
2.80 
1.65 



E 

A 

A 

B 
A/C 
A/C 

A 

A 

A 

A 



134-345 

134-345 

1 34-345 

134-345 

1 34-345 

1 34-345 

1 34-345 
134-345 

1 34-345 
1 34—345 



[»] 

[»] 

[»] 
[»] 

[»] 

[»] 



Protei n Model Portal 


P00784. 


SMR 


P00784. Positions 38-345. 


Mod Base 


Search... 




Protein family/group databases 






Allergome 


709. Car p 1. 




MEROPS 


C01.001. 




Enzyme and pathway databases 






BRENDA 


3.4.22.2. 18730. 




Family and domain databases 






InterPro 


IPR000169. Pept cys AS. 
IPR01 31 28. Peptidase CIA, 
IPR000668. Peptidase CIA C. 
IPR013201. ProtJnhibJ29. 
[Graphical view] 




PANTHER 


PTHR12411. Peptidase_C1A, 1 hit. 


Pfam 


PF08246. Inhibitor 129. 1 hit. 
PF00112. Peptidase_C1. 1 hit. 
[Graphical view] 


PRINTS 


PR00705. PAPAIN. 


SMART 


SM00848, Inhibitor 129. 1 hit. 
SM00645. Pept.CI. 1 hit. 

[Graphical view] 


PROSITE 


PS00640. THIOL PROTEASE ASN. 1 hit. 
PS00139. THIOL PROTEASE CYS. 1 hit. 
PS00639. THIOL_PROTEASE_HIS. 1 hit 
[Graphical view] 




ProtoNet 


Search... 




Other Resources 






BindingDB 


P00784. 





Entry information 
Entry name , PAPA1_CARPA 

Accession ' Primary (citable) accession number: P00784 
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Entry history 



Integrated into July 21 , 1986 

UniProtKB/Swiss- 

Prot: 



Last sequence January 1, 1988 
! update: 
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This is version 95 of the entry and version 1 of the sequence. 
[Complete history] 



Entry status 



Reviewed (UniProtKB/Swiss-Prot) 



Annotation program 



Plant Protein Annotation Program 



Relevant documents 



Allergens 

Nomenclature of allergens and list of entries 
PDB cross-references 

Index of Protein Data Bank (PDB) cross-references 
Peptidase families 

Classification of peptidase families and list of entries 

SIMILARITY comments 

Index of protein domains and families 
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P08246 (ELNE_HUMAN)* Reviewed, UniProtKB/Swiss-Prot 

Last modified April 5, 2011. Version 129, 



Names and origin 


Protein names 


Recommended name: 
Neutrophil elastase 

Alternative name(s): 

Bone marrow serine protease 
Elastase-2 

Human leukocyte elastase 

Short name=HLE 

Medullasin 
PMN elastase 


Gene names 


Name: ELANE 
Synonyms: ELA2 


Organism 


Homo sapiens (Human) [Complete proteome] 

■ ^ r lb * ■ ■ 


Taxonomlc identifier 


9606 [NCBI] 

._ - . _ 


Taxonomic lineage 


Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > 
Euteleostomi > Mammalia > Eutheria > Euarchontoglires > 
Primates > Haplorrnini > Catarrhini > Hominidae > Homo 


Protein attributes 




Sequence length 


267 AA. 


Sequence status 


Complete. 


Sequence processing 


The displayed sequence is further processed into a mature 
form. 


Protein existence 


Evidence at protein level. 


General annotation (Comments) 


Function 


Modifies the functions of natural killer cells, monocytes and 
granulocytes. Inhibits C5a-dependent neutrophil enzyme 

release and cheinntaYis 1 Ref.lS 1 


Catalytic activity 


Hydrolysis of proteins, including elastin. Preferential cleavage: 
Val-I-Xaa > Ala-|-Xaa. 


Subunit structure 


Interacts with NOTCH2NL. UMIL 


Tissue specificity 


Bone marrow cells. 


Involvement in disease 


Defects in ELANE are a cause of cyclic haematopoiesis (CH) 
[MIM:1 62800]; also known as cyclic neutropenia. CH is an 
autosomal dominant disease in which blood-cell production 
from the bone marrow oscillates with 21 -day periodicity. 
Circulating neutrophils vary between almost normal numbers 
and zero. During intervals of neutropenia, affected individuals 
are at risk for opportunistic infection. Monocytes, platelets, 
lymphocytes and reticulocytes also cycle with the same 

fr^aqiiAnry 1 Ref.16 11 Ref.21 | 



http ://www .un iprot.org/un iprot/P08246 
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Protein 

Translations of Ufe 
Display Settings: OenPept 

neutrophil elastase [Homo sapiens] 

GenBanIc AAA3635g.1 

PASTA SssmSA 



LOCUS 

DEFINITION 
ACCESSION 
VERSION 
DBSOURCE 



267 aa 
[Homo sapiens] 



AAA36359 

neutrophil elastase 
AAA36359 

AAA36359.1 GI:386981 
locus HUMNELl accession 
locus HDMNEL2 accession M20200 



linear 



PRI 27-APR-1993 



locus HUMNEL4 accessic 
locus HDMNEL5 accession M20203 



KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 

TITLE 
JOURNAL 

PUBMED 
COMMENT 

FEATURES 

source 



M20199 


.1 


M20200 


. 1 


M20201 


.1 


M20202 


. 1 


M20203 


. 1 



Protein 



Re cjion 



Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria,- Euarchontoglires; Primates; Haplorrhini; 
Catarrhini; Hominidae; Homo. 
1 (residues 1 to 267) 

Takahashi , H . , Nukiwa^T., Yoshimura, K. , Quick, CD. « States, D. J., 
Holmes, M.D., Whang- Peng, J. , Knutsen,T. and Crystal, R. 6. 
Structure o£ the- human neutrophil elastase gene 
J. Biol. Chem. 263 (29), 14739-14747 (1988) 

2902087 

On Aug 28, 1993 this sequence version replaced gi ; 189149 . 
Method: conceptual translation. 
liOcation/Quali fiers 

1. .267 

/organisms "Homo sapiens" 
/ db_^xr e f ■ " t axon : 9606 " 
1..267 

/name = "neutrophil elastase" 
30. .245 

/region_neime = " Tryp_SPc " 

/note= "Trypsin- like serine protease,- Many of these are 
synthesized as inactive precursor zymogens that are 
cleaved during limited proteolysis to generate their 

active forms. Alignment contains also inactive enzymes 



Site 



Site 



Site 



CDS 



ORIGIN 



that have substitutions of the catalytic. . 

/db_xref»"CDD: 29152 » 

30 

/si te_type= " cleavage" 

/db xref="CDD: 29152 " 

order (70, 117,202) 

/si te_type=" active" 

/db xref°"CDDi 29152 " 

order (196,217,219) 

/si te_type= " other " 

/note=" substrate binding sites" 

/db_xre f « "CDD : 2 9152 " 

1 . . 267 

/coded_by=" join (M2 0199.1 : 144 5. . 1511,M20200 
M20201.1:51. . 192, M20202 . 1 : 49 . . 279,M20203 . 1 



Cd00190" 



1:50 . .206, 
49. .255) » 



// 



1 mtlgrrlacl flacvlpall Iggtalasei vggrrarpha wpfmvslqlr gghfcgatli 
61 apnfvmsaah cvanvnvrav rwlgahnls rreptrqvfa vqrifengyd pvnllndivi 
121 Iqlngsatin anvqvaqlpa qgrrlgngvq clamgwgllg rnrgiasvlq elnvtwtsl 
181 crrsnvctlv rgrqagvcfg dsgsplvcng lihgiasfvr ggcasglypd afapvaqfvn 
241 widsiiqrse dnpcphprdp d^asrth 



http://www.ncbi.nlm.nih.gov/protein/AAA36359 
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Protein 

Translations of Life 



Display Settings: GenPept 

elastase/medullasin precursor (EC 3.4.21.37) [Homo sapiens] 

GenBanlc AAA36173.1 



SSdiOL 

LOCUS 

DEFINITION 

ACCESSION 

VERSION 

DBSOXniCB 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
PUBMED 
COMMENT 

FEATURES 

source 



Protein 

sig peptide 
mat peptide 
Region 



AAA36173 267 aa linear PRI ll-JUN-1993 

elastase/medullasin precursor (EC 3.4.21.37} [Homo sapiens]. 
AAA36173 

AAA36173.1 GI: 307123 

locus HUMLEUELA accession M34379. 1 

Homo sapiens (hiunan) 

Homo sapiens 

Bukaryota; Metazoa; Chordata; Craniata; Vertebrata; Buteleostoroi; 
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; 
Catarrhini; Hominidae; Homo. 
1 (residues 1 to 267) 

OkanOfK., Aoki,Y., Shimizu^H. and Naruto«M. 

Functional expression o£ human leukocyte elastase (HLE) /medullasin 
in eukaryotic cells 

Biochem. Biophys. Res. Commun. 167 (3), 1326-1332 (1990) 
232227S 

On Jul 26, 1993 this sequence version replaced gi ; 1S71X7 . 
Method: conceptual translation. 

Location/Qualifiers 

1. .267 

/organisms "Homo sapiens" 
/db xref a"taxon : 9606 " 
1. .267 



Site 



Site 



Site 



CDS 



ORIGIN 



/name» "elastase/medullasin precursor (EC 3.4.21.37)" 
1. .27 

/note>*'elasta8e/medullasin signal peptide" 
30, . 267 

/pr oduc t = " e las t ase /medul lasin " 
30. .245 

/region_name= "Tryp_SPc" 

/note= "Trypsin- like serine protease; Many of these are" 
synthesized as inactive precursor zymogens that are 
cleaved during limited proteolysis to generate their , 
active forms. Alignment contains also inactive enzymes 
that have substitutions of the catalytic...; cd00190" 
/ db_x re f «" CDD : 29152 " 
30 

/ site_type=" cleavage" 

/db_xr e f = " CDD : 29152 " 

order (70,117,202) 

/ si te_type=" active" 

/ db_xre f =" CDD : 29152 " 

order (196 , 217, 219) 

/ si te_type = " other " 

/notes "substrate binding sites" 

/db_xr e f =" CDD : 29152 " 

1. .267 

/coded__by="M34379. 1:39. .842" 



// 



1 mtlgrrlacl flacvlpall Iggtalasei vggrrarpha wpfmvslqlr gghfcgatli 
61 apnfvmsaah cvanvnvrav rwlgahnls rreptrqvfa vqrifengyd pvnllndivi 
121 Iqlngsatin anvqvaqlpa qgrrlgngvq clamgwgllg rnrgiasvlq elnvtwtsl 
181 crrsnvctlv rgrqagvcfg dsgsplvcng lihgiasfvr ggcasglypd afapvaqfvn 
241 widsiiqrse dnpcphprdp dpasrth 



http://www.ncbi.nlm.nih.gov/protein/AAA36173 
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P37357 (EL2A_HORSE) * Reviewed, UniProtKB/Swiss-Prot 

Last modified January 1 1 . 201 1 . Version 52. 



Names and origin 


Protein names 


Recommended name: 
Neutrophil elastase 2A 
EC=3.4.21.- 

Altemative name(s): 
Proteinase 2A 


Organism 


Equus caballus (Horse) 


Taxonomic identifier 


9796 [NCBI] 


Taxonomic lineage 

Protein attributes 


Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > 
Euteleostomi > Mammalia > Eutheria > Laurasiatheria > 
Perissodactyla > Equidae > Equus 


Sequence length j 85 AA. 
Sequence status j Fragments. 


Protein existence j Evidence at protein level. 

General annotation (Comments) 


Function 


May be involved In the degradation of connective tissue In 
chronic lung disease. 


: 

Sequence similarities . 

: 
: 

Ontologies 


Belongs to the peptidase 81 family. Elastase subfamily. 
Contains 1 peptidase 81 domain. 


Keywords 

Molecular function 

i 
\ 

4 


Hydrolase 
Protease 
Serine protease 


Technical term 

Gene Ontology (GO) 
Biological process 


Direct protein sequencing 
proteolysis 

Inferred from electronic annotation. Source: InterPro 


Molecular function 
Complete GO annotation... 


serine-type endopeptidase activity 

Inferred from electronic annotation. Source: InterPro 



http://www.uniprot.org/uniprot/P37357 
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Sequence annotation (Features) 



Feature key 



Position t Length 
(s) I 



Description 



Graphical view 



Feature identifie 



I 



Molecule processing 

Chain 



1 -85 



85 I Neutrophil 
elastase 
2A 



PRO 00000886 



Regions 

B ! Domain 



1 -~85 



85 Peptidase 
S1 



Sites 



Active site 



67 



1 



Charge 

relay 

system 



Experimental info 

I Non-adjacent residues 1 34 - 35 i 



II 



Ell { Non-adjacent residues 59-60 



Sequences 



II 



Sequence 


Length 


Mass (Da) 


B P37357 [UniParc]. 


FASTA 85 


8,893 


Last modified October 1, 1994. Version 1. 
Checlcsum: 0ABE7D44A1 B90E26 






I 20 30 40 5 0 ^£ 

! IVGGRAAEPH SRPYMVSLQI RGNPGSHFCG GT1.IMGWGRL GTREPLPXVL QEU^VTWTA 1 


j 70 80 

i GICFGDSGGP LICNGVAQGV FSFVR 




i 

i 
t 

i 


« Hide 
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"Structural and functional characterization of elastases from horse neutrophils. 

Dubin A., Potempa J,, Travis J. 

Biochem, J. 300:401^06(1994) [PubMed: 7516152] [Abstract] 
Cited for : PROTEIN SEQUENCE. 
Tissue : Neutrophil. 
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C ros s -ref e re n ces 



Sequence databases 



PIR 


1 S44461. 




3D structure databases 






ProteinModelPortal 


P37357. 




SMR 


P37357. Positions 1-34, 


ModBase 


Search... 



Protein family/group databases 
MEROPS 

Family and domain databases 
InterPro 



1 S01.131. 



IPR009003. Pept_cys/ser_Trypsin-like. 
IPR018114. Peptidase_S1/S6_AS. 
IPR001254. Peptidase„S1_S6. 

[Graphical view] 



Pfam 



PF00089. Trypsin. 2 hits. 
[Graphical view] 



SIVIART 


t SM00020, Tryp^SPc. 1 hit. 
[Graphical view] 


SUPFAM 


SSF50494. Pept_Ser_Cys. 1 hit. 


PROSITE 


PS50240. TRYPSIN DOM. Partial match. 
PS00134, TRYPSIN HIS. Partial match. 
PS00135, TRYPSIN_SER. 1 hit. 
[Graphical view] 


ProtoNet 


Search... 


Entry information 




Entry name 


EL2A_HORSE 


Accession 


Primary (citable) accession number: P37367 


Entry history 

■ 

: 

i 

1 

< 

i 


- - 

Integrated into October 1, 1994 

UniProtKB/Swiss- 

Prot: 

Last sequence October 1 , 1 994 
update: 

Last modified: January 1 1 , 201 1 

This is version 52 of the entry and version 1 of the sequence. 

[Complete history] 



Entry status 



Annotation program 



I 



Reviewed (UniProtKB/Swiss-Prot) 
Chordata Protein Annotation Program 



Relevant documents 



Peptidase families 

Classification of peptidase families and list of entries 



http://www.uniprot.org/uniprot/P37357 
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SIMILARITY comments 

Index of protein domains and families 



©2002-2011 UniProt Consortium | License & Disclaimer | Contact 



EMBL-EBI 




« 



c 



http://www.uniprot.org/uniprot/P37357 
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P37358 (EL2B_HORSE) # Reviewed, UniProtKB/Swiss-Prot 

Last modified January 1 1 , 201 1 . Version 50. 



Names and origin 


Protein names 


Recommended neme: 
Neutrophil elastase 2B 
EC=3.4.21.- 

Altemative name(s): 
Proteinase 2B 


Organism 


Equus caballus (Horse) 


Taxonomic identifier 


9796 [NCBI] 


Taxonomic lineage 


Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > 
Euteleostomi > Mammalia > Eutheria > Laurasiatheria > 
Perissodactyla > Equidae > Equus 


Protein attributes 




Sequence length 


73 AA. 


Sequence status 


Fragments. 


Protein existence 


Evidence at protein level. 


General annotation (Comments) 


Function 


May be involved in the degradation of connective tissue in 
chronic lung disease. 


Sequence similarities 


Belongs to the peptidase S1 family. Elastase subfamily. 
Contains 1 peptidase S1 domain. 


Ontologies 




Keywords 




Molecular function 

■ 

■ 


Hydrolase 
Protease 
Serine protease 


Technical term 


Direct protein sequencing 


Gene Ontology (GO) 




Biological process 


proteolysis 

Inferred from electronic annotation. Source: InterPro 


Molecular function 


serine-type endopeptidase activity 

Inferred from electronic annotation. Source: InterPro 



Complete GO annotation... 



http://ww\v.uniprot.org/uniprot/P37358 
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Sequence annotation (Features) 



Feature key 



Position Length | Description j Graphical view 
(s) 



Feature identifie 



Molecule processing 

Chain 



1 -73 



73 I Neutrophil 
eiastase 
2B 



PRO 0000088C 



Regions 

□ Domain 



1-73 



73 



Peptidase 
S1 



Sites 

□ I Active site 



64 



1 I Charge 
I relay 
I system 



Experimental info 



□ 


Non-adjacent residues 


31 - 32 1 2 


1 II 1 1 

J .._ ..J..... „ 


□ 


Non-adjacent residues 


56 - 57 i 2 

1 


1 " 


1 



Sequences 



Sequence 



P37358 [UniParc]. 

Last modified October 1, 1994. Version 1 
Checksum: AFFB0B330DB69041 



FASTA 



Length Mass (Da) 

73 7,615 



10^ 20 30 40 50 60 

; IVGGRPARPH AWPFMASLQR RGGHFCGATL IMGWGQIX3TN RPLPSVLQEL NVTWTAGIC 

' FGDSGGPLVC NGL 

« Hide 
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^ "Structural and functional characterization of elastases from horse neutrophils.'* 

Dubih A., Potempa J., Travis J. 

Biochem. J. 300:401-406(1994) [PubMed: 7516162] [Abstract] 
Cited for : PROTEIN SEQUENCE. 
Tissue : Neutrophil. 
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Cross-references 


Sequence databases 


• 


PIR 


1 S44462. 


3D structure databases 




ProteinModelPortal 


1 P37358. 


SMR 


1 P37358. Positions 1-73. 


ModBase 


1 Search... 


Protein family/group databases 




MEROPS 


S01.131. 


Family and domain databases 




InterPro 


i IPR009003. Pept cys/ser Trypsin-like. 
1 IPR01 81 14. Peptidase S1/S6 AS. 
i IPR001254. Peptidase^1_S6. 
[Graphical view] 


Pfam 


PF00089. Trypsin. 1 hit. 
[Graphical view] 


SUPFAM 


SSF50494. Pep1^Ser_Cys. 1 hit. 


PROSITE 


PS50240. TRYPSIN^DOM. Partial match. 
PS00134. TRYPSIN HIS. Partial match. 
PS00135. TRYPSIN.SER. 1 hit. 

^Otdpni^al VI6WJ 


PrntnNl^f 


OcaiOi 1... 


Entry information 




Entry name 


EL2B_HORSE 


Accession 


Primary (citable) accession number: P37358 


— - 

■ 
■ 


integrated into Uctober 1, 1994 

UniProtKB/Swiss- 

Prot: 

i-aol sew|U6nce wciuuer i , i ^^sh 

update: 

Last modified: January 1 1 , 201 1 

This is version 50 of the entry and version 1 of the sequence. 
[Complete history] 


Entry status 


Reviewed (UniProtKB/Swiss-Prot) 


1 

Annotation program 


Chordata Protein Annotation Program 



Relevant documents 



Peptidase families 



http://www.uniprot.org/uniprot/P37358 
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Classification of peptidase families and list of entries 



SIMILARITY comments 

Index of protein domains and families 



©2002-2011 UniProt Consortium | License & Disclaimer | Contact 

h Si 



EMBL-EBi :i 



http://www.uniprot.org/uniprot/P37358 



4/12/2011 



Neutrophil elastase - Mus musculus (Mouse) 



Page 1 of 4 



Q9Z284 (Q9Z284_MOUSE) # Unreviewed. UniProtKB/TrEMBL 

Last modified April 5. 2011. Version 77, 



Names and origin 


Protein names 


Submitted name: 

Nftiitrnphil f^laRtaRP 1 EMBL AAC79702 1 


Gene names 


Nflm*>- Flanf^ 1 MGI 2679229 | 

«;ynnnym«-nf^la9a I MG( 95316 j Flfl9 1 MGI 95316 MGI 2679229 
f^\a9a 1 MGI 95316 


Oraanism 


: Mtift miisr.iiluft (MniiCA)! EMBL AAC79702,1 


Tsivrinnmir' iH^ntifi^r 


100Q0 rNHRIl 


Taxonomlc lineage 


Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > 
Euteleostomi > Mammalia > Eutheria > Euarchontoglires > Glires 
> Rodentia > Sciurognathi > Muroidea > Muridae > Murinae > Mus 
i > Mus 


Protein attributes 




Sequence length 


207 AA. 


Sequence status 


Complete. 


Protein existence 


Evidence at transcript level. 


■ 

General annotation (Comments) 


Sequence similarities 

• 


Belongs to the peptidase S1 family. 1 RuieBase ruooo36ov4 


Ontologies 




Gene Ontology (GO) 




Biological process 

1 
1 

I 

i 


acute inflammatory response to antigenic stimulus 
Inferred from direct assay. Source: MGI 

leukocyte migration 

Inferred from mutant phenotype. Source: MGI 

negative regulation of growth of symbiont in host 
Inferred from genetic interaction. Source: MGI 

neutrophil mediated killing of fungus 
Inferred from mutant phenotype. Source: MGI 

phagocytosis 

Inferred from mutant phenotype. Source: MGI 

positive regulation of immune response 
Inferred from genetic interaction. Source: MGI 


1 

« 

t 


proteolysis 

Inferred from electronic annotation. Source: InterPro 


r 


response to lipopolysaccharide 



http://www.uniprot.org/uniprot/Q9Z284 
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- 


Inferred from genetic interaction. Source: MGI 
response to yeast 

Inferred from mutant phenotype. Source: MGI 


Cellular component 


cytoplasm 

Inferred from direct assay. Source: MGI 


Molecular function 


serine-type endopeptidase activity 
Inferred from direct assay. Source: MGI 


Complete GO annotation... 




Sequences 




Sequence 


Length Mass (Da) 


n Q9Z284 [UniParc]. 


FASTA 207 22.526 



Last modified May 1, 1999. Version 1 
Checksum: 1F4E834DDD173DEE 



10 20 30 40 50 60 

M^yjGRLSSRT LAAMLLALFL GGPALPSEIV GGRPARPHAW PFMASLQRRG GHFCGATLIA 

70 80 90 100 110 120 

RNFVMSAVHC VTGNFRSVQV VLGAHDLRRQ ERTRQTFSVQ RIFENGFDPS QLLNDIVIIQ 

13£ 140 150 160 170 180 

LNGSATINAN VQVAQLPAQG QGVGDRTPCL AMGWGRLGTN RPSPSVLQEL NVTWTNMCP 

190 200 
RRVNVCTLVP RRQAGICFVS TLCRRRV 



« Hide 



References 



[1] 



"Characterization and localization of the genes for mouse proteinase-3 and neutrophil 
elastase." 

Sturrock A., Franklin K.F.. Wu S.Q., Hoidal J.R. 

Submitted (APR-2000) to the EMBL/GenBank/DDBJ databases 

Cited for : NUCL EOTIDE SEQUE NCE. 

Strain: 1 29/SvJ I EMBLAAC79702n 



- t- 



I Additional computationally mapped references. 



C ros s -ref e re n ces 



Sequence databases 



® EMBL 


AF082186 Genomic DNA. Translation: AAC79702.1. 




© GenBank 






Oddbj 






IPI 


IPI00785483. 




UnlGene 


Mm.262194. 





http ://www. un iprot.org/uniprot/ Q9Z2 84 
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3D structure databases 

HSSP 



J HSSP built from PDB template 1PPF based on UniProtKB 
P08246. 



ProteinModelPortal 


1 Q9Z284. 




SMR 


1 Q9Z284. Positions 29-198. 


Mod Base 


1 Search... 




Protein-protein interaction databases 




STRING 


Q9Z284. 










MGI 


MGI:2679229. Elane. 




Phylogenomic databases 






GeneTree 


ENSGT00580000081 367. 




HOVERGEN 


HBG013304, 


InParanoid 


Q9Z284. 




Gene expression databases 






ArrayExpress 


Q9Z284. 




Bgee 


Q9Z284. 




Genevestigator 


Q9Z284. 




Family and domain databases 






II lid 1 

,„, , _ , ,,, , „ , , 


IPR009003, Pept cys/ser Trypsin-like. 
IPR001254. Peptldase_S1_S6. 
[Graphical view] 




Pfam 


PF00089. Trypsin. 1 hit. 
[Graphical view] 


SMART 


SM00020, Tryp_SPc. 1 hit. 
[Graphical view] 


SUPFAM 


SSF50494. Pept_Ser_Cys. 1 hit. 


PROSITE 

: 
1 


PS50240. TRYPSIN_DOM. 1 hit. 
[Graphical view] 




ProtoNet | 


Search... 




Other Resources 






SOURCE 1 


Search... 




Entry information 






Entry name 


Q9Z284_MOUSE 





Accession 



Primary (citable) accession number: Q9Z284 



http://www.uniprot.org/uniprot/Q9Z284 
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Entry history 



Entry status 



Integrated into May 1, 1999 

UniProtKB/TrEMBL: 

Last sequence May 1, 1999 

update: 

Last modified: April 5, 201 1 

This is version 77 of the entry and version 1 of the sequence. 

[Complete history] 



Unreviewed (UniProtKB/TrEMBL) 



© 2002-2011 UniProt Consortium | License & Disclaimer | Contact 



EMBL-EBi 



• 




SIB 



http://www.unlprot.org/uniprot/Q9Z284 
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P48740 (MASP1_HUMAN)* Reviewed. UniProtKB/Swiss-Prot 

Last modified April 5, 2011. Version 116. 



Names and origin 



Protein names 



Recommended name: 
Mannan-binding lectin serine protease 1 

£0=3.4.21.- 
Alternative name(s): 
Complement factor MASP-3 

Complement-activating component of Ra-reactive factor 
Mannose-binding iectin-associated serine protease 1 

Short name=MASP-1 

Mannose-binding protein-associated serine protease 
Ra-reactive factor serine protease pi 00 

Short name=RaRF 

Serine protease 5 

Cleaved into the followino 2 chains: 

1 . Mannan-binding iectin serine protease 1 heavy 
ctiain 

2. Mannan-binding iectin serine protease 1 iigtit 
ciiain 



Gene names 



Name: MASP1 

Synonyms:CRARF, CRARF1 , PRSS5 



Organism 
Taxonomic identifier 
Taxonomic lineage 



Homo sapiens (Human) [Complete proteome] 



9606 [NCBI] 



Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > 
Euteleostomi > Mammalia > Eutheria > Euarchontoglires > 
Primates > Haplorrhini > Catarrhini > Hominidae > Homo 



Protein attributes 



Sequence length 


1 699 AA. 


Sequence status 


j Complete. 


Sequence processing 


The displayed sequence is further processed into a mature 
form. 


Protein existence 


Evidence at protein level. 


General annotation (Comments) 


Function 


< 

r 

Functions in the lectin pathway of complement, which performs 
a key role in innate immunity l^y recognizing pathogens through 
patterns of sugar moieties and neutralizing them. The lectin 
pathway is triggered upon binding of mannan-binding lectin 
(MBL) and ficolins to sugar moieties which leads to activation of 
the associated proteases MASP1 and MASP2. Functions as an 
endopeptidase and may activate MASP2 or C2 or directly 
activate C3 the key component of complement reaction. 
Isoform 2 may have an inhibitory effect on the activation of the 



http://www.uniprot.org/uniprot/P48740 



4/12/2011 



Mannan-binding lectin seri|^^rotease I precursor - Homo sapiens (H^jpn) 



Page 2 of 15 



Enzyme regulation 


l^r^in nafhu/Av/ nf rrnmnlc»nnAnt nr maw rIoawA If^FRPS 1 Ref.5 | 
1 Ref.24 1 


Inhibited by SFRPING1 and A?M 1 Reu? i| Ref.20 


Subunit structure 


Homodimer. Interacts with the otigomeric lectins MBL2, FCN2 
and FCN3; triggers the lectin pathway of complement through 



activation of C3 Interacts with SERPING1. 1 ^^m ^ I [Mil] 

Ref.l4 irRef.15 inRef.ie || Ref.18 |j Ref.l9 j} Ref.25 | 



^1 iK^olli liar \r%f*Ckt\an 


QonrofoH i Ref 5 


Tissue specificity 


Protein of the plasma which is primarily expressed by liver. 


1 Ref.5 Ref.1 II Ref. 2 II Ref.12 


Post-translational modification 


The iron and 2-oxoglutarate dependent 3-hydroxylation of 
aspartate and asparagine is (R) stereospecific within EOF 

riomainft 1 By similarity ! 

N-glycosylated. Some N-linked glycan are of the complex-type 


i By similarity j t Ref.11 It Ref.25 || Ref.22 ! 

Autoproteolytic processing of the proenzyme produces the 
active enzyme composed on the heavy and the light chain held 
together by a disulfide bond. Isoform 1 but not isoform 2 is 
activated through autoproteolytic processing. 


Sequence similarities 


Belongs to the peptidase S1 family. 
Contains 2 CUB domains. 
Contains 1 EGF-like domain. 
Contains 1 peptidase S1 domain. 
Contains 2 Sushi (CCP/SCR) domains. 


Biophvsicochemical orooerties * Kinetic parameters: 

1 Km=0.10 mM for Ac-Gly-Lys-OMe (at 30 degrees Celsius) 

j 1 Ref.17 { j Ref.20 | 

\ Km=310 pM for Bz-Arg-OEt (at 30 degrees Celsius) 
1 Km=4.8 pM for C2 (at 37 degrees Celsius) 


Sequence caution 1 


The sequence AAH39724.1 differs from that shown. Reason: 
Erroneous initiation. 



Ontologies 



Keywords 



Biological process 


Complement activation lectin pathway 




Immunity 




Innate immunity 


Cellular component 


Secreted 


Coding sequence diversity 


Alternative splicing 



Domain 



! EGF-like domain 
; Repeat 
i Signal 
Sushi 
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Gene Ontology (GO) 

Biological process 



Cellular component 



Molecular function 



Ligand 


Calcium 




Metal-bindrna 

IVIwLQi Lyil lull lU 


Molecular function 


Hydrolase 




Protease 




Senne protease 


PTM 


Autocatalytic cleavage 




Disulfide bond 




Glycoprotein 




Hydroxylation 


Technical term 


3D-structure 




Complete proteome 




Direct protein sequencing 



complement activation, lectin pa thway 

Inferred from mutant phenotype I ^^^-^^ 1 Source: UniProtKB 

negative regulation of co mpleme nt activation 
Inferred from direct assay CsiLlIl. Source: UniProtKB 

proteolysis 

Inferred from electronic annotation. Source: InterPro 



extracellular space 

Inferred from direct assay Clitlll. Source: UniProtKB 



calcium ion binding 

Inferred from direct assay I \ Source: UniProtKB 

calcium-dependent protein bindin g 

Inferred from physical interaction 1 Ref.i9 j Ref.i9 i Ret. 25 I 
Source: UniProtKB 

protein binding 

Inferred from physical interaction I Ref. i6 I Ref.5 | Ref.isl Source; 
UniProtKB 

protein homodimerization activity 

Inferred from physical interaction I Ref.2s I Source: UniProtKB 

serine-type endopeptidas e activity 

Inferred from direct assay 1 Ref.i7 j Source: UniProtKB 



Complete GO annotation. 



Alternative products 

This entry describes 4 isoforms produced by alternative splicing. [Align] [Select] 
Isof orm 1 (Identifier: P48740-1 ) 

This Isoform has been chosen as the 'canonical' sequence. All positional infonvation in this entry 
refers to it. This is also the sequence that appears in the downloadable versions of the entry. 

Isoform 2 (identifier: P48740-2) 
Also known as: MASP-3; 
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The sequence of this isoforrn differs from the canonical sequence as follows: 

435- 435: V ECGQPSRSLP...QSWEPQVER 

436- 699: Missing. 

Note: Glycosylated on Asn-533 and Asn-599. 
isoforrn 3 (identifier: P48740-3) 

The sequence of this isoforrn differs from the canonical sequence as follows: 
364-380: IVDCRAPGELEHGLITF KNEIDLESELKSEQVTE 
381-699: Missing. 

Isoforrn 4 (identifier: P48740-4) 

The sequence of this isoform differs from the canonical sequence as follows: 
1-113: Missing. 

435- 435: V ECGQPSRSLP...QSWEPQVER 

436- 699: Missing. 



Sequence annotation (Features) 



Feature key 



Molecule processing 



Position(s) j Length ! Description 



j Graphical view 





1 Signal peptide 


1 


-19 


19 


Ref.11 


1 


1 




Chain 


20- 


699 


680 


Mannan-binding lectin serine 
protease 1 


1 






Chain 


20- 


448 


429 


Mannan-binding lectin serine 
protease 1 heavy chain 


1 


1 






Chain 


449- 


699 


— 

251 


Mannan-binding lectin serine 
protease 1 light chain 


1 





Regions 

Domain 



20-138 



1^ 



Domain 



139-182 



119 i CUB 1 



I I 



44 EGF-like; calcium-binding 



II 



Domain 



Domain 



Domain 



Domain 



185-297 i 113 ! CUB 2 



299 - 364 



365 - 434 



449 - 696 



66 ! Sushi 1 

70 ! Sushi 2 

.■■ i ■ - ■ >^ 

248 ' Peptidase SI 



I I 



I I 



I 



Region 



20 - 278 



Region 



20-184 



Region 



20-184 



r 



259 I Interaction with FCN2 

165 I Homodimerization [IZ^IilrtZ 

165 Interaction with MBL2 



I I 



Sites 



□ 


Active site 


490 


1 ! Charge relay syft**»m 1 By simiiantv 1 


□ 


Active site 


552 


1 ! Charge relay syst**m 1 By similarity i 


□ 


Active site 


646 


1 I Charge relay syftt*»m 1 By similarity j 



! I 



I 
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LJ 


Metal binding 


68 




Calcium 1 


1 

1 { 


LJ 


Metal binding 


76 




Calcium 1 


1 1 


LJ 


Metal binding 


121 




Calcium 1 


' — L 


p-j 


Metal binding 


123 




Calcium 1 ; via carbonyl oxygen 


1 ! 


LJ 


Metal binding 


139 




Calcium 2 




□ 


Metal binding 


•4 ^ ft 

140 




Calcium 2; via carbonyl oxygen 


' 1 


LJ 


Metal binding 


142 


L 


Calcium 2 


^-J 1 


11 


Metal binding 


^ rft 
159 




Calcium 2 


- ' - I 




Metal binding 


160 




Calcium 2; via carbonyl oxygen 


1 ! L 




Metal binding 


163 




Calcium 2; via carbonyl oxygen 


, ^ L 




Metal binding 


ft ft ^ 
235 




Calcium 3 




LJ 


Metal Dinding 


245 




Calcium 3 


- L„._ 1 


Metal binding 


282 


— i 


Calcium 3 


1 1 


Metal binding 


284 




Calcium 3; via carbonyl oxygen 


« ! 




Site 


448 - 449 


2 


Cleavage; by autolysis 


li 

1 



Amino acid modifications 



□ 


Modified residue 


S 159 

p 


1 ; (3R)-3-hydroxyasparagine | 1 

1 1 Potential | [ 


^- 




Glycosylation 


■ 49 


1 


N-linked (GlcNAc...)! ^^^22 ; | 

- ~ 1 - • -• 


□ 


Glycosylation 


178 


1 


N-linked (GlcNAc...)LR§L25_ | | 

1 Ref.22 1 j 






Glycosylation 


385 


1 


N-linked (GlcNAc.) 1 «ef 22 I 


1 1 






Glycosylation 


407 


1 


N-linkAri (nirNAr ) 1 Ref.22 


1 1 




Disulfide bond 


73^91 




1 Ref.25 1 


II i 




Disulfide bond 


143<-». 157 




1 Ref.25 


1 




Disulfide bond 


153 ^ 166 




Ref.25 


1 1 


□ 


- 

Disulfide bond 


168 ^ 181 




Ref.25 


1 1 


□ 


Disulfide bond 


185<-*212 




Ref.25 1 


11 1 




Disulfide bond 


242 260 




Ref.25 1 


11 1 




Disulfide bond 


301 349 




1 By similarity I 


11 i 


□ 


Disulfide bond 


329 362 




By similarity I 


11 1 


Disulfide bond 


367 414 




By similarity I 


11 1 


□ 


Disulfide bond 


397 432 


■ 


By similarity i 


n 




□ 


Disulfide bond 


436 <-> 572 




Interchain (between heavy and 

light chains) 1 Potential 1 


1 


□ 


Disulfide bond 


475 ^ 491 


1 


1 By similarity | 


1 


1 


' j — 


1 

Disulfide bond 


614^631 




By similarity j 






Disulfide bond 


642 ^ 672 




By similarity I 





http://www.uniprot.org/uniprot/P48740 



4/12/2011 



Mannan-binding lectin serii^protease 1 precursor - Homo sapiens (Human) 



Page 6 of 1 5 



Natural variations 



m 


Alternative sequence 


i 1 


-113 


i 113 


' Missing in isoform 4. 




1 






Alternative sequence 


364 


-380 


1 " 


IVDCR...GLITF-^ 
KNEIDLESELKSEQVTE in 
isoform 3. 




: 
■ 


II 


t 




Alternative sequence 


381 


-699 


319 


Missing in isoform 3. 


- 




1 




□ 


Alternative sequence 




435 


1 


ECGQPSRSLPSLVKRIIGGR 

NAEPGLFPWQALIWEDTSR 

VP N D KWFG SG AL LS AS Wl LT 

AAHVLRSQRRDTTVIPVSKE 

HVTVYLGLHDVRDKSGAVNS 

SAARWLHPDFNIONYNHDI 

ALVQLQEPVPLGPHVMPVCL 

PRLEPEGPAPHMLGLVAGWG 

ISNPNVTVDEIISSGTRTLS 

DVLQYVKLPWPHAECKTSY 

ESRSGNYSVTENMFCAGYYE 

GGKDTCLGDSGGAFVIFDDL 

SQRWWQGLVSWGGPEECGS 

KQVYGVYTKVSNYVDWVWEQ 

MGLPQSWEPQVER in isoform 

2 and isoform 4. 




1 






Alternative sequence 


436 


-699 


264 

■■ 
■■ 


Missing in isoform 2 and isoform 





1 


I " 


□ 


Natural variant 


21 




T ^ 1. [dbSNP:rs1 062049] 


1 


1 

-1 




Natural variant 


568 


1 i 

; 

i _w _; 


V ^ A. [dbSNP:rs13322090J 






1 

_...!„.,.. 




Natural variant 


679 


1 


G R. [dbSNP:rs3774266] 






! 
1 



Experimental info 



B 


Mutagenesis 


68 


1 


E — > A or Q: Partial loss of 
Interaction with FCN2. FCN3 

and MRI 9 1 Ref 25 j 


1 




LI 


Mutagenesis 


77 


1 


Y — ► A: Partial loss of Interaction 
with FCN2, FCN3 and MBL2. 

Ref.25 1 


1 






Mutagenesis 

„ , WH. ■-■■ — ■l^— ■■-■.■■■■■til— ■.M...11.1H«-*M.M 


99 


1 


E — > A: Partial loss of interaction 
with FCN2. FCN3 and MBL2. 

1 Ref.25 1 


1 




□ 


Mutagenesis 


121 


1 


D — > A or N: Loss of interaction 
with FNC2 and FCN3 and partial 
loss of interaction with MBL2. 

1 Ref.25 1 


1 






Mutagenesis 


122 


1 1 F ^ A: Partial loss of interaction 
i with FCN2. FCN3 and MBL2. 

j 1 Ref.25 1 


i " 




□ 


- , 

Mutagenesis 


123 

1 


1 

1 


S — ► A: Partial loss of interaction 
with FCN2. FCN3 and MBL2. 

j Ref.25 1 


1 
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□ 


Mutagenesis 


125 


□ 


Mutagenesis 


237 



1 



E ~* A: Partial loss of interaction 

with FC N2, FCN3 and MBL2. 

I Ref.25 I 



□ 



□ 



1 H A: Loss of interactio n with 
FCN2. FCN3 and MBI-2. 



Mutagenesis 



239 i 



1 



E — > A: Partial loss of interaction 
with FCN2. FCN3 and MBL2. 

nRef.25 



Mutagenesis 



244 



1 Y A: Loss of interactio n with 
FCN2, FCN3 and MBL2. 



□ 



□ 



n 



□ 



Mutagenesis 



262 



1 



E ^ A: Partial loss of interaction 

with FC N2, FCN3 and MBL2. 

I Ref.25 1 



Mutagenesis 



274 



1 



S — ► A: Partial loss of interaction 
with FCN2 and FCN3. No effect 
on inter action with MBL2. 

Ref.25 \ 



Mutagenesis 



283 



1 



t 



N — > A: Partial loss of interaction 
with FC N2, FCN3 and MBL2. 

Ref 25 i 



Mutagenesis 



286 



1 I E A: Partial loss of interaction 
with FC N2, FCN3 and MBL2. 

I i Ref 25 I 



Mutagenesis 



646 i 



1 : S A: No autoprot eolytic 
processing. I I 



Sequence conflict 
Sequence conflict 
Sequence conflict 



89 
232 



I 



1 ; R"^ K 



Sequence conflict 



Sequence 
Sequence 



conflict 
conflict 



235 



Sequence conflict 



235 



285 



285 



Sequence conflict 



285 



Sequence conflict 



392 



1 
1 



T 
F 



n CAH 18409. 
n BAF84375. 1 Ref.6 



1 



1 
1 



1 



1 



1 



Q 



Q 



n BAF84376. CMHI 



n BAA05928. 1 



n BAA34864. 1 Re^3 



n BAA05928. CUlI 



n BAA34864. 



n BAA89206. 1 



n BAF83846. [MI] 



4 



I 



□ 



□ 



□ 



□ 



Sequence conflict 
Sequence conflict 



A 



499 



1 



n BAA05928. 1 Ref.2 | 



499 



1 I E->K 



n BAA04477. 1 Ref.i 



Sequence conflict 



499 



1 I E-^K 



n BAA89206, 1 Ref.4 



Sequence conflict 



627 



Sequence conflict 



543 



1 : D -> A 
1 I Q-^K 



n BAA34864, 13^ 



n BAA04477. 1 Ref. i 



Sequence conflict 



552 , 



1 , D-^ V 



n BAA34864. 1 



□ 1 Sequence conflict 

Secondary structure 

1 



643 



1 A 



n BAA04477. 1 Ref.i i 



tl 

..|_ 



il 
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I Helix I Strand | Turn 

Details... 



Sequences 



Sequence 



Length Mass (Da) 



g] isoform 1 [UniParc], 

Last modified December 16, 2008. Version 3. 
Checksum: 5B37C7FB9F51FD1D 



FASTA 



699 



79.247 



1 ft 


2_0 


t 30 


40^ 


50^ 




MRWLLIiYYAL 


CFSLSKASAH 


TVELNNMFGQ 


IQSPGYPDSY 


PSDSEVTWNI 


TVPDGFRIKI* 


70 


O 0 


90^ 


100 


110 


120 


YFMHFNLESS 


YLCEYDYVKV 


ETEDQVLATF 


CGRETTDTEQ 


TPGQEWLSP 


GSFMSITFRS 


13j0 


140 


150 


16£ 


170 


18£ 


DFSNEERFTG 


FDAHYMAVDV 


DECKEREDEE 


LSCDHYCHNY 


IGGYYCSCRF 


GYILHTDNRT 


190 


200 


210 


220 


230 


240 


CRVECSDNLF 


TQRTGVITSP 


DFPNPYPKSS 


ECLYTIELEE 


GFMVNLQFED 


IFDIEDHPEV 


2S0 


260 


270 


280 


290 


300 


PCPYDYIKIK 


VGPKVLGPFC 


GEKAPEPIST 


QSHSVLILFH 


SDNSGENRGW 


RLSYRAAGNE 


310 


320 


330 


340 


350 


360 


CPELQPPVHG 


KIEPSQAKYF 


FKDQVLVSCD 


TGYKVLKDNV 


EMDTFQIECL 


KDGTWSNKXP 


370 


380 


390 


4 00^ 


410 


420 


TCKIVDCRAP 


GELEHGLITF 


STRNNLTTYK 


SEIKYSCQEP 


YYKMLJJNNTG 


lYTCSAQGVW 


430 


440 


450 


460. 


470 


480^ 


MNKVLGRSIiP 


TCLPVCGL.PK 


FSRKU4ARIF 


NGRPAQKGTT 


PWIAMLSHLN 


GQPFCGGSIili 


490 


500 


510 


520 


530 


540 


GSSWIVTAAH 


CliHQSLDPED 


PTLRDSDLLS 


PSDFKIILGK 


Hlf^LRSDEME 


QHLGVKHTTL 


550 


560 


570 


580 


590 


600 


HPQYDPNTFE 


MDVALVEIiLE 


SPVliNAFVMP 


ICLPEGPQQE 


GAMVIVSGWG 


KQFLQRFPET 


610 


620 


630 


640 


650 


660 


LMEIEIPIVD 


HSTCQKAYAP 


UCKKVTRDMI 


CAGEKEGGKD 


ACAGDSGGPM 


VTLNRERGQW 


670 


68 01 


690 








YLVGTVSWGD 


DCGKKDRYGV 


YSYIHHNKDW 


IQRVTGVRN 







« Hide 



Isoform 2 (MASP-3). 

Checksum: 09B5297A6C14283A 
Show » 



FASTA 



728 



81,860 



Isoform 3. 

Checksum: DDED114311A62714 
Show » 



FASTA 



380 



43.640 



isoform 4. 

Checksum: EEE63886709340FA 
Show » 



FASTA 



615 



68,918 
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Primary (citable) accession number: P48740 
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Entry history 

» 
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Integrated into February 1. 1996 

UniProtKB/Swiss- 

Prot: 

Last sequence December 16, 2008 
update: 
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This is version 1 16 of the entry and version 3 of the sequence, 
[Complete history] 
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Chordata Protein Annotation Program 


Disclainfier 
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P98064(MASP1_MOUSE) # Reviewed, UniProtKB/Swiss-Prot 

Last modified March 8, 201 1. Version 106. 



Names and origin 



Protein names 


Recommended name: 
Mannan-binding lectin serine protease 1 

EC=3.4.21.- 
Altemative name(s): 

Complement factor MASP-3 

Complement-activating component of Ra-reactive factor 
Mannose-binding lectin-associated serine protease 1 

Short name=MASP-1 

Mannose-binding protein-associated serine protease 
Ra-reactive feictor serine protease p100 

Short name=RaRF 

oenne protease o 

Cleaved into the followina 2 chains: 

1 . Mannan-binding iectin serine protease 1 tieavy 
chain 

2. Mannan-binding iectin serine protease 1 tight 
chain 


Gene names 


Name: Masp1 
Synonyms: Crarf, Masp3 


Organism 


iVIus musculus (Mouse) 


Taxonomic identifier 


10090 [NCBI] 


Taxonomic lineage 


Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > 
Euteleostomi > Mammalia > Eutheria > Euarchontoglires > Glires 

> Rodentia > Sciurognathi > Muroidea > Muridae > Murinae > Mus 

> IViUS 


Protein attributes 




Sequence length 


704 AA. 


Sequence status 


Complete. 


Sequence processing 


The displayed sequence is further processed into a mature 
form. 


Protein existence 


Evidence at protein level. 



General annotation (Comments) 



Function 



Functions in the lectin pathway of complement, which performs 
a key role in innate immunity by recognizing pathogens through 
patterns of sugar moieties and neutralizing them. The lectin 
pathway is triggered upon binding of mannan-binding lectin 
(MBL) and ficolins to sugar moieties which leads to activation of 
the associated proteases MASP1 and MASP2. Functions as an 
endopeptidase and may activate MASP2 or C2 or directly 
activate 03 the key component of complement reaction. 



http://www.uniprot.org/uniprot/p98064 



4/13/2011 



Page 2 of 10 





Isoform 2 may have an inhibitory effect on the activation of the 
lectin pathway of complement or may cleave IGFBP6. 1 ^^^-^ 1 


l-n7\/mp rpmilfltinn 


InhihitpH hv f>FRPINni f:ind A9M 1 By similarity I 


Subunit structure 

Xi^Uh^U 1 III wll UwLUI w 


Homodimer Interacts with the olioomeric lectins MBL1 MBL2. 

1 t^^l 1 III l\^ mm III ^WpI Cil\^lbWr WW 1 LI 1 mm IW ^^1 l\j^^l 1 l^^l m\^ IWp^^M I l%ir I w I lar mm 9 f ■ w ■ ^v^w | 

FCN2 and FCN3; triggers the lectin pathway of complement 
through activation of C3 Interacts with SFRPING1 1 By similarity 


Subcellular location 


Secreted 1 Re^-i . 


Tissue specificity 


Protein of the plasma which is primarily expressed by liver. 

Ref.7 1 Ref.1 | 


Post-transl§itional modification 


The iron and 2-oxoglutarate dependent 3-hydroxylation of 
aspartate and asparagine is (R) stereospecific within EGF 

Hnmains 1 By similarity I 

N-glycosylated. Some N-linked glycan are of the complex-type 

1 By similarity | 

Autoproteoiytlc processing of the proenzyme produces the 
active enzyme composed on the heavy and the light chain held 
together by a disulfide bond. Isoform 1 but not isoform 2 is 

sir.tiwat^H thrniinh aiitnnrntf^nlwtip. nrnnftRsinn 1 By similarity j 


Disruption phenotype 


Mice are smaller and more vulnerable indicating developmental 

and growth defects. Mice serum has low C4 and C3 cleavage 
activity together with low MASP2 activation. 1 ^^^-^ 1 


Sequence similarities 

■ 

: 

■ 

: 
: 


Belongs to the peptidase S1 family. 
Contains 2 CUB domains. 
Contains 1 EGF-like domain. 
Contains 1 peptidase S1 domain. 
Contains 2 Sushi (CCP/SCR) domains. 



Ontologies 


Keywords 




Biological process 


Complement activation lectin pathway 

Immunity 

Innate immunity 


Cellular component 


Secreted 


Coding sequence diversity 


Alternative splicing 


Domain 


EGF-like domain 

Repeat 

Signal 

Sushi 


Ligand 


Calcium 
Metal-binding 


Molecular function 


Hydrolase 
Protease 
Serine protease 


PTM 


Autocatalytic cleavage 



I Disulfide bond 



http://www.xmiprot.org/uniprot/p98064 



4/13/2011 



Mannan-binding lectin serine protease 1 precursor - Mus musculus (Mouse) 



1 




Page 3 of 10 



Technical term 

Gene Ontology (GO) 

Biological process 



Glycoprotein 
Hydroxylation 



Direct protein sequencing 



Cellular component 



Molecular function 



complement activation, lectin pa thway 

Inferred from mutant phenotype i j Source: UniProtKB 

proteolysis 

Inferred from electronic annotation. Source: InterPro 



extracellular space 

Inferred from direct assay I Ref.7.1 source: UniProtKB 



calcium ion binding 

Inferred from sequence or structural similarity. Source: 
UniProtKB 

protein homodimerization activity 

Inferred from sequence or structural similarity. Source: 

UniProtKB 

serine-type endopeptidas e activi ty 

Inferred from direct assay CEiLL]. Source: UniProtKB 



Complete GO annotation... 



Alternative products 



This entry describes 2 isoforms produced by alternative splicing. [Align] [Select] 



Isoform 1 (identifier: P98064-1 ) 
>A/so known as: MASP-1 ; 

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry 
refers to it. This is also the sequence that appears in the downloadable versions of the entry. 

Isoform 2 (identifier: P98064-2) 
Also known as: MASP-3; 

The sequence of this isoform differs from the canonical sequence as follows: 

443- 443: V QPSRALPNLV...RAVRDLQVER 

444- 704: Missing. 

Note: Glycosylated on Asn-538 and Asn-604 (By similarity). 



Sequence annotation (Features) 



Feature key 



l\Aolecule processing 



Signal peptide 



Position(s) Length j Description 



Graphical view 



1 -24 



24 



I 



http://www.uniprot.org/ixniprot/p98064 



4/13/2011 



Mannan-binding lectin serineprotease 1 precursor - Mus musculus (Mouse) 



Regions 

Domain 



m 



Domain 



Domain 



Domain 



Domain 



Domain 



Region 



Region 



Region 



Region 



Sites 
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m 


Chain 

mil I 


25 - 704 


680 


Mannan-bindlna lectin serine 

IwlBil 11 Iwll 1 l^li IVili 1^4 l^^^^^li i w^^l >l 

protease 1 


1 




Chain 


25 - 453 


429 


Mannan-bindina lectin serine 
protease 1 heavy chain 


1 1 




Chain 


454 - 704 


251 


Mannan-binding lectin serine 
protease 1 light chain 


1 



25-143 



144-187 



190-302 



304 - 369 



370 - 439 



454 - 701 



25 - 305 



25 - 283 



25-189 



25-189 



119 



44 



113 



66 



70 



248 



281 



259 



166 



165 



CUB 1 



EGF-like; calcium-binding 



By slmi}ahty 



CUB 2 



Sushi 1 



Sushi 2 



Peptidase SI 



Interaction with MBL1 

I By similarity | 



Interaction with FCN2 

I By similarity | 



I I 



II 



I I 



Homodimerization I By similarity 



I I 



Interaction w ith MBL2 

I By similarity | 



I I 



I I 



I I 



□ 


Active site 


495 




nhargp rf»l ay. system 1 By similarity 




1 


□ 


Active site 


557 




Charge relay system 1 By similarity , 


i 1 




Active site 


651 




Charge relay system i ^y similarity 1 




□ 


Metal binding 


73 




Calcium 1 1 By similarity | 


_i 1 


□ 


Metal binding 


81 




1 nalf^iiim 1 By similarity 


1 i 




Metal binding 


126 




Calcium 1 1 similarity 


ji 




Metal binding 


128 




Calcium 1 ; via carbonyl oxygen 






1 By similarity 




Metal binding 


144 




nalr.il im 9 By similarity 






□ 


Metal binding 


145 




Calcium 2; via carbonyl oxygen 

1 By similarity | 


1 






Metal binding 


147 




Calcium? By similarity 








Metal binding 




164 




Calcium 2 By similarity 


! 1 


1 




Metal binding 


165 






Calcium 2; via carbonyl oxygen 

1 By similarity j 


1 






Metal binding 


168 




Calcium 2; via carbonyl oxygen 


1 ! 


1 By similarity 




1 


□ 


Metal binding 


240 




naif-Jiim r^l Bysimtlarity 


- 


□ 


Metal binding 


250 




Calcium 31 By similarity 1 
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□ 


Metal binding 


287 


1 


Calcium ^ 1 By similarity 






□ 


Metal binding 


289 


1 


Calcium 3; via carbonyi oxygen 

\ By similarity 


1 






Site 


453 - 454 


2 


Cleavage; by autolysis 

1 By similarity | 


1 





Amino acid modifications 

Modified residue 



I Glycosylation 



Glycosylation 



Glycosylation 



Glycosylation 



~ 1... 



Disulfide bond 



Disulfide bond 



I Disulfide bond 



Disulfide bond 



j Disulfide bond 



I Disulfide bond 



Disulfide bond 



164 



54 



183 



390 



412 



78^96 



148^ 162 



158^171 



173 
190 



186 



217 



265 



Disulfide bond 



247 

306 354 I 
1 334 ^ 367 i 



Ell I Disulfide bond 



Disulfide bond 



I 372^419 
1 402 ^ 437 



□ I Disulfide bond 



441 ^ 577 



Disulfide bond 



480 ^ 496 



I Disulfide bond 



619^636 



Disulfide bond 



i 



647 ^ 



Natural variations 



Alternative sequence 



443 



1 



1 



1 



1 



1 



(3R)-3-hy droxyasparagine 

I Potential j ' : 



N-linked (GlcNAc..,) I Potential I 



N-linked (GlcNAc:..) I Potential | 



N-linked (GlcNAc...) I Potential 



N-linked (GlcNAc.) I Potential 



I 



By similarity 



II 



I By similarity 



By similarity 



I 



By similarity 



By similarity 



II 



„ 1. 



By similarity 



II 



By similarity 



II 



By similarity 



II 



By simiiarity 



II 



By similarity 



Interchain (b etween h eavy and 

light chains) f Potential I 
I By similarity 



II 

i" 



i 



By similarity | 



By similarity 



1 



QPSRALPNLVKRIIGGRNAE 

LGLFPWQALIWEDTSRVPN 

DKWFGSGALLSESWILTAAH 

VLRSQRRDNTVIPVSKEHVT 

VYLGLHDVRDKSGAVNSSAA 

RVILHPDFNIQNYNHDIALV 

QLQKPVPLGAHVMPICLPRP 

EPEGPAPHMLGLVAGWGISN 

PN VTVDEII LSGTRTLSDVL 

QYVKLPWSHAECKASYESR 

SGNYSVTENMFCAGYYEGGK 

DTCLGDSGGAFVIFDEMSQH 

WVAQGLVSWGGPEECGSKQV 



I 
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Alternative sequence 



Experimental info 



444 _ 704 



261 



YGVYTKVSNYVDWLWEEMNS j 
PRAVRDLQVER in isoform 2. i 



IVIissing in isoform 2. 



I 



□ 


Sequence conflict 


267 


1 j G A in AAI31638. 1 Re^-S 




Sequence conflict 


345 


1 j E G in BAA03944. 1 R^f.i 


□ 


Sequence conflict 


345 


1 I F — G In RAR69B8B 1 Ref.2 




, 

Sequence conflict 


— — . 

428 


1 i F K in RAA03944 1 1 




Sequence conflict 


465 


1 1 M T in AAN39850 1 



Sequences 



Sequence 



Lerigtli itflass (Da) 



n Isoform 1 (MASP-1) [UniParc]. 



FASTA 



704 



79.968 



Last modified April 14, 2009. Version 2. 
Checksum: 4CF0B1 791 6C1 0961 



10 


20 


301 


4£ 


5£ 


6£ 


MRFLSFWRLL 


LYHALCLAIiP 


EVSAHTVELN 


EMFGQIQSPG 


YPDSYPSDSE 


VTWNITVPEG 


10 


80 


90^ 


100^ 


110 


12£ 


FRIKLYFMHF 


NLESSYLCEY 


DYVKVETEDQ 


VliATFCGRET 


TDTEQTPGQE 


WliSPGTFMS 


130 


140 


15£ 


16£ 


17_0 


18£ 


VTFRSDFSNE 


ERFTGFDAHY 


MAVDVDECKE 


REDEEIiSCDH 


YCHNYIGGYY 


CSCRFGYILH 


190 


202 


210^ 


22£ 


230 


24£ 


TDNRTCRVEC 


SGNIjFTQRTG 


TITSPDYPNP 


YPKSSECSYT 


IDLEEGFMVS 


liQFEDIFDIE 


250^ 


26 0 


210 


280 


■ 2 90 


30£ 


DHPEVPCPYD 


YIKIKAGSKV 


WGPFCGEKSP 


EPISTQTHSV 


QILFRSDNSG 


ENRGWRLSYR 


310 


32 0 


330 


34£ 


35 01 


36£ 


AAGNECPKLQ 


PPVYGKIEPS 


QAVYSFKDQV 


LVSCDTGYKV 


liKDNEVMDTF 


QIECLKDGAW 


370 


380 


390 


A 00 


41£ 


42£ 


SNKIPTCKIV 


DCGAPAGIjKH 


GLVTFSTRNN 


LTTYKSEIRY 


SCQQPYYKMi. 


HNTTGVYTCS 


43 0 


440 


450^ 


46£ 


4 7£ 


4 8£ 


AHGTWTNEVL 


KRSLPTCIiPV 


CGVPKFSRKQ 


ISRIFNGRPA 


QKGTMPWIAM. 


LSHLNGQPFC 


4 90 


500 


510 


520 


53£ 


54£ 


GGSLLGSNWV 


liTAAHCLHQS 


LDPEEPTLHS 


SYLLSPSDFK 


IIMGKHWRRR 


SDEDEQHLHV 


550 


560 


570 


580 


59£ 


60£ 


KRTTLHPLYN 


PSTFENDIiGL 


VEIiSESPRIiN 


DFVMPVCLPE 


QPSTEGTMVI 


VSGWGKQFLQ 


610 


620 


630 


640 


-65£ 


66£ 


RFPBNIiMEIE 


IPIVNSDTCQ 


EAYTPLKKKV 


TKDMICAGEK 


EGGKDACAGD 


SGGPMVTKDA 


67 0 


680 


6 90 


7 0£ 






ERDQWYLVGV 


VSWGEDCGKK 


DRYGVYSYIY 


PNKDWIQRIT 


GVRN 





« Hide 



isoform 2 (MASP-3). 

Checksum: 6B3B54D5D3F5822B 
Show » 



FASTA 



733 



82,446 
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PF00084. Sushi. 2 hits. 
PF00089. Trypsin. 1 hit. 
[Graphical view] 


PRINTS 


PR00722. CHYMOTRYPSIN. 


SMART 


SM00032. CCP. 2 hits. 
SM00042. CUB. 2 hits. 
SM00179. EGF CA. 1 hit. 
SM00020. Tryp.SPc. 1 hit. 
[Graphical view] 


SUPFAM 


SSF57535. Complement control module. 2 hits. 
SSF49854. CUB. 2 hits. 
SSF50494. Pept_Ser_Cys. 1 hit. 


PROSITE 


PS00010. ASX HYDROXYL 1 hit. 
PS01180. CUB, 2 hits. 
PS00022. EGF 1. False negative. 
PS01186. EGF_2. 1 hit. 
PS50026. EGF 3. False negative. 
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PS50923. SUSHI. 2 hits. 
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Other Resources 



SOURCE 


Search... 


Entry information 




Entry name 


MASP1_MOUSE 


Accession 


Primary (citable) accession number: P98064 
Secondary accession number(s): A2RRH8 B Q920S0 


Entry history 


Integrated into February 1, 1996 

UniProtKB/Swiss- 

Prot: 

Last sequence April 14, 2009 
upaate. 

Last modified: March 8. 201 1 

This is version 106 of the entry and version 2 of the sequence. 
[Complete history] 


Entry status 


Reviewed (UniProtKB/Swiss-Prot) 


Annotation program 


Chordata Protein Annotation Program 



Relevant documents 



MGD cross-references 

Mouse Genome Database (MGD) cross-references in UniProtKB/Swiss-Prot 

Peptidase families 

Classification of peptidase families and list of entries 

SIMILARITY comments 

Index of protein domains and families 

©2002-2011 UniProt Consortium | License & Disclaimer | Contact 

EMBL-EBl|!|l 
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P48740 (MASP1_HUMAN) ^ Reviewed, UniProtKB/Swiss-Prot 

Last modified April 5, 201 1 . Version 116. 



Names and origin 


Protein names 


Recommended name: 

Mannan-binding lectin serine protease 1 
EC=3.4.21.- 
Altemative name(s): 

Complement factor MASP-3 

Complement-activating component of Ra-reactive factor 
iviannose-Dinaing leciin-associaiea senne proiease i 

Short name=MASP-1 

Mannose-blndlng protein-associated serine protease 
Ra-reactive factor serine protease p100 

Serine protease 5 

Cleaved into the foliowino 2 chains; 

1 . Mannan-binding lectin serine protease 1 heavy 
cttain 

2. Mannan-binding lectin serine protease 1 light 
chain 


Gene names 


Name: MASP1 

Synonyms:CRARF, CRARF1 , PRSS6 


Organism 


Homo sapiens (Human) [Complete proteome] 


Taxonomic identifier 


9606 [NCBI] 


Taxonomic lineage 

Protein attributes 


Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > 
Euteleostomi > Mammalia > Eutheria > Euarchontoglires > 
Primates > Haplorrhini > Catarrhini > Hominidae > Homo 


Sequence length 


699 AA. 


Sequence status 


Complete. 


Sequence processing i 


The displayed sequence is further processed into a mature 
form. 


— — ■■ — 1 ■ ■■■■■ ■ 

Protein existence | Evidence at protein level. 

General annotation (Comments) 


Function 


Functions in the lectin pathway of complement, which performs 
a key role in innate immunity by recognizing pathogens through 
patterns of sugar moieties and neutralizing them. The lectin 
pathway is triggered upon binding of mannan-binding lectin 
(MBL) and ficolins to sugar moieties which leads to activation of 
the associated proteases MASP1 and MASP2. Functions as an 
endopeptidase and may activate MASP2 or C2 or directly 
activate C3 the key component of complement reaction. 
Isoform 2 may have an inhibitory effect on the activation of the 



http://www.uniprot.org/uniprot/P48740 



4/13/2011 



Mannan-binding lectin serii^^rotease 1 precursor - Homo sapiens (Human) 



Page 2 of 15 



Enzyme regulation 



Subunit stnjcture 



Subcellular location 



Tissue specificity 



Post-translational modification 



Sequence similarities 



Biophysicochemical properties 



Sequence caution 



lectin p athway of complement or may cleave IGFBP5. 1 ^Q^^ I 

I Ref.24 1 



Inhibited by SERP1NG1 and A9M I Ref.i7ll Ref.20 



Homodimer. Interacts with the oligomeric lectins MBL2, FCN2 
and FCN3; triggers the lectin pathway of co mplement throu gh 
activation of C3. Interacts with SERP1NG1.I R ef/ii | [ Ref.i3 | 

1 Ref.14 ifRef-IS irReTTell Ref.lTIl Ref.19ll Ref.25 I 



Secreted [MH, 



Protein of the plasma which is primarily expressed by liver. 

I Ref.S II Ref.l II Ref.2 11 Ref.12 I 



The iron and 2-oxoglutarate dependent 3-hydroxylation of 
aspartat e and aspara gine is (R) stereospecific within EOF 

domains I By similarity j 

N-glycosyla t ed. Some N-linked glyc an are of the complex-type 

I By similarity [ j Ref 11 1| Ref.25 irRef.22 | 



Autoproteolytlc processing of the proenzyme produces the 
active enzyme composed on the heavy and the light chain held 
together by a disulfide bond. Isoform 1 but not isoform 2 is 
activated through autoproteolytlc processing. 



Belongs to the peptidase S1 family. 
Contains 2 CUB domains. 
Contains 1 EGF-like domain. 
Contains 1 peptidase S1 domain. 
Contains 2 Sushi (CCP/SCR) domains. 



Kinetic parameters: 

Km=0.10 mM for Ac-Gly-Lys-OMe (at 30 degrees Celsius) 



I Ref. 17 11 Ref.20 



Km=310 pM for Bz-Arg-OEt (at 30 degrees Celsius) 
Km=4.8 pM for C2 (at 37 degrees Celsius) 



The sequence AAH39724.1 differs from that shown. Reason: 
Erroneous initiation. 



Ontologies 


Keywords 




Biological process 


Complement activation lectin pathway 

Immunity 

Innate immunity 


Cellular component 


Secreted 


Coding sequence diversity 


Alternative splicing 
Polymorphism 


Domain 


EGF-like domain 

Repeat 

Signal 

Sushi 
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Ligand 


Calcium 
Metal-binding 


Molecular function 


Hydrolase 
Protease 
Serine protease 


PTM 


Autocatalytic cleavage 
Disulfide bond 

Glvconrotein 
Hydroxylation 


Technical term 


3D-structure 
Complete proteome 
Direct protein sequencing 


Gene Ontology (GO) 




Biological process 


complement activation, lectin pathway 

Inferred from mutant phenotype 1 1 Source: UniProtKB 

negative regulation of complement activation 
Inferred from direct assay L5®LL.. Source: UniProtKB 

proteolysis 

Inferred from electronic annotation. Source: InterPro 


Cellular component 


extracellular space 

Inferred from direct assay LEeLL.. Source: UniProtKB 


Molecular function 


calcium ion binding 

Inferred from direct assay 1 Ref.25 i Source: UniProtKB 
calcium-dependent protein binding 

Inferrf^H frnm phy«tir^l inf^^rartinn 1 Ref.19 I Ref.19 I Ref.25 I 

Source: UniProtKB 
protein binding 

Inf^rrf^H frnm physin;?! intf^rartinn Ref.16 I Ref.5 I Ref.18 1 f^niirnp- 

UniProtKB 

protein homodimerlzation activity 

Inferrf^d frnm physir^l intftrartinn 1 Ref.25 j Sournft" UniProtKB 

serine-type endopeptidase activity 

Inferred from direct assay 1 ^e^-^^ 1 Source: UniProtKB 



Complete GO annotation... 



Alternative products 



This entry describes 4 isoforms produced by alternative splicing. [Align] [Select] 



Isoform 1 (identifier: P48740-1) 

This isoform has been chosen as the 'canonical' sequence. AH positional infonvation in this entry 
refers to it. This is also the sequence that appears in the downloadable versions of the entry. 



Isoform 2 (identifier: P48740-2) 
Also known as: MASP-3; 



http://vv^vAv.uniprot.org/uniprot/P48740 



4/13/2011 



Mannan-binding lectin serii^^rotease 1 precursor - Homo sapiens (Hujaan) Page 4 of 15 



ij^^rotease 1 precursor - Homo sapiens (H^|p 



The sequence of this isoforrn differs from the canonical sequence as fotiows: 

435- 435: V -> ECGQPSRSLP...QSWEPQVER 

436- 699: Missing. 

Note: Glycosylated on Asn-533 and Asn-599. 
isoform 3 (identifier: P48740-3) 

The sequence of this isoform differs from the canonical sequence as follows: 
364-380: IVDCRAPGELEHGLITF KNEIDLESELKSEQVTE 
381-699: Missing. 

Isoform 4 (identifier: P48740-4) 

The sequence of this isoform differs from the canonical sequence as follows: 
1-113: Missing. 

435- 435: V ECGQPSRSLP...QSWEPQVER 

436- 699: Missing. 



Sequence annotation (Features) 



Regions 





Feature key 


Position(s) 


Length 


Description 


Graphical view 


Molecule processing 












Signal peptide 


1 -19 


19 


1 Ref.11 f 


1 1 

* 

1- 




Chain 


20 - 699 


680 


Mannan-binding lectin serine 


1 1 










protease 1 


i 




Chain 


20 - 448 


429 


Mannan4>inding lectin serine 


1 li 










protease 1 heavy chain 




B 


Chain 


449 - 699 


251 


Mannan-binding lectin serine 


1 










protease 1 light chain 







Domain 


20-138 


119 


CUB 1 


1 1 i 






Domain 


139-182 


44 


EGF-like; calcium-binding 


II 1 

\— 




Domain 


185-297 


113 


CUB 2 


1 1 1 


PI 




Domain 


299 - 364 


66 


Sushi 1 


1 1 1 


m 


Domain 


365 - 434 


70 


Sushi 2 


1 II 


B 


Domain 


449 - 696 


248 


Peptidase SI 


II 


B 


Region 


20 - 278 


259 


Interaction with FCN2 


1 1 t 




Region 


20-184 


165 

L _ 


HnmnHimftriyatinn 1 By similarity [ 


• 1 ! 


m 

Sites 

□ 


Region 
Active site 




20-184 

490 


165 

1 


Interaction with MBL2 

Charge relay system 1 By similarity | 


1 1 


1 


□ 


Active site 


552 


1 


Charge relay system 1 By similarity | 


1 1 




Active site 


646 


1 


r.hs*rQf> r^lfly Qyftti^m 1 By similarity j 
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1 i 


Metal binding 


68 


"* 


Calcium 1 


r > 
1 1 
1 


mi 


Metal binding 


76 




Calcium 1 


g i 
1 


i — 1 
LJ 


Metal binding 


121 




Calcium 1 


1 i 
1 


i — \ 
1 1 


Metal binding 


123 




Calcium 1 ; via carbonyl oxygen 




f — 1 
1 1 


Metal binding 


139 

i 




Calcium 2 


1 


1 1 


Metal binding 


i 

140 




Calcium 2; via carbonyl oxygen 


i t 
1 


1 — j 

LJ 


Metal binding 


142 




Calcium 2 






Metal binding 


159 


"* 


Calcium 2 


1 


LJ 


■ a ^ *_ 1 1 • _m • ____ 

Metal binding 


160 


"* 


Calcium 2; via carbonyl oxygen 


1 1 


□ 


Metal binding 


163 


1^ 


Calcium 2; via carbonyl oxygen 


i i 
1 1 


i } 


Metal binding 


OOP 
235 




Calcium 3 


1 


□ 


Metal binding 


245 




Calcium 3 


1 i 


□ 


Metal binding 


282 




Calcium 3 


1 

i t 


□ 


Metal binding 


284 




Calcium 3; via carbonyl oxygen 


i 1 

i 


Ami 


Site 

no acid modifications 

Modified residue 


448 - 449 
159 


2 
1 


Cleavage; by autolysis 
(3R)-3-hydroxyasparagine 

1 Potential | 


li 

i i 




□ 


Glycosylation 


49 


1 


N-linked (GlcNAc.) 1 ^^^^2 | 


i 

' ! 


□ 


Glycosylation 


178 


1 


N-linked (GlcNAn ) 1 Re^ 25 | 

1 Kef. 22 i 


i 




Q 


Glycosylation 


385 


1 


N-linked (GlcNAc...)! 


i i 


LJ 


Glycosylation 


407 


1 


N.hnkfaH (nirNAr ) 1 Kef.22 1 


il 

- - I ... 




Disulfide bond 


^o o 
73 91 




1 Ket.ZD 1 


II 


1 

1 
1 




Disulfide bond 


143 157 




1 Kei.2o 1 


• 


|i j 


Disulfide bond 


153 166 




1 Da( on 1 
p KeT.2D J 


■ 


1 

1 

\ 

1 


L_l 


Disulfide bond 


168 181 


" "' ■ ' 


1 Drsf OC 1 

1 Hei.zo 1 


1 


Em 


Disulfide bond 


185 ^ 212 




1 □A'f OC 1 

1 KeT.^O i 


1 

1— 




Disulfide bond 


242 260 




1 D/-»f OC i 


1 

^_ 


D 


Disulfide bond 


301 349 




1 By similarity i 


II i 

[_ 


□ 


Disulfide bond 


329 4-^ 362 




1 By similarity 1 


11 1 


□ 


Disulfide bond 


367 ^ 414 




1 By similarity I 


II 1 

1 


□ 


Disulfide bond 


397 ^ 432 




1 By similarity | 


III 


□ 


Disulfide bond 

i 


436 ^ 572 




Interchain (between heavy and 

light rhaine) 1 Potential 1 


1 


1 


□ 


Disulfide bond 1 


475 491 




i By similarity I 


\ 


1 

1 
1 

1 


□ 


! 

Disulfide bond 


614 631 i 




1 By similarity 




□ 


Disulfide bond 

1 


642 -H- 672 




1 By similarity | 


1 1 

1 

1 
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Natural variations 



Alternative sequence 



Alternative sequence 



Alternative sequence 



Alternative sequence 



1 - 113 



364 - 380 



381 - 699 



435 



Alternative sequence 



Natural variant 



□ 



□ 



436 - 699 



21 



Natural variant 



568 



Natural variant 



679 



Experimental info 



□ 



□ 



□ 



□ 



Mutagenesis 



Mutagenesis 



Mutagenesis 



Mutagenesis 



Mutagenesis 



68 



77 



99 



121 



122 



Mutagenesis 



123 



113 



17 



319 



1 



264 



1 



1 



Missing in isoform 4. 



IVDCR...GL1TF — 
KNEIDLESELKSEQ\/TE in 
isofomn 3. 



Missing in isoform 3. 




V — 

ECGQPSRSLPSLVKRIIGGR 
NAEPGLFPWQALIWEDTSR 
VPNDKWFGSGALLSASWILT 

aahvlrsqrrpttvipvske 

hvtvylglmdvrdksgavns 

saarwlhpdfniqnynhdi 

alvqlqepvplgphvmpvcl 

prlepegpaphmlglvagwg 

ISNPNVTVDEIISSGTRTLS 
DVLQYVKLPWPHAECKTSY 
ESRSGNYSVTENMFCAGYYE 
GGKDTCLG DSGGAFVI FDDL 
SQRWWQGLVSWGGPEECGS 
KQ VYG VYTKVS N YVDWVWEQ 
MGLPQSWEPQVER in isofornn 
2 and isoform 4. 



Missing in isoform 2 and isoform 
4. 



I. [dbSNPirsI 062049] 



V->A. [dbSNP:rs1 3322090] 



G — R. [dbSNP:rs3774266] 



1 



1 



E — ► A or Q: Partial loss of 
interaction w ith FC N2, FCN3 
and MBL2. 



Y ^ A: Partial loss of Interaction 
with FC N2, FCN3 and MBL2. 

I Ref.25 I 



1 



1 



1 



E — » A: Partial loss of interaction 
with FC N2, FGN3 and MBL2. 

I Ref.25 I 



D — ► A or N: Loss of interaction 
with FNC2 and FCN3 and partial 
loss of interaction with MBL2. 

I Ref.25 I ; 



F — » A: Partial loss of interaction 
with FC N2, FCN3 and MBL2. 

I Ref.25 I 



S — ► A: Partial loss of interaction 
with FC N2, FCN3 and MBL2, 

I Ref.25 I 
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Mutagenesis 


125 


1 


E A: Partial loss of interaction 
with FCN2. FCN3 and MBL2. 

1 Ref.25 1 


1 


1 

1 


LJ 


Mutagenesis 


237 


1 


H — ► A: Loss of interaction with 
FCN2, FCN3 and MBL2. 1 Ref 25 I 


1 1 




m M m * 

Mutagenesis 


239 


1 


E — > A: Partial loss of interaction 
with FCN2, FCN3 and MBI-2. 

1 Ref.25 1 


1 


LJ 


Mutagenesis 


244 


1 


Y — ► A: Loss of interaction with 
FCN2, FON3 and MR! ? 1 R^f 25 


1 




□ 

■■■■■1 1 III* 


Mutagenesis 


262 


1 


E ^ A: Partial loss of interaction 
with FCN2. FCN3 and MBL2. 

1 Ref.25 1 


1 






Mutagenesis 


274 




S A: Partial loss of interaction 
with FCN2 and FCN3. No effect 
on interaction with MBL2. 

1 Ref.25 1 ■ 


1 








Mutagenesis 


283 


™„. 

1 


N ^ A: Partial loss of interaction 
With FCN2, FCN3 and MBL2. 

1 Ref 25 1 


1 




□ 


Mutagenesis 




1 


E — > A: Partial loss of interaction 
with FCN2. FCN3 and MBL2. 

1 Ref.25 1 


1 




L_J 


Mutagenesis 


646 


1 


S — ^ A: No autoproteolytic 

prnrARRing 1 Ref.21 | 










Sequence conflict 


2 


1 


R K in RAF84-'^7-R 1 Ref.6 j 


1 


I 




: 


Sequence conflict 


89 


1 


T A in nAH1ft4nP 1 Ref.7 I 


1 


% 
1 

! 




Sequence conflict 


i^£Ofc> 


U 


F — > 1 in RAFR4-^7F; 1 Ref.6 | 


1 1 


Id i 


! 

Sequence conflict 

i 


f— 

235 


1 


E ^ Q in BAA05928. 1 2 | 


1 1 

. 1,„_ 


□ 1 


] 

Sequence conflict 


235 


1 


F O in RAA.^4RR4 1 Ref.3 | 


1 1 

, f 




Sequence conflict 


285 


1 


G — ► A in BAAQ5928 1 2 | 


i 1 

t 


□ 1 


Sequence conflict i 


/"X (*k r 

285 


1 


r; A in RAA.'^4Rfi4 1 Ref.3 | 


1 1 


□ 


Sequence conflict i 


285 


1 


n -* A in RAARPPOR 1 Ref.4 | 


1 1 




Sequence conflict 


392 


1 


F — ^ n in RAFR/iR4R 1 Ref.6 1 


i 1 


□ 


Sequence conflict 


499 


1 


E — > G in BAA05928. 1 2 | 


ir 




Sequence conflict 


499 


1 


F -* K in RAAn4477 1 Ref.1 






il 


□ 


Sequence conflict 


499 


1 


F _k in RAARP:?nfi 1 Ref.4 






1 


□ 


Sequence conflict 


f— «-> —7 
527 


1 


D — ► A in BAA34864. LSa:,S...I 






1 


□ 


Sequence conflict 


543 


1 


Q _>. K in RAAn4477 1 Ref.1 1 


j_ 




Sequence conflict 


552 


1 


D — V in RAA34864, Reo 


1 1 


1^ j 


Sequence conflict 


643 


1 


A .«? in RAAa<l477 1 Ref.1 1 
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Details... 



Sequences 



Sequence 



Length Mass (Da) 



Isoform 1 [UniParc]. 

Last modified December 16, 2008. Version 3. 
Checksum: 5B37C7FB9F51FD1D 



PASTA 



699 



79.247 



10 20 30 40 50 60 

MRWLLLYYAL CFSLSKASAH TVELNNMFGQ IQSPGYPDSY PSDSEVTWNI TVPDGFRIKL 



70 80 90 100 . 110 120 

YFMHFNLESS YLCEYDYVKV ETEDQVLATF CGRETTDTEQ TPGQEWLSP GSFMSITFRS 



130 

DFSNEERFTG 



140 150 
FDAHYMAVDV DECKEREDEE 



190 200 
CRVECSDNLF TQRTGVITSP 



210 

DFPNPYPKSS 



160 

LSCDHYCHNY 



170 180 
IGGYYCSCRF GYILHTDNRT 



220 230 
ECLYTIELEE GFMVNLQFED 



240 

IFDIEDHPEV 



250 260 270 280 290 300 

PCPYDYIKIK VGPKVLGPFC GEKAPEPIST QSHSVLILFH SDNSGENRGW RLSYRAAGNE 



310 



320 



330 



340 



350 



360 



CPELQPPVHG KIEPSQAKYF FKDQVLVSCD TGYKVLKDNV EMDTFQIECL KDGTWSNKIP 



370 

TCKIVDCRAP 

r 

430 

MNKVLGRSLP 
4 90 

GSSWIVTAAH 



380 

GELEHGLITF 
44 0 

TCLPVCGLPK 

500 

CLHQSLDPED 



390 

STRNNLTTYK 
450 

FSRKLMARIF 

510 

PTLRDSDLLS 



4 00 

SEIKYSCQEP 
4 60 

NGRPAQKGTT 



410 

YYKMLNNNTG 
470 

PWIAMLSHLN 



420 

lYTCSAQGVW 
480 

GQPFCGGSLL 



520 530 540 

PSDFKIILGK HWRLRSDENE QHLGVKHTTL 



550 560 570 

HPQYDPNTFE NDVALVELLE SPVLNAFVMP 



580 .590 600 

ICLPEGPQQE GAMVIVSGWG KQFLQRFPET 



610 



620 



LMEIEIPIVD HSTCQKAYAP 



630 

LKKKVTRDMI 



640 .650 660 

CAGEKEGGKD ACAGDSGGPM VTLNRERGQW 



670 



680 



690 



YLVGTVSWGD DCGKKDRYGV YSYIHHNKDW IQRVTGVRN 



« Hide 



Isoform 2 (IVIASP-S). 

Checksum: 09B5297A6C14283A 
Show » 



PASTA 



728 



isoform 3. 

Checksum: DDED114311A62714 

Show » 



PASTA 



380 



Isoform 4. 

Checksum: EEE63886709340PA 
Show » 



PASTA 



615 



81.860 



43.640 



68.918 
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IPR009003. Pept_cys/ser__Trypsin-like. 
IPR018114. Peptidase_S1/S6_AS. 
IPR001254. Peptidase_S1_S6. 
IPR001314, Peptidase_S1A. 
IPR000436. Sushi_SCR_CCP. 
[Graphical view] 



G3DSA:2. 10.70. 10. Complement_controLmodule. 2 hits. 
G3DSA:2.60. 120.290. CUB. 2 hits. 



http://www.uniprot.org/uniprot/P48740 
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Pfam 


PF00431 CUB 2 hits 
PF07645. EGF CA. 1 hit. 
PF00084, Sushi. 2 hits. 
PF00089. Trypsin. 1 hit. 
[Graphical view] 


PRINTS 


PR00722 CHYMOTRYPSIN 

1 » > W » ^bCb. I 1 IVIX^ 1 1 ^ 1 1 ^^1 I ^ . 


SMART 


SM00032. CCP. 2 hits. 
SM00042. CUB. 2 hits. 
SM0017i9. EGF CA. 1 hit. 
SM00020. TrypISPc. 1 hit. 

[Graphical view] 


SUPFAM 


SSF57535. Complement control module. 2 hits. 
SSF49854. CUB. 2 hits. 
SSF50494. Pept_Ser_Cys. 1 hit. 


PROSITE 


PS00010. ASX HYDROXYL 1 hit. 
PS01180. CUB. 2 hits. 
PS00022. EGF 1. False negative. : 
PS01186. EGF_2. 1 hit. 
PS50026. EGF 3. False negative. 
PS01187. EGF CA. 1 hit. 
PS50923. SUSHI. 2 hits. 
PS50240. TRYPSIN DOM. 1 hit. 
PS00134. TRYPSIN_HIS. 1 hit. . 

[Graphical view] 


ProtoNet 


Search... 


Other Resources 




NextBio 


21938. . 


SOURCE 


Search... 

• 


Entry information 




Entry name 


MASP1_HUMAN 


Accession 


Primary (citable) accession number: P48740 
Secondary accession number(s): A8K542 fel Q9UF09 


Entry history 1 


Integrated into February 1/1996 

UniProtKB/Swiss- 

Prot: 

Last sequence December 16, 2008 
update: 

Last modified: Apnl 5, 2011 

This is version 1 16 of the entry and version 3 of the sequence. 
[Complete history] 


Entry status 


Reviewed (UniProtKB/Swiss-Prot) 


Annotation nrooram 


Chordata Protein Annotation Proa ram 

l^^l ^JOI«OI 1 1 W*^?ll 1 w\t 1 1 i^/Ld hl^/l III < ^ III 


Disclaimer 


Any medical or genetic information present in this entry is 
provided for research, educational and informational purposes 
only. It is not in any way intended to be used as a substitute for 
professional medical advice, diagnosis, treatment or care. 


Relevant documents 





Human chromosome 3 



http://www,uniprot.org/uniprot/P48740 
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Q96P40 (Q96P40_HUMAN)# Unreviewed, UniProtKB/TrEMBL 

Last modified March 2, 2010. Version 19. 



Names and origin 



Protein names 



j Submitted name: 



ADAMTS 1 3 I EMBLAAL11 094.1 



Gene names 



Name:ADAMTS13 . emblaaliio94.i 



Organism 



i Homo sapiens (Human) I emblaaliio94T 



Taxonomic identifier 



9606 [NCBI] 



Taxonomic lineage 


Eukaryota > Metazoa > Chordata > Craniate > Vertebrata > 

Euteleostomi > Mammalia > Eutheria > Euarchontoglires > 
Primates > Haplorrhini > Catarrhini > Hominidae > Homo 


Protein attributes 




Sequence length 


45 AA. 


Sequence status 


Fragment. 


Protein existence 


Evidence at transcript level. 



Sequence annotation (Features) 



I Feature key 



Position j Length i Description 



Graphical view 



\ Feature 
I identifier 



Experimental info 



□ 


i 1 

1 Non-terminal residue j 


1 1 




1 Non-terminal residue i 


45 1 



1 I EMBLAAL1 1094,1 | [ I 

I ! 

EMBLAAL1 1094.1 | | 



Sequences 



Sequence 



Length Mass (Da) 



Q Q96P40 [UniParc]. 

Last modified December 1, 2001. Version 1 
Checksum: 2D4AA2E2F39EC633 



PASTA 



45 



4,535 



I 10 20 30 40 

LFVCRFTGGM ARSMATSPAQ TSPSPTSSLS HGRPGCGPLC VGPAR 



« Hide 



References 



[1] 



I „ 



Mutations in a member of the ADAMTS gene family cause thrombotic 
thrombocytopenic purpura." 



http://www.uniprot.org/uniprot/Q96P40 
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Levy G.G., Nichols W.C., Lian E.G., Foroud T., McCIintick J.N., McGee B.M., Yang A,Y., 
Siemieniak D.R., Stark K.R., Gruppo R., Sarode R., Shurin S.B., Chandrasekaran V., Stabler 
S.P., Sabio H., Bouhassira E.E., Upshaw J.D. Jr., Ginsburg D., Tsai H.-M. 
Nature 413:488-494(2001) [PubMed: 11586351] [Abstract] 
Cited for : NU CLEOTIDE SEQ UENCE. 
Tissue: Liver I emblaaliio94.i 



C ross -ref e re n ces 



Sequence databases 

@EMBL 
GenBank 
DDBJ 



AF414400 mRNA. Translation: AAL1 1094.1 



3D structure databases 

ModBase 

Family and domain databases 
ProtoNet 

Entry information 



Search... 



Search. 



Entry name 



i 



Q96P40 HUMAN 



Accession 



Primary (citable) accession number: Q96P40 



Entry history 



Integrated into December 1, 2001 

UniProtKBn"rEMBL: 

Last sequence December 1 . 2001 

update: 

Last modified: March 2, 2010 

This is version 19 of the entry and version 1 of the sequence. 





[Complete history] 


Entry status 


Unreviewed (UniProtKB/TrEMBL) 


Disclaimer 


Any medical or genetic information present in this entry is 
provided for research, educational and informational purposes 

only. It is not in any way intended to be used as a substitute for 
professional medical advice, diagnosis, treatment or care. 



©2002-2011 UniProt Consortium | License & Disclaimer | Contact 
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Protein 

Translations of Life 



Display Settings: GenPept 

von Willebrand factor-cleaving protease [Homo sapiens] 



GenBank: BAB69487.2 
PASTA Graphics 



liOCUS 

DEFINITION 

ACCESSION 

VERSION 

DBSODRCE 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 

TITLE 



JOURNAL 
PUBMEO 
REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



COMMENT 
FEATURES 

source 



BAB69487 1427 aa 

von Willebrand £actor-cleaving protease 

BAB69487 

BAB69487.2 01:16117338 
accession AB06d693.2 



linear PRI 17-OCT-2001 
[Homo sapiens] . 



Protein 



Region 



Site 



Region 



CDS 



Homo sapiens (human) 

Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutherla; Euarchcntoglires; Primates; Haplorrhini; 
Catarrhini; Hominidac; Homo. 
1 

Soe j ima , K . , Mimura / N . , Hirashima , M . , Maeda , H . , Hamamoto, T . , 
Nakagaki,T. and Nozaki.C. 

A novel human metalloprotease synthesized in the liver and secreted 
into the blood: possibly, the von willebrand factor- cleaving 
protease? 

J. Biochem. 130 (4), 475-480 (2001) 

11S74066 

2 (residues 1 to 1427} 
Soe j ima, K. and Mimura, N. 
Direct Submission 

Submitted (08-AUG-2001) Kenji Soejima, The Chemo-Sero-Therapeutic 
Research Institute, First Research Department; Kyokushi, Kikuchi, 
Kumamoto 869-1298, Japan (E-mail : 5oejima@kaketsuken, or. jp, 
Tel:81-968-37-3100 (ex. 5250) , Fax: 81-968-37-3616) 
On Oct 12, 2001 this sequence version replaced gi: 15990913 . 

Location/Qualifiers 

1. . 1427 

/organism=: " Homo sapiens " 
/db_xre f « * t axon ; 9606 * 
/chromosome^ " 9 " 
/map="9q34" 
/ 1 i s s ue_t:ype ■ * 1 i ve r " 
1. . 1427 

/products "von Willebrand factor- cleaving protease" 
80. .283 

/region_name = " ZnMc_ADAMTS_like " 

/note= "Zinc-dependent metalloprotease, ADAMTS_like 
subgroup. ADTMs (A Disintegrin And Metalloprotease) are 
glycoproteins, which play roles in cell signaling, cell 
fusion, and cell-cell interactions. This particular 
subfamily represents domain architectures...; cd04273" 
/db_xre f - " CDD : 58574 » 
order(224. .225,228,234) 
/site_type= " active " 
/db_xr e f =" CDD : 58574 " 
387. .439 

/region_name= "TSP_1 " 

/noteo"Thrombospondin type 1 domain; cl02514" 
/ db_x re f »" CDD : 154951 " 
1. . 1427 

/gene = "vWF-CP'' 

/coded_by=''AB06 9698 . 2 :445. .4728" 
/no t e - " ADAMTS 1 3 ; 

a disintegrin- like and metalloprotease (reprolysin type) 
with thrombospondin type 1 motif, 13; 
vWFCP" 



ORIGIN 



// 



1 mhqrhprarc pplcvagila cgfllgcwgp shfqqsclqa lepqavssyl spgaplkgrp 

61 pspgfqrqrq rqrraaggil hlellvavgp dvfqahqedt eryvltnlni gaellrdpsl 

121 gaqfrvhlvk mviltepega pnitanltss llsvcgwsqt inpeddtdpg hadlvlyitr 

181 fdlelpdgnr qvrgvtqlgg acsptwscli tedtgfdlgv tiaheighsf glehdgapgs 

241 gcgpsghvma sdgaapragl awspcsrrql Isllsagrar cvwdpprpqp gsaghppdaq 

301 pglyysaneq crvafgpkav actfarehld mcqalschtd pldqsscsrl Ivplldgtec 

361 gvekwcskgr crslveltpi aavhgrwssw gprspcsrsc gggwtrrrq cnnprpafgg 

421 racvgadlqa emcntqacek tqlefmsqqc artdgqplrs spggasfyhw gaavphsqgd 

481 alcrhmcrai gesfimkrgd sfldgtrcmp sgpredgtls Icvsgscrtf gcdgrmdsqq 

541 vwdrcqvcgg dnstcsprkg sftagrarey vtfltvtpnl tsvyianhrp Ifthlavrig 

601 grywagkms ispnttypsl ledgrveyrv altedrlprl eeiriwgplq edadiqvyrr 

661 ygeeygnltr pditftyfqp kprqawvwaa vrgpcsvscg aglrwvnysc Idqarkelve 

721 tvqcqgsqqp pawpeacvle pcppywavgd fgpcsascgg glrerpvrcv eaqgsllktl 

781 pparcragaq qpavaletcn pqpcparwev sepssctsag gaglalenet cvpgadglea 

641 pvtegpgsvd eklpapepcv gmscppgwgh Idatsageka pspwgsirtg aqaahvwtpa 

901 agscsvscgr glmelrflcm dsalrvpvqe elcglaskpg srrevcqavp cparwqykla 

961 acsvscgrgv vrrilycara hgeddgeeil Idtqcqglpr pepqeacsle pcpprwkvms 

1021 Igpcsascgl gtarrsvacv qldqgqdvev deaacaalvr peasvpclia dctyrwhvgt 

1081 wmecsvscgd giqrrrdtcl gpqaqapvpa dfcqhlpkpv tvrgcwagpc vgqgtpslvp 

1141 heeaaapgrt tatpagasle wsqargllfs papqprrllp gpqensvqss acgrqhlept 

1201 gtidmrgpgq adcavaigrp Igewtlrvl esslncsagd mlllwgrltw rkmcrklldm 

1261 tfssktntlv vrqrcgrpgg gvllrygsql apetfyrecd mqlfgpwgei vspslspats 

1321 naggcrlfin vaphariaih alatnmgagt eganaayili rdthslrtta fhgqqvlywe 

1381 sessqaemef segflkaqas Irgqywtlqs wvpenqdpqs wkgkegt 



http://www.ncbi.nlm.nih.gov/protein/BAB69487 
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P08473 (NEP_HUMAN) # Reviewed, UniProtKB/Swiss-Prot 

Last modified April 5, 2011. Version 146. 



Names and origin 



Protein names 


Recommended name: 
Neprilysin 

EC=3.4.24.11 
Alternative name(s): 
Atriopeptidase 

Common acute lymphocytic leukemia antigen 

Short name^CALLA 

Enkephalinase 

Neutral endopeptidase 24.11 

Short name=NEP 

Short name=Neutral endopeptidase 

Skin fibroblast elastase 

Short r)ame=SFE 

CD_antlgen=CD10 


Gene names 


Name: MIVIE 
Synonyms:EPN 


Organism 


Homo sapiens (Human) [Complete proteome] 


Taxonomic identifier 


9606 [NCBI] 


Taxonomic lineage 


Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > 
Euteleostomi > Mammalia > Eutheria > Euarchontoglires > 
Primates > Haplorrbini > Catarrhini > Hominidae > Homo 



Protein attributes 



Sequence length 


750 AA. 


Sequence status 


Complete. 


Sequence processing 


The displayed sequence is further processed into a mature 
form. 


Protein existence 


Evidence at protein level. 



General annotation (Comments) 




Catalytic activity 



Cofactor 



Enzyme regulation 



Thermolysin-like specificity, but is almost confined on acting on 
polypeptides of up to 30 amino acids. Biologically important in 
the destruction of opioid peptides such as Met- and Leu- 
enkephalins by cleavage of a Gly-Phe bond. Able to cleave 
anglotensin-1, angiotensin-2 and angiotensin 1-9. Involved in 
the degradation of atrial natriuretic factor (ANF). Displays UV- 
inducl ble elastase activity tow ard skin preelastic and elastic 

fibers. I Ref.9 1 1 Ref.t2 1 1 Ref.14 j 



Preferential cleavage of polypeptides between hydrophobic 
residues, particularly with Phe or Tyr at PI'. 



Binds 1 zinc ion per subunit. I Ref.is j | Ref.i7 | 
Inhibited in a dose dependent manner by opiorphin. 



http://vvww.uniprot.org/uniprot/p08473 
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Subcellular location 


Cell membrane; Single-pass type II membrane protein. 


Miscellaneous 


important cell surface marker in the diagnostic of human acute 
lymphocytic leukemia. 


Sequence similarities 


Belongs to the peptidase M13 family. 


Biophysicochemical properties 


Kinetic oarameters: 
Km=55.1 mM for angiotensin-1 ^2 
Km=179 pM for angiotensin-2 
Km=1 1 1.4 pM for angiotensin 1-9 


Sequence caution 


The sequence CAA30157.1 differs from that shown. Reason: 
Erroneous initiation. 



Ontologies 



Keywords 



Cellular component 


Cell membrane 
Membrane 


Domain 


Signal-anchor 
Transmembrane 
Transmembrane helix 


Ligand 


Metal-binding 
Zinc 


Molecular function 


Hydrolase 

Metalloprotease 

Protease 


PTM 


« 

Disulfide bond 
Glycoprotein 


Technical term 


3D-structure 
Complete proteome 



Gene Ontology (GO) 



Biological process 


cell-cel! signaling 

Traceable author statement. Source: Protinc 
proteolysis 

Traceable author statement. Source: Protinc 


Cellular component 


integral to plasma membrane 

Traceable author statement. Source: Protinc 


Molecular function 


metal ion binding 

Inferred from electronic annotation. Source: UniProtKB-KW 

metalloendopeptidase activity 

Inferred from electronic annotation. Source: InterPro 

protein binding 

Inferred from physical interaction. Source: IntAct 



Complete GO annotation.. 



http://www.uniprot.org/uniprot/p08473 
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Binary interactions 



With 


Entry 


#Exp. 


IntAct 


Notes 


HSPA1A 


P08107 


1 


EBI-353759,EBI-629985 




HSPB1 


P04792 


1 


EBI-353759,EBI-352682 




NDRG1 


Q92597 


1 


EBI-353759,EBI-716486 





Sequence annotation (Features) 



Feature key 



Molecule processing 



Position(s) Length Description j Graphical view 



Feature iderjitif 



□ 


Initiator methionine 


1 


1 


Removed 


1 




B 


Chain 


2-750 


749 


Neprilysin 




PRO. 



Regions 



Topological domain 



Transmembrane 



Topological domain 



Motif 



2-28 



29-51 



52 - 750 



16-23 



27 



23 



699 



8 



Cytoplas mic ^ I 

Potential 



Helical; 
Signal- 
anchor for 
type 11 
membrane 
protein; 

I Potentiail 



■4 



Extracell ular 

I Potential | 



Stop- 
transfer 
sequence 

I Potential I 



Sites 



□ 


Active site 


591 


1 1 






Active site 


651 


1 


Proton 
donor 

1 Ref.10 1 


□ 


Metal binding 


584 


1 


Zinc; 
catalytic 


□ 


Metal binding 

1 


588 


1 


Zinc; 
catalytic 


□ 

1 


Metal binding 


647 


1 


Zinc; 
catalytic 




http://www.uniprot.org/uniprot/p08473 
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n 



Binding site 



103 



Amino acid modifications 



□ 



□ 



Glycosylation 



Glycosylation 



Glycosylation 



Glycosylation 



Disulfide bond 



Disulfide bond 



Disulfide bond 



Disulfide bond 



Disulfide bond 



Disulfide bond 



Experimental info 



145 



285 



325 



628 



57 62 



80 ^ 735 I 



88 ^ 695 



143^411 



234 242 



621 ^747 
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1 



Substrate 
carboxyl 

I By similarity j 



1 



1 



1 



N-linked 
(GlcNA c. 

I Ref.16 1 
I Ref.17 I 



•) 



Ref.11 



Ref.15 



N-linked 
(GlcNA c..,) 

Ref.11 I 



N-linked 
(GlcNA c. 

Ref.16 I 



•) ! 



1 



Ref.17 



Ref.15 



N-linked 
(GlcNA c. 

I Ref .16 I 



Ref .17 I 



Ref,15 



Ref.16 



f 



Ref .17 i 



Ref.16 I 



Ref.17 I 



1 Ref.16 I 
Ref.17 I 



Ref.16 



Ref.17 



Ref.16 



I 



Ref.17 



Ref.16 



I I 



Ref.17 I 





Sequence conflict 


26 


1 


P^Rin ,1 
AAA51915. 1 

1 Ref.4 1 


□ 


Sequence conflict 


44 


1 


T-^Rin 1 

AAA51915. i i 

Ref.4 { . i 


□ 


Sequence conflict 


81 


1 


T — Rin 1 1 ■ 

AAA51915. i ! 

Ref.4 1 * 
' ' i 1 



http://www,uniprot.org/uniprot/p08473 
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Sequence conflict 



304 



1 I T R in 

AAA51 915. 
I Ref.4 I 



Secondary structure 



• fl ■ « I 



I Helix I Strand | Turn 



Details... 



Sequences 



Sequence 



Length Mass (Da) 



U P08473 [UniParc]. 



FASTA 



760 



85,514 



Last modified January 23, 2007. Version 2. 
Checksum: BCF3827C39898630 



10 

MGKSESQMDI 
70 



20 

TDINTPKPKK 
80 



30 

KQRWTPIiEIS 
90 



401 

LSVLVLLIiTI 
100 



50 60 
lAVTMIAIiYA TYDDGICKSS 



110 



120 



DCIKSAARLI QNMDATTBPC TDFFKYACGG WLKRNVIPET SSRYGNFDIL RDELEWLKD 

13£ 14£ 150 160 17 0 18£ 

VliQEPKTEDI VAVQKAKALY RSCINESAID SRGGEPLLKL LPDIYGWPVA TENWBQKYGA 

190 200 210 220 230 240^ 

SWTAEKAIAQ LNSKYGKKVIi INLFVGTDDK NSVNHVIHID QPRLGDPSRD YYECTGIYKE 

250 260 270 280 .290 300 

ACTAYVDFMI SVARLIRQEE RLPIDENQLA LEMNKVMELE KEIANATAKP EDRNDPMLLY 

310 320 33£ 340 350 360 

NKMTIAQIQN NFSLEINGKP FSWLNFTNEI MSTVNISITN EEDVWYAPE YLTKLKPILT 



370 

KYSARDIiQNL 



38 0 

MSWRFIMDLV 



390 

SSL.SRTYKES 



4 00 

RNAFRKAIiYG 



410 

TTSETATWRR 



4 3£ 44 0 

NAVGRLYVEA AFAGESKHW 



45£ 46£ 47£ 

EDLIAQIREV FIQTLDDLTW MDAETKKRAE 



490 



500 



510 



520 



530 



420 

CANYVNGNME 
48£ 

EKTVLAIKERI 
54 0 



GYPDDIVSND NKLNNEYIiBIi NYKEDEYFEN IIQNLKFSQS KQLKKLREKV DKDEWISGAA 



550 

WNAFYSSGR 



560 

NQIVFPAGIL 



570 

QPPFFSAQQS 



58£ 

NSLNYGGIGM 



59 0 

VIGHEITHGF 



60 0 

DDNGRNFNKD 



610 62 0 63£ 640 650 660 

GDLVDWWTQQ SASNFKEQSQ CMVYQYGNFS WDLAGGQHLN GINTLGENIA DNGGLGQAYR 

670 680 69£ 700 71£ 72£ 

AYQNYIKKNG BBKLLPGLDL NHKQLFFLNF AQVWCGTYRP EYAVNSIKTD VHSPGNFRII 



73£ 

• GTLQNSAEFS 

« Hide 



74£ 75£ 
EAPHCRKNSY MNPEKKCRVW 
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Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA]. 
Tissue: Brain. 


[8] 


"Molecular cloning and amino acid sequence of human enkephalinase (neutral 
endopeptidase)." 

Malfroy B., Kuang W.-J., Seeburg P.H.. Mason A.J., Schofield P.R. 
FEBS Lett: 229:206-210(1988) [PubMed: 3162217] [Abstract] 
Cited for: NUCLEOTIDE SEQUENCE fMRNAl OF 3-750. 
Tissue: Placenta. 


[9] 


"Endopeptidase-24.1 1 in human plasma degrades atrial natriuretic factor (ANF) to ANF 
(99-105/106-126)." 

YandleT.G., Brennan S.O., Espiner E.A., Nicholls M.G., Richards A.M. 
Peptides 10:891-894(1989) [PubMed: 2531377] [Abstract] 
Cited for: FUNCTION IN THE DEGRADATION OF ANF. 


[10] 


"Asp650 is crucial for catalytic activity of neutral endopeptidase 24-11." 

Le Moual H., Dion N., Roques B.P., Crine P., Boileau G. 

cur. J. Diocnem. /.^ \ .h/d-^o\J\ lyy**^ [nuuiviea. o iqouo^j [rNUouctwij 

Cited for: ACTIVE SITE ASP-651 . 


[11] 


"Identification and quantification of N-linked glycoproteins using hydrazide chemistry, 
stable isotope labeling and mass spectrometry." 
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Zhang H.. Li X.-J., Martin D.B., Aebersold R. 

Nat. Biotechnol. 21:660-666(2003) [PubMed: 12754519] [Abstract] 

Cited for: GLYCOSYLATION AT ASN-145 AND ASN-285. 


[12] 


"Evaluation of angiotensin-converting enzyme (ACE), its homologue ACE2 and 
neprilysin in angiotensin peptide metabolism." 

Rice G.I.. Thomas D.A., Grant P.J., Turner A.J. . Hooper N.M. 
Biochem. J. 383:45-51(2004) [PubMed: 15283675] [Abstract] 

Cited for: FUNCTION IN ANGIOTENSIN PEPTIDE METABOLISM. BIOPHYSICOCHEMICAL 
PROPERTIES. 


[13] 


"Human opiorphin, a natural antinociceptive modulator of opiold-dependent 
pathways." 

Wisner A., Dufour E., Messaoudi M., Nejdi A., Marcel A., Ungeheuer M.-N., Rougeot C, 
Proc. Natl. Acad. Sci. U.S.A. 103:17979-17984(2006) [PubMed: 17101991] [Abstract] 
Cited for: INHIBITION BY OPIORPHIN. 


[14] 


"Neprilysin is Identical to skin fibroblast elastase: its role |n skin aging and UV 
responses." 

Morisaki N., Moriwaki S., Sugiyama-Nakagiri Y., Haketa K.^ Takema Y., Imokawa G. 
J, Biol. Chem. 285:39819-39827(2010) [PubMed: 20876573] [Abstract] 
Cited for: IDENTIFICATION AS SKIN FIBROBLAST ELASTASE, FUNCTION. 


[15] 


"Structure of human neutral endopeptldase (Neprilysin) complexed with 
phosphoramidon." 

Oefner C, D'Arcy A.. Hennig M., Winkler F.K., Dale G.E. 

J. Mol. Biol. 296:341-349(2000) [PubMed: 10669592] [Abstract] 

Cited for: X-RAY CRYSTALLOGRAPHY (2.1 ANGSTROMS). GLYCOSYLATION AT ASN- 
145; ASN-325 AND ASN-628. 


[16] 


"Structural analysis of neprilysin with various specific and potent inhibitors." 

Oefner C, Roques B.P.. Fournie-Zaluski M.-C. Dale G.E. 

Acta Crystallogr. D 60:392-396(2004) [PubMed: 14747736] [Abstract] 

Cited for: X-RAY CRYSTALLOGRAPHY (1.95 ANGSTROMS) OF 55-750 IN COMPLEXES 
WITH ZINC IONS AND SYNTHETIC INHIBITORS. DISULFIDE BONDS. COFACTOR. 
GLYCOSYLATION AT ASN-145; ASN-325 AND ASN-628. 


[17] 


"Structural studies of a blfunctlonal Inhibitor of neprilysin and DPP-IV." 

Oefner C, Pierau S., Schuiz H., Dale G.E. 

Acta Crystallogr. D 63:975-981(2007) [PubMed: 17704566] [Abstract] 

Cited for: X-RAY CRYSTALLOGRAPHY (2.05 ANGSTROMS) OF 55-750 IN COMPLEX 

\A/ITI-I 7IMP irMVlQ AKin TUP QVMTHPTIP IMHIRITDR HI^IJI FIPiF RONDS 

COFACTOR. GLYCOSYLATION AT ASN-145; ASN-325 AND ASN-628. 


• 

+ 


Additional computationally mapped references. 


Web resources 


Atlas of Genetics and Cytogenetics in Oncology and Haennatology 


Cros 


s-references 



Sequence databases 



@EMBL 
® GenBank 
® DDBJ 



Y00811 mRNA. Translation: CAA68752.1. 

J03779 mRNA. Translation: AAA51915.1. 

M26628 W M26627 Genomic DNA. Translation: AAA52294.1 

AK291761 mRNA. Translation: BAF84450.1. 

EU326307 Genomic DNA. Translation: ACA05913.1. 
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1 CH471052 Genomic DNA. Translation: EAW7875AA. 
CH471052 Genonnic DNA. Translation: EAW78755.1. 
CH471052 Genonnic DNA. Translation: EAW78756.1. 
CH471062 Genonnic DNA. Translation: EA\/V78757.1. 
CH471052 Genomic DNA. Translation: EAW78758.1. 
BC101632 mRNA, TransTation: AAI01 633.1. 
BC101658 mRNA. Translation:. AAI01 659.1. 
X07166 mRNA. Translation: CM3 Q157.1. Different initiation. 

IPI IPI00247063. 

PIR HYHUN. A41387. ; ;. ;^ 

RefSeq NPJ)00893.2. NM_000902.3; 

NP_009218.2. NM_0072B7.2. 
NP_009219,2. NM_007288.2. 

NP_009220.2. NM_007 289.2.;. 

UniGene Hs.307734. 



3D structure databases 



® PDBe 


Entry Method 


Resolution 


Chain 


Positions 


PDBsum 


® RCSB PDB 


(A) 










©PDBj 


1 DL9 model 




A 


508-750 




[»] 




1 DMT X-ray 


2.10 


A 


55-750 




[»] 






1 QVD model 




A 


56-750 




[»] 






1R1H X-ray 


1.95 


A 


55-749 










1R1I X-ray 


2.60 


A 


55-749 




t * 






1R1J X-ray 


2.35 


A 


55-749 




■ " 

1 n 






1Y8J X-ray 


2.25 . 


A 


55-749 




► ■ 






2QPJ X-ray 


2.05 


A 


55-750 




[»] 




ProteinModelPortal 


P08473. 












SMR 


P08473. Positions 55-750. 


ModBase 


Search... 













Protein-protein interaction databases 



IntAct 


P08473. 74 interactions. 


STRING 


P08473. 


Protein family/group databases 




MEROPS 


M1 3.001. 


PTiy/l databases 




PhosphoSite 


P08473. 



Proteomic databases 




PeptideAtlas 


P08473. 


PRIDE 


P08473. 



Genome annotation databases 



EnsembI 



ENST00000360490; ENSP00000353679; ENSG000001 96549. 
ENST00000404362; ENSP00000384558; ENSGOOOO0 196549. 
ENST00000460393; ENSP00000418525; ENSG000001 96549. 
ENST00000462745; ENSP00000419653; ENSG000001 96549. 
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ENST00000492661; ENSP00000420389; ENSG000001 96549. 
ENST00000493237; ENSP000004 17079; ENSG000001 96549. 


GenelD 


4311. 


KEGG 


hsa:4311. 


UCSC 


uc003fab.1. human. 


Organism-specific databases 


■ ' ; 


CTD 


4311. 


GeneCards 


GC03P152191. 


H-lnvDB 


HIX0003792. 


HGNC 


HGNC:7154. MME. 


HPA 


CAB000013. 


MIM 


120520. gene. 


neXtProt 


NX_P08473. 


PharmGKB 


PA30864. 


GenAtlas 


Search... 


Phylogenomic databases 


■ • 

■ 


eggNOG 


prNOG07698. 


GeneTree 


E NSGT00550000074200 . 


HOGENOM 


HBG701001. 


HOVERGEN 


HBG005554. 


InParanoid 


P08473. 


OMA 


TYRPcYA. 


OrthoDB 


EOG4XWFXB. 


PhylomeDB 


P08473. 


Enzyme and pathway databases 




BRENDA 


3.4.24.11. 247. 


Gene expression databases 


- 

- 


Array Express 


P08473. 


Bgee 


P08473. 


CleanEx 


HS__lvlivlE. 


Genevestigator 


P08473, 


GermOnline 


ENSGOOOO0 196549. Honno sapiens. 


Family and domain databases 




Inter Pro 


IPR000718. Peptidase_M13. 
IPR018497. Peptidase Ml 3 C. 
IPR008753. Peptidase_M13_N. 
[Graphical view] 


PANTHER 


PTHR11733. Peptidase_M13. 1 hit. 
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Pfam 


PF01431. Peptidase M13. 1 hit. 
PF05649. Peptidase_M13_N. 1 hit. 
[Graphical view] 


PRINTS 


PR00786. NEPRILYSIN. 


PROSITE 


PS00142. ZINC_PROTEASE, .1 hit. 
[Graphical view] 


ProtoNet 

Other Resources 

Drug Bank 


Search... 

DB00616. Candoxatnl. 


NextBio 


16959. 


SOURCE 


Search... 


Entry information 


• 


Entry name 


NEP_HUMAN 


Accession 


Primary (citable) accession number: P08473 

Secondary accession number(s): A8K6U6. D3DNJ9. Q3MIX4 


Entry history 


Integrated into August 1 , 1 988 

UniProtKB/Swiss- 

Prot: 

Last sequence January ^o, zvuf 
update: 

Last modified: April 5, 2011 

1 nis IS version iho ot me eniry ana version z or ine secfuence. 
[Complete history] 


Entry status 


Reviewed (UniProtKB/Swiss-Prot) 


Annotation program 


Chordata Protein Annotation Program 


Disclaimer 


Any medical or genetic information present in this entry is 
provided for research, educational and informational purposes 
only. It is not in any way intended to be used as a substitute for 
professional medical advice, diagnosis, treatment or care. 



Relevant documents 



Human cell differentiation molecules 

CD nomenclature of surface proteins of human leucocytes and list of entries 
Human chromosome 3 

Human chromosome 3: entries, gene names and cross-references to MIM 
MIM cross-references 

Online Mendelian Inheritance in Man (MIM) cross-references in UniProtKB/Swiss-Prot 
PDB cross-references 

Index of Protein Data Bank (PDB) cross-references 

Peptidase families 

Classification of peptidase families and list of entries 

SIMILARITY comments 

Index of protein domains and families 
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Protein 

Translations of Life 



Display Settings: GenPept 

common acute lymphoblastic leukemia antigen precursor [Homo sapiens] 

GenBank: AAA51915.1 

FASI& GfSDtliCS 

Go;?; 



LOCUS 

DEFINITION 

ACCESSION 

VERSION 

DBSOURCB 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 

TITLE 

JOURNAL 
PUBMED 
COMMENT 
FEATURES 

source 



Protein 

mat peptide 
Region 



Re g i on 



Site 



Site 



CDS 



ORIGIN 



AAA51915 750 aa linear PRI 31-OCT-1994 

common acute lymphoblastic leukemia antigen precursor [Homo 
sapiens] . 
AAA51915 

AAAS1915.1 GI: 179834 

locus HUMCAIiLA accession J03779. l 

Homo sapiens (human) 
Homo sapiens 

Eiikaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Butheria; Buarchontoglires; Primates; Haplorrhini; 
Catarrhini ; Hominidae ; Homo . 
1 (residues 1 to 750) 

Shipp^M.A., Richardson,N.B. , Sayre^P.H., Brown, N.R., 
Masteller, B.L. , Clayton, L. K. , Ritz,J. and Reinherz, E.L. 
Molecular cloning of the common acute lyn^hoblastic leiikemia 
antigen (CALIA) identifies a type II integral membrane protein 
Proc. Natl. Acad. Sci . U.S.A. 85 (13), 4819-4823 (1988) 
2968607 

Method: conceptual translation. 
Location/Quali fiers 
1. . 750 

/organisms "Homo sapiens" 
/db_xre f »" t axon : 9606 " 
/map="3q21-q27" 
1. .750 

/name* "common acute lymphoblastic leukemia antigen 
precursor" 
2. .750 

/products "common acute lymphoblastic leukemia antigen" 
72. ,750 

/ region_name = " PepO " 

/notes "Predicted metalloendopeptidase (Posttranslational 
modification, protein turnover, chaperones] ; CO63590" 
/db xref°"CDD: 33390 " 
79. .748 

/region_name«»"M13 " 

/notea "Peptidase family M13 includes neprilysin, 
endothel in- converting enzyme I; cd08662" 
/ db_xre f = " CDD 189000 " 

order(543. .544,581,584. .585,588,647,690. .691,712,718} 

/ si te_type= "active " 
/db xref="CDD: 169000 " 
order (584, 588, 647) 
/si t e_type = " other " 
/note="2n binding site" 
/db_xre f =» " CDD : 189000 " 
1. .750 
/gene="MME" 

/coded_by="J03779. 1:12. .2264" 
/db_xref="GDB:G00-12O-190" 



// 



1 mgksesgmdi tdintpkpkk kqrwtrleis Isvlvlllti iavrmialya tyddgickss 

61 dciksaarli qnmdattepc rdffkyacgg wlkrnvipet ssrygnfdil rdelewlkd 

121 vlqepktedi vavqkakaly rscinesaid srggepllkl Ipdiygwpva tenweqkyga 

181 swtaekaiaq Inskygkkvl inlfvgtddk nsvnhvihid qprlglpsrd yyectgiyke 

241 actayvdfmi svarlirqee rlpidenqla leronkvmele keianatakp edrndpmlly 

301 nkmrlaqiqn nfsleingkp fswlnftnei mstvnisitn eedvwyape yltklkpilt 

361 kysardlqnl mswrfimdlv sslsrtykes rnafrkalyg ttsetatwrr canyvngnme 

421 navgrlyvea afageskhw edliaqirev fiqtlddltw mdaetkkrae elealaikeri 

481 gypddivsnd nklnneylel nykedeyfen iiqnlkfsqs kqlkklrekv dkdewisgaa 

541 wnafyssgr nqivfpagil qppffsaqqs nslnyggigra vigheithgf ddngrnfnkd 

601 gdlvdwwtqq sasnfkeqsq cmvyqygnfs wdlaggqhln gintlgenia dngglgqayr 

661 ayqnyikkng eekllpgldl nhkqlfflnf aqvwcgtyrp eyavnsiktd vhspgnfrii 

721 gtlqnsaefs eafhcrknsy ranpekkcrvw 
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4/12/201 



unnamed protein product [H^mo sapiens] - Protein result 



Page 1 of 1 



Protein 
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Display Settings: GenPopt 

unnamed protein product [Homo sapiens] 

GenBank: CAA68752.1 
FASTA Gfaohics 



to; 
LOCUS 

DEFINITION 
ACCESSION 
VERSION 
DBSOURCE 
KEYWORDS 
SCHJRCE 

ORGANISM 



750 aa 

[Homo sapiens] 



linear 



PRI 12 -SEP- 1993 



REFERENCE 
AUTHORS 

TITLE 

JOURNAL 
PUBMED 
REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



COMMENT 
FEATURES 

source 



Protein 



Region 



Region 



ORIGIN 



CAA68752 

unnamed protein product 
CAA68752 

CAA68752.1 GI: 29626 
embl accession Y00811 , 1 

Homo sapiens (human) 

Homo sapiens 

Bukaryota; Metazoa; Chordata; Craniata; Vertebrata/ Euteleostomi; 
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; 
Catarrhini ; Hominidae ; Homo . 
1 

Letarte.M., Vera,s., Tran,R., Addis, J. B., Onlzu)ca,R. J. , 
Quackenbush , E . J . , Jongeneel< C.V. and McInnes«R.R. 
Common acute lymphocytic leukemia antigen is identical to neutral 
endopeptidase 

J. Exp. Med. 168 (4)« 1247-1253 (1988) 

2971756 

2 (residues 1 to 750) 
JongeneelfC.V. 
Direct Submission 

Submitted {ll-AUG-1988) Jongeneel C.v. , Ludwig Institute for Cancer 
Research, Lausanne Branch, Chemin des Boveresses, CH-1066 
Epalinges, Switzerland 

Data kindly reviewed (31-AUG-1988) by Jongeneel C.V. 
Location/Qualifiers 
1. .750 

/organism^ "Homo sapiens" 
/db_xre f =» " taxon : 9606 " 
/clone="45" 
/eel l^type- " cortex * 
/tissue_type= "kidney" 
/clone_lib=" lambda gtll" 
1. .''50 

/r-ames*" unnamed protein product" 
72. .750 

/ re g i on_name » " PepO " 

/notea" Predicted metalloendop^ptidase [Posttranslational 
modification, protein turnover, chaperones] ; COG3590" 
/db xref="CDD; 33350 " 
79. ,748 

/region_name="M13 " 

/note°" Peptidase family Ml 3 includes neprilysin, 
endothel in- converting enzyme I; cd08662*' 
/db_ xref ="CDD; 169000 " 

order (543. . 544 , 581 , 584 . .585,588,647,690. .691,712,718) 
/ si te^type = " ac t ive 
/db_xr e f = • CDD : 189000 " 
order (584, 58 8, 647) 
/si te^type= " other " 
/notes "Zn binding site" 
/ db_xre f »" CDD : 189000 " 
1. .750 

/coded_by="Y00811. 1:136. .2388" 
/note="CALLA protein (AA 1 - 750)" 
/db_xref ='!GDB: 120190" 
/db xref»"GQA; P084 73 " 
/db_xre f - " HGNC : 7154 ■ 
/db_xr e f = " In t e r Pr o : IPR000718 " 
/db_xre f » " Inter Pro ; IPR008753 " 
/db xref-"InterPro; IPR0ie4 97 " 
/ db^xr e f = " PDB : 1DL9 " 
/db xref =''PDB: IDMT " 
/db_xre f = " PDB : IQVD " 
/db_xre f - " PDB : IRIH • 
/db xref °"PDB: IRII " 
/ db_xr e f " " PDB : IRIJ " 
/db_xre f = " PDB : 1Y8J " 
/db xref = "PDB; 2QPJ " 

/db_xr e f » Ur i Pr ot KB/ Swi s s - Pr o t : P084 73 " 

1 mgksesqmdi tdintpkpkk kqrwtpleis Isvlvlllti iavtmialya tyddgickss 
61 dciksaarli qnmdattepc tdffkyacgg wlkmvipet ssrygnfdil rdelewlkd 
121 vlqepktedi vavqkakaly rscinesaid srggepllkl Ipdiygwpva tenweqkyga 
181 swtaekaiaq Inskygkkvl inlfvgtddk nsvnhvihid qprlglpsrd yyectgiyke 
241 actayvdfmi svarlirqee rlpidenqla lemnkvmele keianatakp edrndpmlly 
301 nkmtlaqiqn nfsleingkp fswlnftnei mstvnisitn eedvwyape yltklkpilt 
361 kysardlqnl mswrfimdlv sslsrtykes rnafrkalyg ttsetatwrr canyvngnme 
421 navgrlyvea afages)chw edliaqirev fiqtlddltw mdaetkkrae ekalaikeri 
481 gypddivsnd nklnneylel nykedeyfen iiq[nlkfsqs kqlkklrekv dkdewisgaa 
541 wnafyssgr nqivfpagil qppffsaqqs nslnyggigm vigheithgf ddngrnfnkd 
601 gdlvdwwtqq sasnfkeqsq cmvyqygnfs wdlaggqhln gintlgenia dngglgqayr 
661 ayqnyikkng eekllpgldl nhkqlfflnf aqvwcgtyrp eyavnsiktd vhspgnfrii 
721 gtlqnsaefs eafhcrknsy mnpekkcrvw 



Site 



Site 



CDS 
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Display Settings: GenPefrt 

enkephalinase [Homo sapiens] 

GenBanlc AAA52294.1 
PASTA Graphics 



09 tp; 
LOCUS 

DEFINITION 
ACCESSION 
VERSION 
DBSOURCB 



AAA52294 750 aa 

enkephalinase [Homo sapiens] . 
AAA52294 

AAA52294.1 01:179860 



linear PRI 07 -NOV- 1994 



locus HUMCAIjIA04 accession M26608 . 1 
locus HUMCALLAOS accession M26e09 



locus HUMCALIAOe accession M2661Q ■ 1 
locus HUMCAXOiAO? accession M266XX 



locus HDMCALLA08 accession M26612 . 1 
locus HUMCALLA09 accession M26613 



locus HUMCAIiLAlO accession M26614 . 1 
locus HUMCAUiAll accession M26615 



locus humcaIjIA13 accession 
locus HUMCALLA14 accession 
locus K0MCALLA15 accession ^26619.1 



locus HUMCALLA17 accession 
locus HUMCAIjLAIS accession 

locus HUMCAltlA19 accession M26623 



locus HUMCALiIA20 accession M26624 . 1 
locus HX]MCAUiA21 accession M26625 . 1 
locus HUMCALLA22 accession M26626 



locus HDMCALLA23 accession M26627 



M26607 


. 1 


M26608 


. 1 


M26609 


. 1 


M26610 


. 1 


M26611 


. 1 


M26612 


.1 


M26613 


. 1 


M26614 


.1 


M26615 


. 1 


M26616 


. 1 


M26617 


, 1 


M26618 


. 1 


M26619 


.1 


M26620 


.1 


M26621 


. 1 


M26622 


. 1 


M26623 


. 1 


M26624 


. 1 


M2662S 


. 1 


M26626 


.1 


M26627 


. 1 


M26e2B 


. 1 



KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 



JOURNAL 
PUBMEO 
COMMENT 
FEATURES 

source 



Method : 



Protein 



Region 



Region 



Sice 



Site 



CDS 



locus HUMCALLA24 accession M26e2B.l 

Homo sapiens (human) 
Homo sapiens 

Bxikaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; 
Catarrhini; Hominidae; Homo. 
1 (residues 1 to 750) 

D*Adamio,L., Shipp,M.A., Masteller,B.L. and Reinherz,E.L. 
Organization of the gene encoding common acute lymphoblastic 
leukemia antigen (neutral endopeptidase 24.11): multiple miniexons 
and separate 5* untranslated regions 

Proc. Natl. Acad. Sci. U.S.A. 86 (18), 7103-7107 (1989) 

2528730 

conceptual translation. 
Locat ion/Qual i f ier s 
1. .750 

/organisms "Homo sapiens** 
/db_xr e f ■ • t axon : 9606 
/map="3q21-q27" 
/cell_line="NALM 6" 
1. .750 

/product = " enkephal inase " 
/ EC_number » " 3 .4 .24 . 11 " 
72. .750 

/region_name« ** PepO " 

/noteo" Predicted metal loendopeptidase (Posttranslational 
modification, protein turnover, chaperones] ; 0063590" 
/db_xre f = " CDD : 33390 " 
79. .748 

/ reg i on_name = " M 13 " 

/note= "Peptidase family M13 includes neprilysin, 
endothel in- converting enzyme I; cd08662" 

/db_xr e f = " CDD : 1 89000 " 

order (543. .544,581,584. .585,588,647,690. .691,712,718) 
/si te_type= "active " 
/db_xr e f = " CDD : 189000 " 
order (584, 588, 647) 
/si te_type« " other " 
/notea'Zn binding site" 
/db xrefo'CDD; 189000 " 
1. .750 
/gene«»"MME" 

/coded_by=" join(M26607 . 1 : 18 . . 177,M26608 . 1;8 . .43, 



M26609.1:8. . 169,M26610 . 1 : 8 . 
M26612.1:8. . 126,M26613 . 1 : 8 . 
M26615.1:8. . 109,M26616 . 1 : 8 . 
M26618.1:8. . 136 , M26619 . 1 : 8 . 
M26621.1:e. . 111,M26622 . 1 : 8 . 
M26624.1:8. . 141,M26625 . 1 : 8 . 



.88,M26611.1:8. .103, 
. 73,M26614 . 1:8 . .142, 
.144,H26617.1:8. .101, 
.106,M26620.1:8. 
.66,M26623.1:8. 
.73,M26626.1:8. 
107) " 



.88, 
127, 
103, 



M26627.1:8. . 84 , M26628 . 1 : 8 
/notes "common acute lyi^phoblastic antigen" 
/db xref «"6DB;600-120-190" 



ORIGIN 



1 mgksesqmdi tdintp)cpkk kqrwtpleis Isvlvlllti iavtmialya tyddgickss 
61 dciksaarli qnmdattepc tdffkyacgg wlkrnvipet ssrygnfdil rdelewlkd 
121 vlqepktedi vavqkakaly rscinesaid srggepllkl Ipdiygwpva tenvreqkyga 
181 swtaekaiaq Inskygkkvl inlfvgtddk nsvnhvihid qprlglpsrd yyectgiyke 
241 actayvdfmi svarlirqee rlpidenqla lemnkvmele keianatakp edrndpmlly 
301 n)antlaqiqn nfisleingkp fswlnftnei mstvnisitn eedvwyape yltklkpilt 
361 kysardlqnl mswrfimdlv sslsrtykes rnafrkalyg ttsetatwrr canyvngnme 
421 navgrlyvea afages)chw edliaqirev fiqtlddltw mdaetkkrae ekalaikeri 
481 gypddivsnd nklnneylel nykedeyfen iiqnlkfsqs kqlkklrekv dkdewisgaa 
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541 wnafyssgr nqivfpagil qppffsaqqs nslnyggigm vigheithgf ddngrnfnkd 
601 gdlvdwwtqq sasnfkeqsq cmvyqygnfs wdlaggqhln gintlgenia dngglgqayr 
661 ayqnyikkng eekllpgldl nhkglfflnf aqvwcgtyrp eyavnsiktd vhspgnfrii 
721 gtlqnsaefs eafhcrknsy mnpekkcrvw 

// 
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Protein 

Translations of Life 



Display Settings: GenPept 

unnamed protein product [Homo sapiens] 

GenBank: CAA30157.1 



Go to: 
LOCUS 

DEFINITION 

ACCESSION 

VERSION 

DBSOURCE 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
• TITLE 

JOURNAL 
PDBMBD 
FEATURES 

source 



CAA30157 

unnamed protein product 
CAA30157 

CAA30157.1 GI:34758 

embl accession X07166.1 



(human) 



74 3 aa 

(Homo sapiens] . 



linear 



PRI 12 -SEP- 1993 



Protein 



Region 



Region 



Site 



Site 



CDS 



ORIGIN 



Homo sapiens 

Homo sapiens 

Bukaryota; Metazoa,- Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; 
Catarrhini; Hominidae; Homo. 
1 (residues 1 to 743} 

Malfroy, B., Kuang,H.j., Seeburg, P.H. « Mason, A. J. and Schof ield, P.R. 
Molecular cloning and eunino acid sequence of hirnian enkephalinase 

(neutral endopeptidase) 

FEES Lett. 229 (1), 206-210 (1988) 

3162217 

Location/(}iaali f iers 
1. .743 

/organism* "Homo sapiens" 
/db_xref ="taxon ; 9606 " 
/clones "lambda h7" 
/tissue_type= "placenta" 
/clone_lib>"lainbda gtlO" 

1. .743 

/name** " unnamed protein product" 
65. .743 

/ r e g i on_name = " PepO " 

/note =" predicted metalloendopeptidase [Posttranslational 
modification, protein turnover, chaperones] ; C063590" 
/db xref-"CDD; 33390 " . 
72. .741 

/region_name="M13 " 

/noteo "Peptidase family M13 includes neprilysin, 
endothel in -converting enzyme I; cd08662" 

/db xref ="CDD: 189000 " 

order (536. . 537 , 574 , 577 . . 578 , 581 , 640 , 683 . .684, 705, 711) 
/site_type="active" . 
/db xref ="CDD; 189000 " 
order (577,581, 640) 
/si t e_ t ype » " ot he r " 
/note="Zn binding site" 
/db_xre f = " CDD : 189000 " 
1. .743 

/ coded_by = "X07166.1:18 
/note="en)cephalinase (AA 
/db_xref="GDB: 120190" 
/db_xre £ - "GOA : P08473 " 
/ db_xr e f = " HGNC : 7154 " 
/ db_xre f = • In te r Pro : IPR000718 " 
/db'xre f » • In te rPro : IPR008753 " 
/db_xre f= " Inter Pro : IPR018497" 
/db xref="PDB; 1DL9 " 
/db_xre f » " PDB : idmt " 
/db_xr e f - • PDB : IQVD " 
/db xref »"PDB; IRIH " 
/db xref="PDB; IRII " 
/db xref »"PDB; IRIJ " 
/db_xre f = " PDB : 1Y8J " 
/db_xre f = " PDB : 2QPJ " 

/db_xre f = " Uni Pr otKB/Swi ss - Pro t : P08473 " 



2249" 
1-743) " 



// 



1 mditdintpic p)dc)cqrwtpl eislsvlvll Itiiavtmia lyatyddgic kssdciksaa 
61 rliqnmdatt epctdffkya cggwlkrnvi petssrygnf dilrdelew Ikdvlqepkt 
121 edivavqkak alyrscines aidsrggepl Ikllpdiygw pvatemreqk ygaswtaeka 
181 iaqlnskygk kvlinlfvgt ddknsvnhvi hidqprlglp srdyyectgi ykeactayvd 
241 fmisvarlir qeerlpiden qlaletnnkvm elekeianat akpedmdpm llynkmtlaq 
301 iqnnfslein gkpfswlnft neimstvnis itneedwvy apeyltklkp iltkysardl 
361 qnlmswrfim dlvsslsrty kesmafrka lygttsetat wrrcanyvng nmenavgrly 
421 veaafagesk hwedliaqi revfiqtldd Itwradaetkk raeekalaik erigypddiv 
481 sndnklnney lelnykedey feniiqnlkf sqskqlkklr ekvdkdewis gaawnafys 
541 sgrnqivfpa gilqppffsa qqsnslnygg igmvigheit hgfddngrnf nkdgdlvdww 
601 tqqsasnfke qsqcmvyqyg nfswdlaggq hlngintlge niadngglgq ayrayqnyik 
661 luigeekllpg Idlnhkqlff Infaqvwcgt yrpeyavnsi ktdvhspgnf riigtlqnsa 
721 efseafhcrk nsymnpekkc rvw 
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Q61391 (NEP_MOUSE) * Reviewed. UniProtKB/Swiss-Prot 

Last modified March 8, 2011, Version 101. 



Names and origin 


rrotein names 


1 

rK&COfufn&nO&Q natuB, 

Neprilysin 

EC=3.4.24.11 
Alternative narnefs)' 
1 Atrlopeptidase 
Enkephalinase 
Neutral endopeptidase 24. 1 1 

1 Short name=NEP 

Short name=Neutrai endopeptidase 

Skin fibroblast elastase 

Short name=SFE 

CD_antlgen=CD10 


Gene names 


Name:Mme 


Liryanism 


mus muscuius \iviouse/ 


Taxonomic identifier 


10090 [NCdIj 


Taxonomlc lineage 


Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > 
Euteleostomi > Mammalia > Eutheria > Euarchontoglires > Glires 

> Rodentia > Sciurognathi > Muroidea > Muridae > Murinae > Mus 

> Mus 


Protein attributes 




Sequence length 


750 AA. 


Sequence status 


Complete. 


Sequence processing 

■ 


The displayed sequence is further processed into a mature 
form. 


Protein existence 


Evidence at protein level. 


General annotation (Comments) 


Function ! 

- 

i 

Catalytic activity 


Thennolysin-like specificity, but is almost confined on acting on 
polypeptides of up to 30 amino acids. Biologically important in 
the destruction of opioid peptides such as Met- and Leu- 
enkephalins by cleavage of a Gly-Phe bond. Able to cleave 
angiotensin-1, angiotensin-2 and angiotensin 1-9l By similarity |, 
Involved in the degradation of atrial natriuretic factor (ANF). 
Displays UV-inducible elastase activity toward skin preelastic 

sanH Alflfttir flht^rs. 1 Ref.6 j 


Preferential cleavage of polypeptides between hydrophobic 
residues, particularly with Phe or Tyr at P1'. 


Cofactor 


Binds 1 zinc ion per subunit. 


Subcellular location 


Cell membrane; Single-pass type II membrane protein. 


Sequence similarities 


Belongs to the peptidase M1 3 family. 



http://www.uniprot.org/uniprot/Q6 1 391 
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Ontologies 



Keywords 



Cellular component 


Cell membrane 

Membrane 


Domain 


Signal-anchor 

Transmembrane 

Transmembrane helix 


Ligand 


Metal-binding 
Zinc 


Molecular function 


Hydrolase 

Metalloprotease 

Protease 


PTM 


Disulfide bond 
Glycoprotein 


Technical term 
Gene Ontology (GO) 

lift « 

Cellular component 


3D-structure 

Direct protein sequencing 
integral to membrane 

Inferred from electronic annotation. Source: UniProtKB-KW 
plasma membrane 

Inferred from electronic annotation. Source: UniProtKB-SubCell 


Molecular function 


metal ion binding 

Inferred from electronic annotation. Source: UniProtKB-KW 

metalloendopeptidase activity 

Inferred from electronic annotation. Source: InterPro 



i 

Complete GO annotation,.. 



Sequence annotation (Features) 



Mol< 

□ 


Feature key 

^cule processing 

Initiator methionine 


1 

Position(s) 
1 


1 

Length 
1 


Description 
Removed 

1 By similarity 


Graphical view 

1 


Feature ideij 


itif 


Reg 

m 


Chain 
Ions 

Topological domain 


2-750 
2-28 


749 
27 


Neprilysin 
Cytoplasmic 

1 Potential j 


1 


PRO OOOOC 


7E 


H 


Transmembrane 


29-51 


23 


Helical; 
Signal- 
anchor for 
type II 


II 
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membrane 

protein; 

I Potential I 



Topological domain . 



52 - 750 



□ 



Motif 



16-23 ; 



699 



8 



Extracell ular 

I Potentiat | 

Stop- 
transfer 
sequence 

I Potential j 



4 



Sites 



Active site 



585 



1 



I By similarity | 



Active site 



651 



1 



Proton 
donor 

I By similarity 



Metal binding 



584 



1 



Zinc; 
catalytic 

I By similarity 



Metal binding 



588 



1 



Zinc; 
catalytic 

I By similarity 



Metal binding 



647 



Zinc; 
catalytic 



□ 



— «. 



Binding site 



103 



By similarity 



1 



Substrate 
carboxyl 

I By simriarity 



Amino acid modifications 



Gly cosy lat ion 



Glycosylation 



145 



211 



1 



1 



N-linked 
(GlcNA c. 



N-linked 
(GlcNAc 

I Potential j 



4- 



Glycosylation 



285 



N-linked 
(GlcNA c. 

I Ref.5 I 



Glycosylation 



311 



1 



I 



□ ) Glycosylation 



317 



1 



□ 



N-linked 
(GlcNA c... 

I Ref.5 I 

N-linked 
(GlcNAc. 

i I Ref.S I 



Glycosylation 



325 



1 N-linked 
(GlcNAc... 

I By similarity" 



http://www.uniprot.org/uniprot/Q6 1 39 1 
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Glycosylation 



628 : 



1 ! N-linked 



□ 



Disulfide bond 



I 57 ^ 62 



Disulfide bond 
Disulfide bond 



80 w 735 



Disulfide bond 



Disulfide bond 



Disulfide bond 



Experimental info 



88 ^ 695 



-f 



143.^411 



234 ^ 242 



621 ^ 747 



(GlcNAc.) 

I By simiiarity j 



By simitarity 



I 



-+ 

I 
I 



I By similarity 



By similarity 



By simiiarity 



By similarity 



I By similarity j 



I 



I 



-4- 



I I 



Sequence conflict 



230 



1 



D G in 
AAA37 386. 

{ Ref.1 j 



Secondary structure 



I Helix I Strand | Turn 



Details. 



Sequences 



Sequence Length Mass (Da) 



Q61391 [UniParc]. FASTA 750 85,702 

Last modified January 23, 2007. Version 3. 
Checksum: 1FC39A971D98F6FE 



10 20 30 40 50 60 j 

i M6RSESQMDI TDINAPKPKK KQRWTPLEIS LSVLVLLLTI IAVTT1IALYA TYDD6ICKSS ' 

; 70 8_0 90 100 110 120 j 

; DCIKSAARLI QNMDASVEPC TDFFKYACGG WLKRNVIPET SSRYSNFDIL RDELEVILKD 

■ 130 140 150 160 170 180 j 

j VLQEPKTEDI VAVQKAKTLY RSCINESAID SRGGQPLLKL LPDIYGWPVA SDNWDQTYGT 1 

190 200 210 220 230 240 

SWTAEKSIAQ LNSKYGKKVL INFFVGTDDK NSTQHIIHFD QPRLGLPSRD YYECTGIYKE 

25£ 260 270 280^ 290 300 

ACTAYVDFMI SVARLIRQEQ SLPIDENQLS LEMNKVMELE KEIANATTKP EDRNDPMLLY 

310 32^ 330 340. 350 360 

NK^f^IAKLQN NFSLEVNGKS FSWSNFTNEI MSTVNINIQN EEEVWYAPE YLTKLKPILT 

4 I 

370 380 390 400_ 410 420 

KYSPRDLQNL MSWRFIMDLV SSLSRNYKES RNAFRKALYG TTSETATWRR CANYVNGNME 

430 440 450 460 470 480 

NAVGRLYVEA AFA6ESKHW EDLIAQIREV FIQTLDDLTW MDAETKKKAE EKALAIKERI 

490 500 510 520 530 540 

GYPDDIISNE NKLNNEYLEIi NYREDEYFEN IIQNLKFSQS KQLKKLREKV DKDEWISGAA 

550 560 570 580 590 600 

WNAFYSSGR NQIVFPAGIL QPPFFSAQQS NSI^NYGGIGM VIGHEITHGF DDNGRNFNKD 

610 620 630 640 650 660 
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ODLVDKWTQQ SANNFKDQSQ CMVYQYGNFS 

I 670 680 690 

1 AYQNYVKKNG EEKLLPGLDL NHKQLFFLNF 

! 730 740 750 

GTLQNSAEFA DAFHCRKNSY MNPERKCRVW 

I 

« Hide 



WDIiAGGQHLN GINTLGENIA DNGGIGQAYR 

700 710 720 

AQVWCGTYRP EYAVNSIKTD VHSPGNFRII 



References 



^ « Hide 'large scale' references 

[1] 



"Murine common acute lymphoblastic leukemia antigen (CD10 neutral endopeptidase 
24.11). Molecular characterization, chromosomal localization, and modeling of the active 
site." 

Chen C.Y., Salles G.. Seldin M.F., Kister A.E., Reinher E.L., Shipp M.A. 
J. Immunol. 148:2817-2825(1992) [PubMed: 1374101] [Abstract] 
Cited for : NUCLEOTIDE SEQUENCE [MRNAJ. 
Strain: BALB/c. 



[2] > "The transcriptional landscape of the mammalian genome." 

Carninci P., Kasukawa T., Katayama S., Gough J., Frith M.C., Maeda N., Oyama R., Ravasi 
T., Lenhard B., Wells C, Kodzius R., Shimokawa K., BajicV.B., Brenner S.E., Batalov S.. 



i 



I Forrest A.R., Zavolan M., Davis M.J. H Hayashizaki Y. 
! Science 309:1559-1563(2005) [PubMed: 16141072] [Abstract] 
j Cited for : NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA]. 
j Strain : C57BU6J. 
Tissue : Epididymis and Testis. 



[3] { "The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian 
: Gene Collection (MGC)." 

I The MGC Project Team 

j Genome Res. 14:2121-2127(2004) [PubMed: 15489334] [Abstract] 
Cited for : NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA], 
Strain: C57BL/6J and FVB/N. 



} Tissue : Embryonic germ cell and Mammary tumor. 



[4] ; Lubec G., Sunyer B,, Chen W.-Q. 
Submitted (JAN-2009) to UniProtKB 

Cited for : PROTEIN SEQUENCE OF 473-479, MASS SPECTROMETRY. 
Strain: OF1. 
Tissue: Hippocampus. 



[5] 



i 



"Mass-spectrometric identification and relative quantification of N-linked cell surface 
glycoproteins." 

Wollscheid B.. Bausch-Fluck D., Henderson C, O'Brien R., Bibel M.. Schiess R., Aebersold R., 
Watts J.D. 

Nat. Biotechnol. 27:378-386(2009) [PubMed: 19349973] [Abstract] 

Cited for : GLYCOSYLATION [LARGE SCALE ANALYSIS] AT ASN-145; ASN-285; ASN-31 1 
AND ASN-31 7, MASS SPECTROMETRY. 



[6] ! "Neprilysin is identical to skin fibroblast elastase: its role in skin aging and UV 
responses." 

' Morisaki N., Moriwaki S.. Sugiyama-Nakagiri Y., Haketa K., Takema Y.. Imokawa G. 
J. Biol. Chem. 285:39819-39827(2010) [PubMed: 20876573] [Abstract] 
Cited for : IDENTIFICATION AS SKIN FIBROBLAST ELASTASE, FUNCTION. 

^ Additional computationally mapped references. 
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C ro s s -ref e re n c es 



Sequence databases 



@EMBL 
® GenBank 
® DDBJ 


M81591 mRNA. Translation: AAA37386.1. 
AK031446 mRNA. Translation: BAC27410.1. 
AK033824 mRNA. Translation: BAC28487.1. 
BC034092 mRNA. Translation: AAH34092.1. 
BC066840 mRNA. Translation: AAH66840.1. 


IPI 


IPI00461861. 


RefSeq 


NP__032630.2. NM_008604.3. 


UniGene 


Mm.296022. 




3D structure databases 






@PDBe 

® RCSB PDB 

® PDBj 


Entry Method Resolution Chain Positions 

(A) 

2YVC X-ray 3.20 D/E/F 2-23 


PDBsum 
[»] 


ProteinMbdelPortat 


Q61391. 


SMR 


Q61391. Positions 65-750. 


ModBase 


Search... 




Protein-protein interaction databases 




STRING 


Q61391. 




Protein family/group databases 






MEROPS 


M1 3.001. 




PTIVI databases 






PhosphoSite 


Q61391. 




Proteomic databases 






PRIDE 


Q61391. 




Genome annotation databases 






Ensembl 


ENSMUST00000029400: ENSMUSP00000029400: 
ENSMUSG00000027820. 

ENSMUST000001 07889; ENSMUSP000001 03521 ; 
ENSMUSG00000027820. 




GenelD 


17380. 


KEGG 


mmu: 17380. 


UGSC 


uc008pjp.1. mouse. 



Organism-specific databases 

CTD 



17380. 



1 



MGI 



; MGI:97004. Mme. 
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Phylogenomic databases 



eggNOG 


roNOG08854. 


Fl Vin/O u. 1 N v./ IVI 




HOVERGEN 


HBG006554. 


InParanoid 


Q61391. 


OMA 


TYRPFYA 


OrthoDB 


EOG4XWFXB. 


PhylomeDB 


061391. 


Enzyme and pathway databases 




BRENDA 


3.4.24.11. 244. 


Gene expression databases 




ArrayExpress 


061 391. 


Bgee 


061391. 

1 


CteanEx 


MM MME. 


Genevestigator 


061391. 


GermOnline 


ENSMUSG00000027820. Mus musculus. 


Family and domain databases 




interKro 


iKKuoo/io. Peptidase Mio. 
IPR01 8497. Peptidase Ml 3 C. 
IPR008753. Peptidase_M13_N. 
[Graphical view] 


PANTHER 


PTHR11733. Peptidase_M13. 1 hit. 


KTam 


^1-0140 1. Peptidase Mio. i nit. 
PF06649. Peptidase_M13_N. 1 hit. 
[Graphical view] 


PRINTS 


PR00786. NEPRILYSIN. 


PROSITE 

■ 

. „ ..-^ ' — — - — - — - — • *- 


PS00142. ZINC^PROTEASE. 1 hit. 

[Graphical view] 


ProtoNet 


Search... 


Other Resources 




NextBio 


291980. 


SOURCE 


Search 


Entry information 




Entry name 


NEP_MOUSE 


Accession 


Primary (citable) accession number: Q61391 
Secondary accession number(s): 06NXX5, 08K251 


Entry history 


Integrated into November 1, 1997 

UniProtKB/Swiss- 

Prot: 
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Entry status 



Annotation program 



Relevant documents 



Last sequence January 23, 2007 
update: 

Last modified: March 8, 201 1 

This is version 101 of the entry and version 3 of the sequence. 
[Complete history] 



Reviewed (UniProtKB/Swiss-Prot) 



Chordata Protein Annotation Program 



MGD cross-references 

Mouse Genome Database (MGD) cross-references in UniProtKB/Swiss-Prot 



PDB cross-references 

Index of Protein Data Bank (PDB) cross-references 



Peptidase families 

Classification of peptidase families and list of entries 



SIMILARITY comments 

Index of protein domains and families 



©2002-2011 UniProt Consortium | License & Disclaimer | Contact 



•I* 



EMBL-EBI jj| ;|: i 




mm 
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P08049 (NEP_RABIT)* Reviewed, UniProtKB/Swiss-Prot 

Last modified March 8. 2011. Version 95. 



Names and origin 


Protein names 


HBComnienoeo nQtne. 
1 Neprilysin 
EC=3.4.24.11 
Alternative name(s): 
Atriopeptidase 
Enkephalinase 
Neutral endopeptidase 24.11 

Short name=NEP 

Short nanie=Neutral endopeptidase 

Skin fibroblast elastase 

Short name=SFE 

CD_antigen=CD1 0 


Gene names j Name:MME 

Organism j Oryctolagus cuniculus (Rabbit) 


Taxonomic identifier 
Taxonomic lineage 

Protein attributes 


9986 [NCBl] 

Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > 
Euteleostomi > Mammalia > Eutheria > Euarchontoglires > Glires 
> Lagomorpha > Leporidae > Oryctolagus 


Sequence length | 750 AA. 


Sequence status 


Complete. 


Sequence processing 


The displayed sequence is further processed into a mature 
form. 


Protein existence | Evidence at protein level. 
General annotation (Comments) 


Function 

1 

. 


Thermolysin-like specificity, but is almost confined on acting on 
polypeptides of up to 30 amino acids. Biologically important in 
the destruction of opioid peptides such as Met- and Leu- 
enkephaiins by cleavage of a Gly-Phe bond. Able to cleave 
angiotensin-1. angiotensin-2 and angiotensin 1-9l BysimHamy j. 
Involved in the degradation of atrial natriuretic factor (ANF). 
Displays UV-inducible elastase activity toward skin preelastic 

anri ftlaRtIr fihftrR 1 By similarity I 


Catalytic activity 


Preferential cleavage of polypeptides between hydrophobic 
residues, particularly with Phe or Tyr at PV. 


Cofactor 


Binds 1 zinc ion per subunit. 


Subcellular location 


Cell membrane: Single-pass type II membrane protein. 


Sequence similarities 


Belongs to the peptidase Ml 3 family. 
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Names and origin 


rrotein names 


Hecommencfeci name: 
Neprilysin 
EC=3.4.24.11 

Atriopeptidase 
Enkephalinase 
Neutral endopeptidase 24.11 

Short name^NEP 

Short name=Neutrat endopeptidase 

Skin fibroblast elastase 

1 Short name=SFE 

CD_antigen=CD10 


Gene names 


Name:Mme 


Lirganism 


Kaitus norvegicus (Kat) 


Taxonomic identifier 


101 16 [NCBIJ 


Taxonomic lineage 


Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > 
Euteleostomi > Mammalia > Eutheria > Euarchontoglires > Glires 
> Rodentia > Sciurognathi > Muroidea > Muridae > Murinae > 
Rattus 


Protein attributes 




Sequence length 


750 AA. 


Sequence status 


Complete. 


Sequence processing 


The displayed sequence is further processed into a mature 
form. 


Protein existence 1 


Evidence at protein level. 


General annotation (Comments) 


Function 1 

i 
1 
1 

> 

1 

! 


Themnolysin-like specificity, but is almost confined on acting on 
polypeptides of up to 30 amino acids. Biologically important in 
the destruction of opioid peptides such as Met- and Leu- 
enkephalins by cleavage of a Gly-Phe bond. Able to cleave 
angiotensin-1 , angiotensin-2 and angiotensin 1-9. Displays UV- 
inducible elastase activity toward skin preelastic and elastic 
fibers 1 By similarity |. Involved in the degradation of atrial 
natriuretic factor (ANF). 


Catalytic activity 


Preferential cleavage of polypeptides between hydrophobic 
residues, particularly with Phe or Tyr at PI'. 


Cofactor 


Binds 1 zinc ion per subunit. 


Enzyme regulation \ 


Inhibited in a dose dependent manner by sialorphin. 


Subcellular location ! 


Cell membrane; Single-pass type II membrane protein. 
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Sequence similarities 


Belongs to the peptidase M13 family. 
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Metalloprotease 

Protease 


n 1 IVI 


ulsuiTiae Dona 
Glycoprotein 


Technical term 


Direct protein sequencing 


Gene Ontology (GO) 




Biological process 


peptide metabolic process 

Inferred from mutant phenotype. Source: RGD 


■ 


proteolysis 

Inferred from direct assay. Source: RGD 


Cellular component 


integral to membrane 

Inferred from electronic annotation. Source: UniProtKB-KW 
membrane fraction 

Inferred from direct assay. Source: RGD 
plasma membrane 

Inferred from mutant phenotype. Source: RGD 


Molecular function 


metal ion binding 

Inferred from electronic annotation. Source: UniProtKB-KW 

metalloendopeptidase activity 

Inferred from electronic annotation. Source: InterPro 

peptide binding 

Inferred from direct assay. Source: RGD 
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Entry status 
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P19621 (NEP_PIG)* Reviewed, UniProtKB/Swiss-Prot 

Last modified January 11, 2011. Version 64. 



Names and origin 



Protein names 



1 



Recommended name: 

Neprilysin 
EC=3.4.24.11 
Alternative name(s): 

Atriopeptidase 

Enkephalinase 

Neutral endopeptidase 24.11 

Short name=NEP 

Short name=Neutral endopeptidase 

Sl<ln fibroblast elastase 

Short name=SFE 

CD_antigen=CD10 



Gene names 



Name:MME 



Organism 



Sus scrofa (Pig) 



Taxonomic identifier 



9823 [NCBI] 



Taxonomic lineage 



Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > 
Euteleostomi > Mammalia > Eutheria > Laurasiatheria > 
Cetartiodactyla > Suina > Suldae > Sus 



Protein attributes 



Sequence length 



26 AA. 



Sequence status 



Fragment. 



Protein existence 



Evidence at protein level. 



General annotation (Comments) 



Function 



Catalytic activity 



Cofactor 



Subcellular location 



Themnolysin-like specificity, but is almost confined on acting on 
polypeptides of up to 30 amino acids. Biologically important in 
the destruction of opioid peptides such as Met- and Leu- 
enkephalins by cleavage of a Gly-Phe bond. Able to cleave 
anglotensin-1 , angiotensin-2 and angiotensin 1-9 i By simiianty i 
Involved in the degradation of atrial natriuretic factor (ANF). 
Displays UV-indu cible elasta se activity toward skin preelastic 
and elastic fibers i By similarity i 



Preferential cleavage of polypeptides between hydrophobic 
residues, particularly with Phe or Tyr at P1'. 



Binds 1 zinc ion per subunit. 



Cell membrane; Single-pass type 11 membrane protein. 



Sequence similarities 



Belongs to the peptidase Ml 3 family. 
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Ontologies 



Keywords 



Cellular component 


Cell membrane 
Membrane 


Domain 


Transmembrane 


Ligand 


Zinc 


Molecular function 


Hydrolase 

Metalloprotease 

Protease 


Technical term 

Gene Ontology (GO) 

Cellular component 


Direct protein sequencing 
integral to membrane 

Inferred from electronic annotation. Source: UniProtKB-KW 
plasma membrane 

Inferred from electronic annotation. Source: UniProtKB-SubCell 


Molecular function 
Complete GO annotation... 


metallopeptidase activity 

Inferred from electronic annotation. Source: UniProtKB-KW 
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PRO_0000078j21 
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m P19621 [UniParc]. 



FASTA 



26 



3.039 



Last modified February 1, 1991. Version 1 
Checksum: 3848804A9DDF7DEF 



10 20 
PKPKKKQRWT PLEISLEVLV LVLVXI 



« Hide 



References 



[1] 



"The N-terminal amino acid sequence of pig Icidney endopeptidase-24.1 1 shows 
homology with pro-sucrase-isomaltase." 

Fulcher LS., Pappin D.J.C., Kenny AJ. 

Biochem. J. 240:305-308(1986) [PubMed: 3548708] [Abstract] 
Cited for : PROTEIN SEQUENCE. 

Tissue : Kidney. 



Cross-references 


Sequence databases 




PIR 


A26070. 


3D structure databases 




ModBase 


1 Search... 


Protein family/group databases 




MEROPS 


M1 3.001. 


Enzyme and pathway databases 




BRENDA 


3.4.24.11. 249. 

1 


Family and domain databases 


w 


PROSITE 


PS00142. ZINC_PROTEASE. Partial match. 
[Graphical view] 


ProtoNet 


Search... 



Entry information 



Entry name 



NEP PIG 



Accession 



{ Primary (citable) accession number: Pi 9621 



http://www.uniprot.org/uniprot/Pl 962 1 



4/12/2011 



Neprilysin - Sus scrofa (Pi 




Page 4 of 4 



Entry history 


Integrated into February 1, 1991 

UniProtKB/Swiss- 

Prot: 

Last sequence February 1, 1991 
update: 

Last modified: January 1 1 , 201 1 

This IS version 64 of the entry and version 1 of the sequence. 
[Complete history] 


Entry status 
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P09958 (FURIN_HUMAN)* Reviewed. UniProtKB/Swiss-Prot 

Last modified April 5, 2011. Version 143. 



Names and origin 


Protein names 


j Recommended name: 
1 Furin 

1 EC=3.4.21.75 
Alternative name(s): 

Dibasic-processing enzyme 

Paired basic amino acid residue-cleaving enzyme 

Short name=PACE 


Gene names 


1 

Name: FURIN 
SynonymsrFUR. PACE, PCSK3 


Organism 


Homo sapiens (Human) [Complete proteome] 


Taxonomic identifier 


9606 [NCBI] 


Taxonomic lineage 


Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > 
Euteleostomi > Mammalia > Euthena > Euarchontoglires > 
Primates > Haplorrhini > Catarrhini > Hominidae > Homo 


Protein attributes 




Sequence length 


794 AA. 


Sequence status 


Complete. 


Sequence processing 


The displayed sequence is further processed into a mature 
form. 


Protein existence i 


Evidence at protein level. 



General annotation (Comments) 



Function 



j 



Catalytic activity 



Furin is likely to represent the ubiquitous endoprotease activity 
within constitutive secretory pathw ays an d capable of cleavage 

at the RX(K/R)R consensus motif. I ^^^-^ 



Release of mature proteins from their proproteins by cleavage 
of -Arg-Xaa-Yaa-Arg-|-Zaa- bonds, where Xaa can be any 
amino acid and Yaa is Arg or Lys. Releases albumin, 
complement component C3 and vWF from their respective 
precursors. 



Cofactor 



1 Calcium. 



Enzyme regulation 



Subunit structure 



Could be inhibited by the not secondly cleaved propeptide. 



Subcellular location 



4- 



Interacts with FLNA I By simiiarity 1 Binds to PACS1 which 
mediate s TGN localization and connection to clathrin adapters. 

I Ref.12 I 



Golgi apparatus > trans-Golgi network membrane; Single-pass 
type I membrane protein. Cell membrane; Single-pass type I 
membrane protein. Note: Shuttles between the trans-Golgi 
network and the cell surface. Propeptide cleavage is a 
prerequisite for exit of furin molecules out of the endoplasmic 
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reticulum (ER). A second cleavage within the propeptide occurs 
in the trans Golgi network (TGN), followed by the release of the 
propeptide and the activation of furin. 


Tissue specificity 


Seems to be expressed ubiquitously. 


Domain 


Contains a cytoplasmic domain responsible for its TGN 
localization and recycling from the cell surface. 1 Re^^o | 


PosMranslationai modification 


The inhibition peptide, which plays the role of an intramolecular 
chaperone, is autocatalytically removed in the endoplasmic 
reticulum (ER) and remains non-covalently bound to furin as a 
potent autoinhibitor Following transport to the trans Golgi, a 
second cleavage within the inhibition propeptide results in 
1 propeptide dissociation and furin activation. 

1 

I Phosphorylation is required for TGN localization of the 
1 enaoproiease. in vivo, exists as ui-, mono- ana non- 
j phosphorylated forms. 1 R®^''^ 




j — ^ — . , ^ „ 

' Dc«iunys 10 ine pcpiiQa5>6 00 Tariiiiy. ruiin oUwiciiTiiiy. 

\ Contains 1 homo B/P domain. 

t 
t 


Ontologies 




Keywords 




Cellular component 


Cell membrane 
Golgi apparatus 
Membrane 


Coding sequence diversity 


Polymorphism 


Domain 


Signal 

Transmembrane 
Transmembrane helix 


Ligand | 


Calcium 
Metal-binding 


Molecular function 

J , 1 ,„„, , ,„ 


Hydrolase 
Protease 
Serine protease 


PTM ] 

1 


Autocatalytic cleavage 

Cleavage on pair of basic residues 

Disulfide bond 

Glycoprotein 

Phosphoprotein 

Zymogen 


Technical term 


Complete proteome 



Gene Ontology (GO) 

Biological process j Notch signaling pathway 

j Traceable author statement. Source: Reactome 

! cell proliferation 

' Inferred from mutant phenotype. Source: BHF-UCL 

negative regulation of low-density lipoprotein particle receptor 
catabolic process 
'. Inferred from direct assay. Source: HGNC 
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negative regulation of transforming growth factor-beta1 
production 

Inferred from mutant phenotype. Source: BHF-UCL 

nerve growth factor processing 

Inferred from experiment. Source: Reactome 

nen/e growth factor production 

Inferred from direct assay. Source: BHF-UCL 

nerve growth factor receptor signaling pathway 
Traceable author statement. Source: Reactome 

peptide biosynthetic process 

Inferred from direct assay. Source: BHF-UCL 

peptidyl-glutamic acid carboxylation 
Traceable author statement. Source: Reactome 

positive regulation of membrane protein ectodomain proteolysis 
I Inferred by curator. Source: BHF-UCL 

> 

post-translational protein modification 
Traceable author statement. Source: Reactome 

secretion by cell 

Inferred from direct assay. Source: BHF-UCL 

I signal peptide processing 

j Inferred from direct assay. Source: HGNC 

transforming growth factor beta receptor signaling pathway 
Traceable author statement. Source: Reactome 

I viral assembly, maturation, egress, and release 
j Inferred from expression pattern. Source: BHF-UCL 



Cellular component 



Golgi lumen 

Inferred from experiment. Source: Reactome 
Golgi membrane 

Traceable author statement. Source: Reactome 
cell surface 

Inferred from direct assay. Source: BHF-UCL 
integral to membrane 

Inferred from electronic annotation. Source: UniProtKB-KW 
membrane raft 

Inferred from direct assay. Source: BHF-UCL 
plasma membrane 

Inferred from electronic annotation. Source: UniProtKB-SubCell 
trans-Golgi network 

Inferred from direct assay. Source: BHF-UCL 

trans-GoIgi network transport vesicle 
Inferred from direct assay. Source: MGI 



Molecular function 



metal ion binding 

Inferred from electronic annotation. Source: UniProtKB-KW 

nerve growth factor binding 

Inferred from direct assay. Source: BHF-UCL 

peptide binding 

Inferred from direct assay. Source: BHF-UCL 
protease binding 
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Inferred from physical interaction. Source: BHF-UCL 

serine-type endopeptidase activity 

Inferred from direct assay. Source: BHF-UCL 

j serine-type endopeptidase inhibitor activity 
Inferred from direct assay. Source: BHF-UCL 

Complete GO annotation... 



Sequence annotation (Features) 



Feature key 



Molecule processing 



I Position(s) j Length | Description 



Graphical view 



Featur 



Signal peptide 



Propeptide 
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24 
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25-107 



1 08 - 794 
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-4" 
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Regions 
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4. 



i 759-762 



23 I Helical; I Potential I 

4 I Cell surface signal 



II 



Motif 



Motif 
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3 ! Cell attac hment site 

I Potential j 



773 - 779 I 



7 I Trans Golgi network 
signal 




Compositional bias j 556 - 705 1 50 Cys-rich 



I I 



Sites 



□ 



Active site 



Active site 



Active site 



Metal binding 



Metal binding 



153 



194 



368 



— 



115 



162 



Charge rela y system 

I By similarity | 



Charge rela y system 

I By similarity j 



I 



Charge rela y system 

I By similarity | 



Calcium 1 

I By similarity | 



Calcium 1 

I By similarity | 



Metal binding 



208 



Calcium 1 

I By similarity | 



Metal binding 



258 



Calcium 2 

I By similarity | 



Metal binding 



301 



Metal binding 



331 



Ca lcium 2 

I By similarity 

Calcium 2 
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m 


Site 


75-76 


2 


Cleavage, second; 
by autolysis 


1 




E 


Site 


107-108 


2 


Cleavage, first; by 
autolysis 


i ' 





Amino acid modifications 



□ 



Modified residue 



773 



1 



Pho sphoser ine; by 

CK7 \ Ref.lll 



Modified residue 



775 



1 



Pho sphoser ine; by 

C.K7 \ Ref.ll I 



□ 



Glycosylation 



387 



1 



N-linked (GlcNAc.) 



Potential 1 



□ 



Glycosylation 



440 



1 i N-linked (GlcNAc.) 

Potential | 



□ Glycosylation 



□ I Disulfide bond 



553 



211 ^360 



1 



Disulfide bond 



Disulfide bond 

Natural variations 
Natural variant 



303 ^ 333 



450 474 



N-linked (GlcNAc.) 

Potentiail 



1 By similarity j 



By similarity 



I I 

ii 



By similarity | 



Natural variant 



43 I 



1 



A — V. 

[dbSNP:rs1 6944971] 



547 I 



1 I W — » R in cell line 

LoVo; does not 

undergo 

autocatalytic 

activation and is not 

transported to the 

Golgi a pparatus. 
1 Ref^a I 



Sequences 



VARJ 

i 



VAR ( 



Sequence 



Length Mass (Da) 



P09958 [UniParc]. 
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OrthoDB 


1 EOG4ZW59M. 



PhylomeDB 



P09958. 



Enzyme and pathway databases 



BRENDA 



3.4.21.75. 247. 



Pathway J nteraction_DB 



glypican_3pathway. Glypican 3 network. 
hif1_tfpathway. HIF-1 -alpha transcription factor network. 
p75ntrpathway. p75(NTR)-mediated signaling. 



rWdOlUI 1 IKS 


RFACT 11061 Slanallina bv NGF 

REACT_16888. Signaling by PDGF. 
REACT 17015. Metabolism of proteins. 
REACT 299. Signaling by Notch. 
REACT_6844. Signaling by TGF beta. 


Gene expression databases 




ArrayExpress 


P09958. 


Bgee 


P09958. 


CleanEx 


HS.FURIN. 


Genevestigator 


P09958. 


GermOnline 


ENSG00000140564. Homo sapiens. 


Family and domain databases 




InterPro 


IPR006212. Furin repeat. 
IPR008979. Galactose-bd-like. 
IPR009030. Growth fac rcpt. 
IPR000209. Peptidase S8/S53. 
IPR022398. Peptidase S8/S63 AS. 
IPR01 5500. Peptidase_S8_subtilisin-rel. 
IPR009020. ProtJnh_propept, 
IPR002884. PrprotnconvertsP. 
[Graphical view] 


Gene3D 


G3DSA:3.40.50.200. Pept_S8_S53. 1 hit. 


PANTHER 


PTHR10796. SubtilSerProt. 1 hit. 


Pfam 


PF01483. P_proprotein. 1 hit. 
PF00082. Peptidase_S8. 1 hit. 
[Graphical view] 


PRINTS 


PR00723. SUBTILISIN. 


SMART 


SM00261. FU. 2 hits. 
[Graphical view] 
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SUPFAM 


SSF49785. GaLbindJike. 1 hit. 
SSF57184. Grow_fac_recept. 1 hit. 
SSF52743. Pept_S8_S53, 1 hit. 
SSF54897. ProtJnh_j)ropept. 1 hit. 


PROSITE 


PS00136. SUBTILASE ASP. 1 hit. 
PS00137. SUBTILASE HIS. 1 hit. 
PS00138. SUBTILASE_SER. 1 hit. 
[Graphical view] 


ProtoNet 


Search... 


Other Resources 




NextBio 


19422. 


PMAP-CutDB 


P09958. 
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P23188 (FURIN_MOUSE)# Reviewed, UniProtKB/Swiss-Prot 

Last modified March 8, 2011. Version 114. 



Names and origin 


Protein names 


Recommended name: 

Furin 
EC=3.4.21.75 
Alternative name(s): 

Dibasic-processing enzyme 

Paired basic amino acid residue-cleaving enzyme 

Short name=PACE 

nronormone converiase o 


Gene names 


Name: Furin 
Synonyms:Fur, PcskS 


Oraanism 


Mus musculus (Mouse) 






Taxonomic lineage 


Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > 
ciuieieosiomi > Mammalia > cumeria > cuarcnoniogiires > oiires 

> Rodentia > Sciurognathi > Muroidea > Muridae > Murinae > Mus 

> Mus 


Protein attributes 




Sequence length 


793 AA. 


Sequence status 


Complete. 


Sequence processing 

; 

: 


The displayed sequence is further processed into a mature 
form. 



Protein existence Evidence at protein level. 



General annotation (Comments) 



Function I Furin is likely to represent the ubiquitous endoprotease activity 

I within constitutive secretory pathways and capable of cleavage 
i at the RX(K/R)R consensus motif. 

Catalytic activity | Release of mature proteins from their proproteins by cleavage 

I of -Arg-Xaa-Yaa-Arg-|-Zaa- bonds, where Xaa can be any 
I amino acid and Yaa is Arg or Lys. Releases albumin, 
I complement component C3 and vWF from their respective 
j precursors. 

Cofactor I Binds 2 calcium ions per subunit. 

Enzyme regulation Could be inhibited by the not secondly cleaved propeptide. 

J Interacts with FLNA I By simiiarityl Binds to PACSI which 
i mediates TGN loca lization and connection to clathrin adapters 

I By similarity I | Ref.3 | 

' Golgi apparatus > trans-Golgi network membrane; Single-pass 
\ type I membrane protein. Cell membrane; Single-pass type I 
J membrane protein. Note: Shuttles between the trans-Golgi 



Subunit structure 



Subcellular location 
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Tissue specificity 


network and the cell surface. Propeptide cleavage is a 
prerequisite for exit of funn molecules out of the endoplasmic 
reticulum (ER). A second cleavage within the propeptide occurs 
in the trans GoIqI network (TGN). followed by the release of the 
propeptide and the activation of furin 1 similarity j. 


i Seems to be expressed ubiquitously. 


Domain 


Contains a cytoplasmic domain responsible for its TGN 
localization and recycling from the cell surface. 


Post-translationa! modification 


The inhibition peptide, which plays the role of an intramolecular 
chaperone, is autocatalytically removed in the endoplasmic 
reticulum (ER) and remains non-covalently bound to furin as a 
potent autoinhibitor. Following transport to the trans Golgi, a 
second cleavage within the inhibition propeptide results in 
propeptide dissociation and furin activation 1 similarity |. 

Phosphorylation is required for TGN localization of the 
endoprotease. In vivo, exists as dl-, mono- and non- 

phnftphnrylat^^H fprmR | By similarity 


o6C|Uenc6 simiiariii6S 

Ontologies 


oeionys lO me pepiiuase oo lomiiy. rurin suDiorniiy. 
Contains 1 homo B/P domain. 


Keywords 

Cellular component j Cell membrane 

1 Golgi apparatus 
I Membrane 


Domain 

1 


Signal 

Transmembrane 
Transmembrane helix 


! 

Ligand 

1 


Calcium 
Metal*binding 


! 

Molecular function 


Hydrolase 
Protease 
Serine protease 


PTM 


Autocatalytic cleavage 

Cleavage on pair of basic residues 

Disulfide bond 

Glycoprotein 

Phosphoprotein 

Zymogen 


Technical term i 

Gene Ontology (GO) 

Cellular component 


3D-structure 

early endosome 

Traceable author statement. Source: MGI 

endoplasmic reticulum lumen 

Traceable author statement. Source: Reactome 

integral to membrane 

Traceable author statement. Source: MGI 
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plasma membrane 
Inferred from electronic annotation. Source: UniProtKB-SubCeli 

trans-Golgi network transport vesicle membrane 
Traceable author statement. Source: MGI 



Molecular function 



metal ion binding 

Inferred from electronic annotation. Source: UniProtKB-KW 



Complete GO annotation.., 



Sequence annotation (Features) 



Feature key 



Molecule processing 



I 



Position(s) Length Description I Graphical view 



Feature iqei 





Signal peptide 


1-24 


24 


1 

Potential 


1 








Propeptide 


25-107 


83 


Inhibition 
peptide 

1 By similarity | 


1 1 


PRO_00( 


)0( 


El 


Chain 


108-793 


686 


Furin 


1 


PRO.OOC 


)0( 



Regions 





1 

1 Transmembrane 


1 715-735 

! 

■■ 
■ 


21 


Helical; 

Potential | 


11 








Region 


■ 109-445 


337 


Catalytic 


1 1 








Region 


.. 

446 - 578 


133 


P-domain 


1 1 




Region 


758 - 761 


4 


Cell surface 
signal 


1 




□ 


Motif 


498 - 500 


3 


Cell attachment 

RitP 1 Potential I 


1 




□ 


Motif 


772 - 778 


7 


Trans Golgi 
network signal 


1 








Compositional bias 

< 


556 - 705 


150 


Cys-rich 


1 ' i 1 





Sites 



□ 


Active site 


153 


1 


Charge relay 
system 


1 




□ 


Active site 


194 


1 


Charge relay 
system 


1 




□ 

„ 


Active site 


368 




Charge relay 
system 


1 




□ 


Metal binding 


115 1 


Calcium 1 


1 





□ I Metal binding 

□ Metal binding 

□ } Metal bind ing 



162 



1 



Calcium 1 



208 

258 ] 



1 i Calcium 1 



1 



Calcium 2 



i.. 
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Metal binding 


301 


1 


Calcium 2 


1 




PI 


Metal Dinaing 






oaicium z 


1 




o 


Site 


76-76 


2 


Cleavage, 
second; by 

auiuiysio 

i By simiiarity j 


1 






Site 


107-108 


2 


Cleavage, first; 
by autolysis 

} By similarity [ 


1 





Amino acid modifications 
Modified residue 



Modified residue 



□ Glycosylation 




E] Glycosylation 



Glycosylation 



C3 I Disulfide bond 
^ ^ Disulfide bond 



772 



774 



387 



440 



Phosphoserine; 
by CK2 

{ By similarity j 



Phosphoserine; 
by CK2 

I By similarity 



1 ! N-linked 
(GlcNAc.) 



1 I N-linked 
(GlcNAc.) 



553 



1 i N-linked 
(GlcNAc. ..) 

I Potential | 



f 



□ I Disulfide bond 

Experimental Info 

Sequence conflict 



211 
j 303 
i 450 



360 
333 
474 



746 



1 j M-> Vin 

j AAA37 643. 

I Ref.2 I 



Secondary structure 

1 . 

I Helix I Strand | Turn 

Details... 

Sequences 



I I 



II 



II 



I 



Sequence 



Length Mass (Da) 



P23188 [UniParc]. 

Last modified November 1, 1991. Version 1 
Checksum: 5F121C3DE2E1A42D 



FASTA 



793 



86.804 
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10 


20 


30 
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140 
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IGGVRMLDGE 


VTDAVEARSL 
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260 


2 7£ 


2 8_0 
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300 
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KTVDGPARXA 


EEAFFRGVSQ 


GRGGLGSIFV 


WASGNGGREH 


31£ 
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DSCNCDGYTN 


SIYTLSISSA 


TQFGNVPWYS 


EACSSTLATT 


YSSGNQNEKQ 


IVrrULRQKC 


3 7 0^ 
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4 0£ 


410 


42£ 


TESHTGTSAS 


APLAAGIIAIi 


TLEANKNLTW 
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SKPAHIiNADD 
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430^ 
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450^ 
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47£ 


48£ 
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NDVEIIRASV 
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CQGPAPTDCL 
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690 


7 00 


710 


720 


SCPSHASLDP 


VEQTCSRQSQ 


SSRESRPQQQ 


PPALRPEVEM 


EPRLQAGLAS 


HLPEVLAGLS 


73£ 


740 


750 


760 


770 


780 


CLIIVLIFGI 


VFLFLHRCSG 


FSFRGMKVYT 


MDR6LISYKG 


LPPBAWQEEC 


PSDSEEDEGR 



790 

GERTAFIKDQ 



SAL 



« Hide 
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3D structure databases 






@ PDBe 

® RCSB PDB 

® PDBj 


Entry Method Resolution Chain Positions 
1 (A) 

1P8J X-ray 2.60 A/B/C/D/E/F/G/H 108-578 


PDBsum 
[»] 


ProteinModelPortal 


P23188. 


SMR 


P23188. Positions 30-100, 109-578, 593-681. 


Mod Base 


Search... 




Protein-protein interaction databases 




STRING 


P23188. 




Protein family/group databases 




MEROPS 


S08.071. 




PTM databases 






PhosphoSite 


P23188. 




Proteomic databases 






PRIDE 


P23188. 




Genome annotation databases 
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KEGG 


mmu:18550. 


Organism-specific databases 


CTD 


j 18550. 


MGI 


1 MGI:97513. Furin. 


Pliylogenomic databases 




eggNOG 


roNOG10014. 


HOGENOM 


HBG715943. 


HOVERGEN 


HBG008705. 


InParanoid 


P23188. 


OrthoDB 


EOG4ZW59M. 


PhylomeDB 


P23188. 


Enzyme and patliway databases 


BRENDA 

• 


3.4.21.76.244. 


Gene expression databases 




ArrayExpress 


P23188, 


Bgee 


P23188. 

.. 


CleanEx 


MM_FURIN. 


Genevestigator 


P231 88. 


GermOnline 


ENSMUSG00000030530. Mus musculus. 


Family and domain databases 


InterPro 
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[Graphical view] 


GeneSD 


G3DSA:3.40.50.200. Pept_S8_S53. 1 hit. 


PANTHER 


PTHR10795. SubtilSerProt. 1 hit. 


Pfam 


PF01483. P_proprotein. 1 hit. 
PF00082. Peptldase_S8. 1 hit. 
[Graphical view] 


PRINTS 


PR00723. SUBTILISIN. 


SMART 


SM00261. FU. 2 hits. 
[Graphical view] 


SUPFAM 


SSF49785. Gal_bindjike. 1 hit. 
SSF57184. Grow fac recept. 1 hit. 
SSF52743. Pept 88 S53. 1 hit. 
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PS00137. SUBTILASE HIS. 1 hit. 
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ProtoNet 
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[Graphical view] 
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Q28193 (FURIN_BOVIN)# Reviewed, UniProtKB/Swiss-Prot 

Last modified April 5, 2011. Version 100. 



Names and origin 


Protein names 


i RQCOfnfnendQd namB' 
Furin 
EC=3.4.21.75 
Alternative name(s): 
! Dibasic-processing enzyme 
! Paired basic amino acid residue-cleaving enzyme 

1 Short nanie=PACE 

Trans Golgi network protease furin 


Gene nannes 


Name: FURIN 
; Synonyms:FUR. PACE 


Oraani^in 




1 axonomic loeniiTier 




Taxonomic lineage 


Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > 
Euteleostomi > Mammalia > Eutheria > Laurasiatheria > 
Cetartiodactyla > Ruminantia > Pecora > Bovidae > Bovinae > 

DOS 


Protein attributes 


1 


Sequence length 


797 AA. 


Sequence status 


Complete. 


Sequence processing 


The displayed sequence is further processed into a mature 
form. 


Protein existence 


Evidence at protein level. 


General annotation (Comments) 


Function 


Furin is likely to represent the ubiquitous endoprotease activity 
within constitutive secretory pathways and capable of cleavage 
at the RX(K/R)R consensus motif. 


Catalytic activity 


Release of mature proteins from their proproteins by cleavage 
of -Arg-Xaa-Yaa-Arg-|-Zaa- bonds, where Xaa can be any 
amino acid and Yaa is Arg or Lys. Releases albumin, 
complement component C3 and vWF from their respective 
precursors. 


Cofactor 


nalr.il jml By similarity 


Enzyme regulation 


Could be inhibited by the not secondly cleaved propeptide. 


Subunit structure 


Interacts with FLNA. Binds to PACS1 which mediates TGN 

Inrali7??tinn anH ronnf»rtinn tn niathrin aHaptprc; | By similarity | 


Subcellular location 

1 


Golgi apparatus > trans-Golgi network membrane; Single-pass 
type 1 membrane protein. Cell membrane; Single-pass type 1 
membrane protein. Note: Shuttles between the trans-Golgi 
network and the cell surface. Propeptide cleavage is a 
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j prerequisite for exit of furin molecules out of the endoplasmic 





rptirulum ^FR^ A ^^rond clea\/aae within the oroDeotide occurs 

in the trans Golgi network (TGN), followed by the release of the 
propeptide and the activation of furin. 


Tissue specificity 


Seems to be expressed ubiquitously. 


Domain 


Contains a cytoplasmic domain responsible for its TGN 
localization and recycling from the cell surface. 


Post-translational modification 


The inhibition peptide, which plays the role of an intramolecular 
chaperone, is autocatalytically removed in the endoplasmic 
reticulum (ER) and remains non-covalently bound to furin as a 
potent autoinhibitor. Following transport to the trans Golgi. a 
second cleavage within the inhibition propeptide results in 
propeptide dissociation and furin activation. 

Phosphorylation is required for TGN localization of the 
endoprotease. In vivo, exists as di-, mono- and non- 
phosphorylated forms 1 By similarity |. 


Sequence similarities 


Belongs to the peptidase S8 family. Furin subfamily. 
Contains 1 homo B/P domain. 

1 
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Ontologies 


Keywords 




Cellular component 


Cell membrane 
Golgi apparatus 
Membrane 


Domain 


Signal 

Transmembrane 
Transmembrane helix 


Ligand 


Calcium 
Metal-binding 


Molecular function 


Hydrolase 
Protease 
Serine protease 


PTM 


Autocatalytic cleavage 

Cleavage on pair of basic residues 

Disulfide bond 

Glycoprotein 

Phosphoprotein 

Zymogen 


Gene Ontology (GO) 




Cellular conriDonent 


Intparsl tn mpmhran^ 

Inferred from electronic annotation. Source: UniProtKB-KW 
plasma membrane 

Inferred from electronic annotation. Source: UniProtKB-SubCell 


Molecular function 


metal ion binding 

Inferred from electronic annotation. Source: UniProtKB-KW 



Complete GO annotation... 



Sequence annotation (Features) 



Moi< 

iei 


Feature key 

3cule processing 

Signal peptide 
Propeptide 


Position(s) 

1-24 
25-107 


Length 
83 


Description 

1 Potential I 

Inhibition 
peptide 

1 By simiiarity | 


Graphical view 

1 

if 


Feature i( 
PRO_00( 


ie\ 

)0( 
)0( 

1 




Chain 


■■■■■ 1 ... -. . - . — 1..-.. 

108-797 


690 


Furin 


■ " "— ' ■■ ■"■ "■ ■' ' ' 

1 


PRO_00( 



Regions 





Transmembrane j 71 9 - 741 


23 


Helical; 








Potential | 
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□ 


Region 


762 ~ 765 


4 


Cell surface 
signal 


1 




— 


□ 


Motif 


498 - 500 


3 


Cell attachment 

ftif A 1 Potential I 


1 






□ 


Motif 


776 - 782 


7 


Trans Golgi 
network signal 


1 








Compositional bias 


556 - 708 


153 


Cys-rich 


1 1 






□ 


Compositional bias 


690 - 698 


9 


Poly-Pro 


1 







Sites 



Active site 



153 



Charge relay 
system 

By similarity I 



Active site 



194 



Charge relay 
system 

I By similarity | 



Active site 



368 



Charge relay 
system 

I By similarity 1 



Metal binding 



□ 



Metal binding 



115 



162 



Calcium 1 

I By similarity | 



Calcium 1 

{ By similarity | 



Metal binding 



Metal binding 



Metal binding 



Metal binding 



208 



258 



301 



331 



Calcium 1 

I By similarity | 



Calcium 2 

1 By simitarity I 



Calcium 2 

I By similarity | 



Calcium 2 



I By similarity 



Site 



75-76 



Cleavage, 
second; by 
autolysis 



By similarity 



Site 



107-108 



I 



Cleavage, first; 
by autolysis 

I By similarity I 



Amino acid modifications 



□ 


Modified residue 






Modified residue 

1 





776 


1 


Phosphoserine; 
by CK2 

1 By similarity | 


1 


778 


* 1 


Phosphoserine; 
by CK2 

1 By similarity I 


1 

i 
i 
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□ 


Glycosylation 


387 


1 


N-linked 
(GlcNAc.) 

1 Potential I 


1 




□ 


Glycosylation 


440 


1 


N-linked 

(GIClMAc.) 

1 Pctentiaf j 


1 


■ 


□ 


Glycosylation 


553 


1 


N-linked 
(GlcNAc.) 


1 




□ 


Disulfide bond 


211 ^360 




By similarity I 


1 1 




□ 


Disulfide bond 


303 ^ 333 




By similarity 


II 




□ 


Disulfide bond 


450 ^ 474 




By similarity 


II 





Sequences 



Sequence Length Mass (Da) 

g] Q28193 [UniParc]. FASTA 797 87.251 

Last modified November 1, 1997. Version 1. 
Checksum: 466F28EC0246C3D2 



i 12 20 30 40 50 eg_ 

\ MELRPWLFWV VAAAGALVLL VADARGEKVF TNTWAVHIPG GPAVADRVAR KHGFLNLGQI 

; 70 80 90 100 110 120 
FGDYYHFWHR AVTKRSIiSPH RLGHNRLQRE PQVKWLEQQV AKRRAKRDIY QEPTDPKFPQ 

t 

i 130 140 150 160 170 180 

i QWYLSGVTQR DLNVKEAWAQ GYTGRGIWS ILDDGIEKNH PDLAGNYDPG ASFDVNDQDP 



190 200 210 220 230 240 

DPQPRYTQMN DNRHGTRCAG EVAAVANNGV CGVGVAYNAR IGGVRMLDGE VTDAVEARSL 

j 250 260 272 280 290 300 

1 GliNPNHIHIY SASWGPEDDG KTVDGPAHIiA EEAFFRGVSQ GRGGLGSIFV WASGNGGREH 

i 

310 320 330 340 350 36£ 

j DSCNCDGYTN SIYTLSISSA TQFGNVPWYS EACSSTLATT YSSGNQNEKQ IVTTDLRQKC 

. 370 380 390 400 410 420 

\ TESHTGTSAF APLAAGIIAL TLEANKNLTW RDMQHLWRT SKPAHLNAND WATNGVGRKV 

i 432 442 450 462 470 482 

' SHSYGYGIiLD AGAMVALAQN WTTVAPQRKC TIDILTEPKD IGKRLEVRKT VTACLGEPSH 

I 

1 

; 492 502 512 522 532 542 

; ITRLEHAQAR LTLSYNRRGD liAIHLVSPMG TRSTLLAARP HDYSADGFND WAFMTTHSWD 

j 552 562 572 582 592 ^^9. 

\ EDPSGEWVLE lENTSEANNY GTLTKFTLVL YGTAPEGLPT PPESIGCKTL TSSQACWCE 

i 

612 620 630 640 650 662 

EGFSLHQKNC VQHCPPGFAP QVLDTHYSTE NDVEIIRASV CTPCHASCAT CQGPAPTDCL 



I 



672 682 702 712 '^^9. 

SCPSHASLDP VEQTCSRQSQ SSRESHQQQP PPPPRPPPAE VATEPRLRAD LLPSHLPEW 

732 ■'42 752 762 ''^2. 

AGLSCAFIVL VFVTVFLVLQ LRSGFSFRGV KVYTMDRGLI SYKGLPPEAW QEECPSDSEE 



7 92 

DEGRGERTAF IKOQSAIj 

I 

« Hide 
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G3DSA:3.40.50.200. Pept_S8_S53. 1 hit. 


PANTHER 


PTHR10795. SubtilSerProt. 1 hit. 
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; 

I 


PF01483. P^proprotein. 1 hit. 
PF00082. Peptidase_S8. 1 hit. 
[Graphical view] 


PRINTS 


PR00723. SUBTILISIN, 


SMART 

j 

SUPFAM 

i 
1 


SM00261. FU. 2hits. 
[Graphical view] 

SSF49785. GaLbindJike. 1 hit. 
SSF57184. Grow fac recept. 1 hit. 
SSF52743. Pept_S8_S53. 1 hit. 
SSF54897, ProtJnh_propept. 1 hit. 


PROSITE j 

i 


PS00136, 8UBTILA8E_A8P. 1 hit. 
P0OOI37. oUdTILAoc his. 1 nit. 
PS00138. SUBTILASE_SER. False negative. 
[Graphical view] 


ProtoNet | 


Search,.. 


Other Resources 




PMAP-CutDB 1 


Q28193. 
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Entry name 
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FURIN BOVIN 



! Primary (citable) accession number: Q28193 
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Entry history 
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UniProtKB/Swiss- 

Prot: 

Last sequence November 1, 1997 
update: 

Last modified: April 5, 201 1 

This IS version 100 of the entry and version 1 of the sequence. 
[Complete history] 


Entry status 


Reviewed (UniProtKB/Swiss-Prot) 


Annotation program 


Chordata Protein Annotation Program 


Relevant documents 




Peptidase families 

Classification of peptidase families and list of entries 



SIMILARITY comments 

Index of protein domains and families 
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P23377 (FURIN^RAT)* Reviewed. UniProtKB/Swiss-Prot 

Last modified March 8, 2011. Version 110. 



Names and origin 



Protein names 


Recommended name: 

r unn 

EC=3.4.21.75 
Alternative name(s): 

Dibasic-processing enzyme 

Paired basic amino acid residue-cleaving enzyme 

Short name=PACE 

Prohormone convertase 3 


Gene names 


; Synonyms: Fur, PcskS 


Organism 


h 

Rattus norvegicus (Rat) 


Taxonomic identifier 


I 10116 [NCBl] 


1 OaWI IVI I llw III iVCI^C 


U.Ur\Cll y V^ICI / IVIOlCl^wCI ' Vnfl iWl UCllCI ' V^l CllliCllCi ' V C7I vwMI CiiCI ' 

Euteleostomi > Mammalia > Eutheria > Euarchontoglires > Glires 
> Rodentia > Sciurognathi > Muroidea > Muridae > Murinae > 
Rattus 


Protein attributes 




Sequence length 


793 AA. 


Sequence status 


Complete. 


Sequence processing 


The displayed sequence is further processed into a mature 

form. 


Protein existence 


Evidence at protein level. 


General annotation (Comments) 


Function 

i 


Furin is likely to represent the ubiquitous endoprotease activity 
within constitutive secretory pathways and capable of cleavage 

at the RX(K/R)R consensus motif. 


Catalytic activity 

1 

! 

i 


Release of mature proteins from their proproteins by cleavage 
of -Arg-Xaa-Yaa-Arg-|-Zaa- bonds, where Xaa can be any 
amino acid and Yaa is Arg or Lys. Releases albumin, 
complement component C3 and vWF from their respective 
precursors. 


Cofactor 


nalrJiiml By similarity 


Enzyme regulation 


Could be inhibited by the not secondly cleaved propeptide. 


Subunit structure 


Interants with Fl NA I By similarity j Rinds to PACSI which 

mediates TGN localization and connection to clathrin adapters. 

1 Ref2 1 


Subcellular location 


Golgi apparatus > trans-Golgi network membrane; Single-pass 
type 1 membrane protein. Cell membrane; Single-pass type 1 
membrane protein. Note: Shuttles between the trans-Golgi 
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network and the cell surface. Propeptide cleavage is a 
prerequisite for exit of furin molecules out of the endoplasmic 
reticulum (ER). A second cleavage within the propeptide occurs 
in the trans Golgi network (TGN). followed by the release of the 
propeptide and the activation of furin 1 By simiiantyj. 


Tissue specificity 


Seems to be expressed ubiquitously. 


Developmental stage 


Expressed at E7 day in endoderm and mesodemn, uniformly 
expressed until E10, when expression is higher in heart and 
liver primordia. In mid- and late-gestational stages, widely 
expressed. 


Domain 


Contains a cytoplasmic domain responsible for its TGN 
localization and recycling from the cell surface. 


Post-translational modification 


The inhibition peptide, which plays the role of an intramolecular 
chaperone, is autocatalytically removed in the endoplasmic 
reticulum (ER) and remains non-covalently bound to furin as a 
potent autoinhibltor. Following transport to the trans Golgi, a 
second cleavage within the inhibition propeptide results in 
propeptide dissociation and furin activation 1 similarity |. 

Phosphorylation is required for TGN localization of the 

anrlr^nrr^tcksafto In x/ivo ^vi^tQ rii« ITlonO- and non* 

phosphorylated fomris 1 By similarity |. 


oec|uence simiianiies 


DclunyS VO \r\tS pcpilUcloC OO ICIIIlliy. nuiiii ouiMriaiiiiiy> 

Contains 1 homo B/P domain. 


Ontologies 




Keywords 




Cellular component 

• 


Cell membrane 
Golgi apparatus 
Membrane 


Domain 


Signal 

Transmembrane 
Transmembrane helix 


Ligand 


Calcium 
Metal-binding 


Molecular function 


Hydrolase 
Protease 
Serine protease 


PTM 


Autocatalytic cleavage 

Cleavage on pair of basic residues 

Disulfide bond 

Glycoprotein 

Phosphoprotein 

Zymogen 



Gene Ontology (GO) 

Biological process 



! aging 

I Inferred from expression pattern. Source: RGD 
J positive regulation of cell migration 
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Inferred from mutant phenotype. Source: RGD 

positive regulation of transforming growth factor beta receptor 
signaling pathway 

Inferred from mutant phenotype. Source: RGD 



Cellular component 


Golgi cisterna 

Inferred from direct assay. Source: RGD 

endoplasmic reticulum membrane 
Inferred from direct assay. Source: RGD 

integral to membrane 

Inferred from electronic annotation. Source: UniProtKB-KW 
plasma membrane 

Inferred from direct assay. Source: RGD 


Molecular function 


metal ion binding 




Inferred from electronic annotation. Source: UniProtKB-KW 




serine-type endopeptidase activity 




Inferred from mutant phenotype. Source: RGD 


Complete GO annotation... 





Sequence annotation (Features) 



Feature key 

Molecule processing 

IS ! Signal peptide 
Propeptide 



I Position(s) ■ Length Description 



i Graphical view 



i Feature iclei 

I i 



1 -24 
25-107 



24 I I Potential | 



83 Inhibition 
peptide 

simiiarilyn 



I I 



-J. 



PRO OO0O( 



1^ 



Chain 



108-793 



686 



Furin 



PRO OOOOC 
— I 



Regions 

Transmembrane 



Region 



715-735 



758 - 761 



Motif 



Motif 



Compositional bias 



498 - 500 



21 



772 - 778 



Helical; 

Potential I 



Cell surface 
signal 



Cell attachme nt 

Ritg> I Potential I 



II 



4- 



Trans Golgi 
network signal 



556 - 705 



150 j Cys-rich 



I I 



Sites 

□ 1 Active site 



153 



1 j Charge relay 
system 

i By similarity I 



http ://www.uniprot.org/un iprot/P23 3 77 



4/12/2011 



Furin precursor - Rattus noi^|gicus (Rat) 




Page 4 of 8 



Active site 



194 



1 



Charge relay 
system 

{ By simitarity | 



Active site 



368 



1 



□ 



Metal binding 



Metal binding 



Charge relay 
system 

I By similarity 



115 



162 



1 



Calcium 1 

I By similarity | 



1 



Calcium 1 

By similarity | 



Metal binding 



208 



1 



Calcium 1 



I By similarity 



Metal binding 



258 



1 



Calcium 2 

I By simitarity I 



Metal binding 



301 



1 



Calcium 2 

I By similarity 



Metal binding 



331 



1 



Calcium 2 

I By similarity I 



I 



Site 



75-76 



Cleavage, 
second; by 
autolysis 

I By similarity j 



Site 



107-108 



Cleavage, first; 
by autolysis 

I By similarity | 



Amino acid nnodifications 



Modified residue 



772 



Phosphoserine; 
by CK2 

I By similarity I 



Modified residue 



774 



1 



Phosphoserine; 
by CK2 

I By similarity j 



□ 



Glycosylatlon 



387 



1 



N-linked 

(GlcNAc ) 

I Potentiall 



□ 



Glycosylatlon 



440 



1 



N-linked 
(GlcNAc .) 

I Potential I 



Glycosylatlon 



553 



1 



N-linked 
(GlcNAc .) 

I Potential ' 



□ 



Disulfide bond 



□ 



Disulfide bond 



, 211 ^ 360 I 

■ m.^ .«■. i». - 



I By similarity I 



I I 



I 303 ^ 333 • 



By similarity | 



— p. 



11 



□ 



Disulfide bond 



I 



450 — 474 



By similarity I 



II 
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Sequences 



Sequence Length Mass (Da) 

P23377 [UniParc]. FASTA 793 86.653 

Last modified November 1, 1991. Version 1. 
Checksum: 87C22C345AE0A25C 



10 20 30 40 50 60 

, MELRPWliLWV VAAAGALVIiL AAEARGQKIF TNTWAVHISG GPAVADSVAR KHGFHNLGQI 

70 8^ 90 10£ 110 120 

FGDYYHFWHR AVTKRSLSPH RPRHSRLQRV PQVKWLEQQV AKQRAKRDVY 'QEPTDPKFPQ 

130 140 150 160 170 180 

• QWYLSGVTQR DLNVKEAWAQ GFTGRGIWS ILDDGIEKNH PDIiAGNYDPG ASFDVNDQDP 

190 200 210 220 230 240 

' DPQPRYTQMN DNRHGTRCAG EVAAVANNGV CGVGVAYNAR IGGVRMLDGE VTDAVEARSL 

! 

j 25 0 26 0 270 280. 290 3 00 

j GLNPNHIHIY SASWGPEDDG KTVDGPARLA EEAFFRGVSQ GRGGLGSIFV WASGNGGREH 

j 310 320 330 340 350 360 

. DSCNCDGYTN SIYTLSISSA TQFGNVPWYS EACSSTliATT YSSGNQNEKQ IVTTDLRQKC 

[ 370 380 390 400 410 420 

TESHTGTSAS APLAAGIIAL TLEANKNLTW RDMQHLWQT SKPAHLNAND WATNGVGRKV 

I 

! 4 3£ 44£ 450 460 47 0 4 8£ 

' SHSYGYGLLD AGAMVAIiAQN WTTVAPQRKC IIEILAEPKD IGKRLEVRKT VTACLGEPNH 

490 500 510 52£ 530 540 

ISRLEHVQAR LTLSYNRRGD LAXHLXSPMG TRSTLIjAARP HDYS7UX3FND WAFMTTHSWD 

550 560 570 580 590 600 

j EDPSGEWVLE lENTSEANNY GTLTKFTLVL YGTASEGLSA PPESSGCKTL TSSQACWCE 

610 620 630 640 65£ 66£ 

EGFSLHQKSC VQRCPPGFTP QVLDTHYSTE NDVEIIRASV CTPCHASCAT CQGPAPTDCL 

670 680 690 700 710 720 

! SCPSHASIaDP veqtcsrqsq ssresrpqqp ppalrpevev eprlraglas hlpevlagls 

73£ 740 750 760 770 780 

CLIIALIFGI VFLFLHRCSG FSFRGVKVYT MDRGLISYKG LPPEAWQEEC PSDSEEDEGR 

7 90 

: GBRTAFIKDQ SAL 

* 

« Hide 
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Additional computationally mapped references. 



Cross-references 


Sequence databases 






@EMBL 
® GenBank 
® DDBJ 


X56660 mRNA. Translation: CAA39193.1. 




IPI 


IPI00210230. 


PIR 


KXRTF. 813106. 


RefSeq 


NP_062204,1. NM_01 9331.1. 


UnlGene 


Rn.3220. 




3D structure databases 






ProteinModelPortal 


P23377. 




SMR 


P23377. Positions 30-99, 108-574, 592-665. 


Mod Base 


Search... 




Protein-protein interaction databases 




STRING 


P23377. 




Protein family/group databases 






MEROPS 


808.071. 




Proteomic databases 






PRIDE 


P23377. 




Genome annotation databases 






EnsembI 


ENSRNOT0000001 5521 ; ENSRNOP0000001 5521 ; 
ENSRNOG0000001 1 352. 




GenelD 


54281. 


KEGG 


rno:54281. 


UCSC 


NM_019331. rat. 




Organism-specific databases 






CTD 


54281. 




RGD 


3274. Furin. 




Phylogenomic databases 






eggNOG | 


maNOG16185. 





GeneTree 



HOVERGEN 



t- 



ENSGT00600000084064. 
HBG008705, 
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InParanoid 


P23377. 


OrthoDB 


EOG42W59M. 


PhylomeDB 

Enzyme and pathway databases 
BRENDA 

Gene expression databases 

ArrayExpress 


P23377. 
1 3.4.21.75. 248. ♦ 
P23377. 


Genevestigator 


P23377. 


GermOnline 

Family and domain databases 
InterPro 


ENSRNOG00000011352. Rattus norvegicus. 

\ IPR006212. Furin repeat. 
IPR008979. Galactose-bd-like. 
IPR009030. Growth fac rcpt. 
IPR000209. Peptidase_S8/S53. 
IPR022398. Peptidase S8/S53 AS. 
IPR01 5500. Peptidase_S8_subtilisin-rel. 
IPR009020. ProtJnh_propept. 
IPR002884. PrprotnconvertsP. 
[Grapliical view] 


GeneSD 


G3DSA:3.40.50.200. Pept_S8_S53. 1 hit. 


PANTHER 


PTHR10795. SubtilSerProt. 1 hit. 


Pfam 


PF01483. P _j)roprotein. 1 hit. 
PF00082. Peptidase_S8. 1 hit. 
[Graphicai view] 


PRINTS 


PR00723. SUBTILISIN. 


SMART 


SM00261. FU. 2 hits. 
[Graphical view] 


SUPFAM 


SSF49785. Gal_bindjike, 1 hit. 
SSF57184. Grow_fac_recept. 1 hit. 
SSF52743. Pept_S8_S53. 1 hit. 
SSF54897. ProtJnh_propept. 1 hit. 


PROSITE 


PS00136. SUBTILASE ASP. 1 hit. 
PS00137. SUBTILASE HIS. 1 hit. 
PS00138. SUBTILASE„SER. 1 hit, 
[Graphical view] 


ProtoNet 

Winer rxvsources 

Entry information 


Search... 

610854 


Entry name 


FURIN.RAT 


Accession 


Primary (citable) accession number: P23377 
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Entry history 


Integrated into November 1, 1991 
' UniProtKB/Swiss- 
Prot: 

Last sequence Novemoer 1, 
update: 

Last modified: March 8. 201 1 

This is version 110 of the entry and version 1 of the sequence. 
[Complete history] 


Entry status 


Reviewed (UniProtKB/Swiss-Prot) 


Annotation program 


Chprdata Protein Annotation Program 


Relevant documents 




Peptidase families 

Classification of peptidase families and list of entries 



SIMILARITY comments 

Index of protein domains and families 
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P29119(FURI1_XENLA)^k' Reviewed, UniProtKB/Swiss-Prot 

Last modified October 5, 2010, Version 81. 



Names and origin 


Protein names 


Recommended name: 

Furin-1 
EC=3.4.21.75 
Alternative name(s): 

Dibasic-processing enzyme 

Paired basic amino acid residue-cleaving enzyme 

Short nama=PAOF 


Gene names 


Name:furin 


Organism 


Xenopus iaevis (African clawed frog) 


Taxonomic identifier 


8355 [NCBI] 


Taxonomic lineage 


Eukaryota > Metazoa > Chordata > Craniate > Vertebrata > 
Euteleostomi > Amphibia > Batrachia > Anura > Mesobatrachia > 
Pipoidea > Pipidae > Xenopodinae > Xenopus > Xenopus 


Protein attributes 




Sequence length 


783 AA. 


Sequence status 


Complete. 


Sequence processing 


The displayed sequence is further processed Into a mature 
form. 


Protein existence 


Evidence at transcript level. 


General annotation (Comments) 


Function 


Furin is likely to represent the ubiquitous endoprotease activity 
within constitutive secretory pathways and capable of cleavage 
at the RX(K/R)R consensus motif. 


Catalytic activity 


Release of mature proteins from their proproteins by cleavage 
of -Arg-Xaa-Yaa-Arg-|-Zaa- bonds, where Xaa can be any 
amino acid and Yaa is Arg or Lys. Releases albumin, 
complement component C3 and vWF from their respective 
precursors. 


Subcellular location 


Mf^mhran*:^- 5^inglp-paf;R m<=mhrAnP prntf^in Potential 


Tissue specificity 

' .> 1 III. — ..'i 


In all tissues analyzed. 


Sequence similarities 


Belongs to the peptidase S8 family. Furin subfamily. 


Ontologies 




Keywords 

Cellular component 


Membrane 
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Domain 


Signal 

Transmembrane 
Transmembrane helix 


Molecular function 


Hydrolase 
Protease 

Serine orotease 


PTM 


Cleavage on pair of basic residues 

Disulfide bond 

Glycoprotein 

Zymogen 


Gene Ontology (GO) 

Biological process 


proteolysis 

Inferred from electronic annotation. Source: InterPro 


Cellular component 


integral to membrane 

Inferred from electronic annotation. Source: UniProtKB-KW 


Molecular function 


serine-type endopeptidase activity 

Inferred from electronic annotation. Source: InterPro 



Complete GO annotation. 



Sequence annotation (Features) 



Feature key 



Molecule processing 



Position(s) Length Description Graphical view 



Feature identifier 
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' »■■■ ■» 
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24 
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Potential 
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Propeptide 
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I I Potential 
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301 
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151 
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Charge 

relay 

system 

1 By similarity I 
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Charge 

relay 

system 

1 By similarity 1 
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< 




Active site 
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Charge 
relay 


' i 
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system 

By similarity I 



Amino acid modifications 
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□ 



□ 



Glycosylation 
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Disulfide bond 



385 



551 



209 ^ 358 



Disulfide bond 



301 ^ 331 



1 
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N-linked 

( GlcNAc ) 

PotentiaTl 



N-linked 
(GlcNAc .) 

I Potential 
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By similarity 



I I 
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Sequences 



Sequence 



Length Mass (Da) 
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FASTA 



783 



86.444 



Last modified December 1, 1992. Version 1 
Checksum: 02476FD3247AF2C7 



10 



20 



30 



40 



50 



60 



MDLSPSLLLM LWTLLSVLVE EITGQKVYTN TWAAHISGGS AEADRLCKKY GFINHGLIFE 

70 8^ 90 100 110 120 

DHYHFSHRAV MKRSLTPKRT RQVLLKREPQ VHWLEQQVAK KRKKRDIYTD PTDPKFMQQW 



13 0- 

YLLDTNRHDL 



140 

HVKEAWEQGF 



150 

TGKGIWSIIi 



160 

DDGIEKNHPD 



170 

IjQANYDPAAS 



180^ 

YDVNDQDPDP 



1 19^ 200 21^ 220 230 240 

: QPRYTQIiNDN RHGTRCAGEV AAVANNGICG VGIAYNANIG GVRMLDGEVT DAVEARSLGL 

250 260 270 280 290 300 

i NPNHIHIYSA SWGPEDDGKT VDGPAKLAEE AFYRGVTQGR GGLGSIYVWA SGNGGREHDS 



310 

CNCDGYTNSI 



320 330 340 

YTLSISSTTQ MGNVPWYSEA CSSTLATTYS 



350 36£ 
SGNQNEKQIV TTDLRQKCTD 



370 380 390 400 410 420 

SHTGTSASAP LAAGIIALAL EANKNLTWRD MQHLWQTSN PAGLNANDWI TNGVGRKVSH 



43 0 

SYGYGLLDAG 
490 

TLEHVQARLS 

550 

PAGEWVLEIE 



440 

AMVAMAKTWV 
500 

LSYNCRGDLA 

560 

NVSNNNNYGT 



450 

TVGPQRKYVI 



460 

DIIiSEPKDIG 



510 520 
lYLTSPMGTR SCLLAPRPHD 



610 620 
YFIiHQKSCIK SCPQGFTSSI 



670 

PAHSHYNIiLD 



680 

YSCTHQTQRS 



570 

LTQFVLVIiYG 
630 

QNIHYTLDNN 
690 

RESPTLKDSS 



580 

TASEGIiSRKF 
640 

lEPLLVNVCV 
700^ 

HDYVARTSNL 



470 

KALEVRRKVE 
530 

YSADGFNDWS 

590 

DGDGSRNVAS 



480 

PCAGMSNYIS 
540 

FMTTHSWDED 

600 

SQSCIVCEEG 



650 660 
PCHVSCATCK GTTINDCLTC 



710 

PFIVAILSCL 



720 

FIIWFGSIF 



730 740 750 760 770 780^ 

LFIiQLRSGGV LGRKRLYMLD SGIISYKGIP SGAWQEEGFS ESETEETAAH SERTAFLKQQ 



STL 

« Hide 



http://www.uniprot.org/uniprot/P291 19 



4/12/201 



Furin-l precursor - Xenopu^^evis (African clawed frog) 




Page 4 of 5 



References 


[1] 1 "Prohormone processing in Xenopus oocytes: characterization of cleavage signals and 
cleavage enzymes." 

Korner J., Chun J., O'Bryan L.. Axel R. 
i Proc. Natl. Acad. Sci. U.S.A. 88:11393-11397(1991) [PubMed: 1722329] [Abstract] 
i Cited for: NUCLEOTIDE SEQUENCE FMRNAl. 


C ros s -ref e re n ces 




Sequence databases 




@EMBL 
# GenBank 
® DDBJ 


M80471 mRNA, Translation: AAA497 17.1. 


PIR 

W mm ^ 


i A41627 
B41627. 


\Jl llV7d its 


AA. i OO. 


3D structure databases 






\ P9Q1 1Q 


SMR 


P29119. Positions 28-97, 108-572. 590-665. 


Mod Base 


Search... 


Protein family/group databases 




MEROPS 


S08.071. 


Phylogenomic databases 




HOVERGEN 


HBG008705. 


Enzyme and pathway databases 




BRENDA 


3.4.21.75. 648. 


Family and domain databases 




InterPro 

1 


IPR006212. Furln_repeat. 
IPR008979. Galactose-bd-like. 
IPR009030. Growth fac rcpt. 
IPR000209. Peptidase S8/S53. 
IPR022398. Peptidase S8/S63 AS. 
IPR01 5500. Peptidase_S8_subtilisin-rel. 
IPR009020. Protjnh_propept. 
IPR002884. PrprotnconvertsP. 
[Graphical view] 


Gene3D { 


G3DSA:3.40.50,200. Pept_S8_S53. 1 hit. 


PANTHER ! 


PTHR10795. SubtilSerProt. 1 hit. 


Pfam 


PF01483. P ^proprotein. 1 hit. 
PF00082. Peptidase_S8. 1 hit. 
[Graphical viev^ 



http://www,uniprot.org/uniprot/P291 19 



4/12/2011 



Furin-l precursor - Xenopu^^vis (African clawed frog) 




Page 5 of 5 



PRINTS 


1 PR00723. SUBTILISIN. 


SMART 


SM00261. FU. 2 hits. 
[Graphical view] 


SUPFAM 


SSF49785. Gal_bindjike. 1 hit. 
SSF57184. Grow_fac_recept. 1 hit. 
SSF52743. Pept S8 S53. 1 hit. 
SSF54897. ProtJnh_propept. 1 hit 


PROSITE 


PS00136. SUBTILASE ASP. 1 hit. 
PS00137. SUBTILASE HIS. 1 hit, 
PS00138. SUBTILASE_SER. 1 hit. 
[Graphical view] 


ProtoNet 


Search... 


Entry information 





Entry name 
Accession 
Entry history 



FURI1_XENLA 

Primary (citable) accession number: P29119 





UniProtKB/Swiss- 




Prot; 




Last sequence December 1 . 1992 




update: 




Last modified: October 5. 2010 




This is version 81 of the entry and version 1 of the sequence. 




[Complete history] 


Entry status 


Reviewed (UniProtKB/Swiss-Prot) 


Annotation program 


Chordata Protein Annotation Program 



Relevant documents 



Peptidase families 

Classification of peptidase families and list of entries 

SIMILARITY comments 

Index of protein domains and families 



© 2002-201 1 UniProt Consortium | License & Disclaimer | Contact 



EMBL-EBI 




http://www.uniprot.org/uniprot/P291 19 



4/12/2011 



Cruzipain precursor - Trypafi^oma cruzi 



Page 1 of 8 



P26779 (CYSP_TRYCR)'<r Reviewed. UniProtKB/Swiss-Prot 

Last modified January 1 1 , 201 1 . Version 91 . 

f " ■ I ,.i - ., 



Names and origin 



Protein names 

• 


Recommended name: 

Cruzipain 
EC=3.4.22.51 
Alternative name(s): 

Cruzaine 

Major cysteine proteinase 


Organism \ Trypanosoma cruzi 


Taxonomic identifier j 5693 [NCBI] 


Taxonomic lineage ! Eukaryota > Euglenozoa > Kinetoplastida > Trypanosomatidae > • 

j Trypanosoma > Schizotrypanum 

Protein attributes 


Sequence length 


467 AA. 


Sequence status 


Complete. 


Sequence processing 


The displayed sequence is further processed into a mature 
form. 


Protein existence 


Evidence at protein level. 



General annotation (Comments) 



Function 

. 1 


Hydrolyzes chromogenic peptides at the carboxyl Arg or Lys; 
requires at least one more amino acid, preferably Arg. Phe, Val 
or Leu, between the terminal Arg or Lys and the amino-blocking 
group. 

The cysteine protease may play an important role in the 
development and differentiation of the parasites at several 
stages of their life cycle. 


Catalytic activity 


Broad endopeptidase specificity similar to that of cathepsin L. 


Enzyme regulation 


Strongly inhibited by E-64 (L-trans-epoxysuccinylleucylamido(4 
-guanidino)butane), Leupeptin, and N-alpha-p-tosyl-L-lysine 
chloromethyl ketone. 


Developmental stage 
Miscellaneous 

- .... _ . . 

Sequence similarities 

Ontologies 


Present in all developmental stages. 

Purified cruzipain is able to degrade itself, yielding a complex 
mixture of small peptides, and a major 25 kDa fragment. 

Belongs to the peptidase C1 family. 



Keywords 

Domain \ Signal 



http://www.uniprot.org/uniprot/P25779 4/12/201 1 



Cruzipain precursor - Trypaj^soma cruzi 



Page 2 of 8 



Molecular function 


Hydrolase 
Protease 
Thiol protease 


PTM 


Autocatalytic cleavage 
Disulfide bond 
Glycoprotein 

£j)f\ i Iwycfi 1 


Technical terni 


3D-structure 

Direct protein sequencing 


Gene Ontology (GO) 




Biological process 


proteolysis 

Inferred from electronic annotation. Source: InterPro 


Molecular function 


cysteine-type endopeptidase activity 

Inferred from electronic annotation. Source: InterPro 



Complete GO annotation... 



Sequence annotation (Features) 



Mol< 


! 
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scule processing 

Signal peptide 


Position(s) 
. 1-18 


Length 
18 


Description 

Probable 


Graphical view 
1 


Feature identifi 
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37 
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104 


Activation 
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J Probable | 


1 1 


PRO_000002Q 


IB 

Site! 

Q 


Chain 
Active site 
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147 


345 
1 


Cruzipain 


1 
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PRO_0000026 
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Ami 
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1 
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1 
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Disulfide bond 


144<-^ 185 
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□ 
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1 1 







Natural variations 
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Natural variant 
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Experimental info 



Sequence conflict 
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Details... 
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P25779 [UniParc]. FASTA 467 49,836 

Last modified May 1 , 1992. Version 1. 
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Prot: 

Last sequence May 1 , 1 992 
update: 

Last modified: January 1 1 , 201 1 

This is version 91 of the entry and version 1 of the sequence. 
[Complete history] 



Entry status Reviewed (UniProtKB/Swiss-Prot) 



Relevant documents 



PDB cross-references 

Index of Protein Data Bank (PDB) cross-references 

Peptidase families 

Classification of peptidase families and list of entries 

SIMILARITY comments 

Index of protein domains and families 



©2002-2011 UniProt Consortium | License & Disclaimer | Contact 
http://www.uniprot.org/uniprot/P25779 
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EMBL-EBlilin 




SIB 



http://www.uniprot.org/uniprot/P25779 
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Protein 

Translations of Life 



Display Settings: GenPept 

cysteine proteinase [Trypanosoma cruzl] 



GenBanlcAAA3Q269.1 
PASTA Gfaohtes 



Goto: 
LOCUS 

DEFINITION 

ACCESSION 

VERSION 

DBSOURCB 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 

TITLE 



JOURNAL 
COMMENT 
FEATURES 

source 



Protein 
mat peptide 
Region 



AAA30269 249 aa linear INV 26-APR-1993 

cysteine proteinase [Trypanosoma cruzi] . 

AAA30269 

AAA30269.1 GI:162334 

locus TRBTUL2TAN accession M69121 . 1 

Trypanosoma cruzi 

Trypanosoma cruzi 

Eukaryota; Euglenozoa; Kinetoplastida; Trypanosoma tidae; 
Trypanosoma; Schizotrypanum. 

1 (residues 1 to 249) 

Ccunpetella,0. , Henriksson, J. , A8lund,L.« Frasch,A.C.C. , 
Pettersson, J. and Cazzulo^J.J. 

The major cysteine proteinase (cruzipain) from Trypanosoma cruzi is 
encoded by genes organized in tandems located in different 
chromosomes 
Unpublished 

Method: conceptual translation. 
Location/Qualifiers 
1. .249 

/organisms "Trypanosoma cruzi" 
/strain=''TUL 2" 
/ db_xre f = " taxon ; 5693 " 
1. .249 



Site 



Site 



Region 



CDS 



ORIGIN 



/products "cysteine proteinase" 
<1. .249 

/product- "cysteine proteinase" 
<5. .116 

/ regi on_name = " Pept idase_ClA " 

/note=" Peptidase CIA subfamily (MEROPS database 
nomenclature); composed of cysteine peptidases (CPs) 
similar to papain, including the mammalian CPs (cathepsins 
B. C, F, H, L, K. O, S, V, X and W) . Papain is an 
endopeptidase with specific substrate preferences...; 

Cd02248" 

/db_xre f = " CDD : 30292 " 
ojrder (42, 64 . 67, 112) 
/si te__t ype = " other " 
/note="S2 subsite" 
/db xref»"CDD; 30292 " 
order (66, 86) 
/si te_type= "active " 
/db_xref="CDD; 3 0292 " 
<142. .199 

/ re g ion_name = " DUF3 586" 

/notes" Protein of unknown function (DUF3586) ; pfaml2131" 
/ db_xre f CDD : 152566 " 
1. .249 

/coded_by-"M69121. 1:<1. .750" 



// 



1 isppcttsgh tvgatitghv elpqdeaqia acvavngpva vavdasswmt ytggvmtscv 

61 seqldhgvll vgyndsaavp ywiiknswta qwgedgyiri akgsnqclvk eeassawgg 

121 pgptpepttt tttsapgpsp syfvqmsctd aacivgcenv tlptgqcllt tsgvsaivtc 

181 gaetlteevf ftsthcsgps vrssvplnqc nwllrgsvef fcgssssgrl advdrqrryq 
241 pyqsrhrrl 



http://www.ncbi.nlm.nih.gov/protein/AAA30269 
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Protein 

Translations of Ufe 
Display Settings: GenPept 

cysteine proteinase [Trypanosoma cruzi] 

GenBank: CAA38278.1 
PASTA S£gs!l!£S 



IjOCUS 

DEFINITION 
ACCESSION 
VERSION 
DBSOURCE 
KEYWORDS 
SOURCE 

ORGANISM 



CAA33278 
cysteine 
CAA38278 
CAA38278.1 GI 
embl accession 



proteinase 



173 aa 
[Trypanosoma cruzi) 



linear INV 19-APR-1991 



10606 
X54414 



REFERENCE 
AXTTHORS 

TITLE 

JOURNAL 
PUBMBD 
REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

FEATURES 

source 



ORIGIN 



// 



Trypauaosoma cruzi 

Trypanosoma cruzi 

Bukaryota; Euglenozoa; Kinetoplastida; Trypanosomatidae; 
Trypanosoma ; Schi zotrypanum . 

1 

A8lund«L., Henriksson, J. , Caiapetella.O. , Frasch,A.C., Petters8on,U. 
and Cazzulo,J.J. 

The C- terminal extension o£ the major cysteine proteinase 
(cruzipain) from Trypanosoma cruzi 
Mol. Biochem. Parasitol. 45 (2), 345-347 (1991) 
2038364 

2 (residues 1 to 173) 
Aaslund, L. 

Direct Submission , 

Submitted { 13-AUG- 1990) Dept. Medical Genetics, Biomedial Center, 
Box 589. S- 7512 3 Uppsala, Sweden 
Location/Qualifiers 
1. .173 

/organisms "Trypanosoma cruzi" 

/ db__xr e f = " t axon : 5693 " 
Protein <l.Tl73 

/product* "cysteine proteinase" 
Region <3 . .40 

/ r eg i on_name = " Pep t i das e^C 1 " 

/notea"Cl Peptidase family (MEROPS database nomenclature), 
also referred to as the papain family,- composed of two 
subfamilies of cysteine peptidases (CPs) , CIA (papain) and 
CIB (bleomycin hydrolase^ . Papain-like enzymes are mostly 
endopept Idases with some...,- cl00298* 
/ db_xre f =" CDD : 153664 " 
Region <66..123 

/region_name = "DUF3 586 " 

/note=" Protein of unknown function (DUF3586); pfaml2131" 
/db xref »"CDD: 1S2566 " 
CD?? 1..173 

/coded_by="X54414 . 1:<1. .522" 

/ db_xre f » "GOA : P25779 " 
/db 

/db~ 
/db' 

/db^ 

/db xref ="InterPro; lPR021981 " 
/ db_xre f = " PDB : lAIM " 
/ db_xr e f = " PDB : iewl " 
/db_xre f = " PDB : lEWM " 
/db~xre f « " PDB : lEWO " 
/db 
/db" 
/c3b' 
/db 
/db" 

/dbl 

/ db_xr e f = " PDB : 1ME4 " 
/db_xref ="PDB; 1U9Q " 
/ db_xr e f = " PDB : 2AIM " 
/ db_xr e f = " PDB : 2EFM " 
/ db_x r e f = " PDB : 20Z2 " 
/db 

/db_ 

/db_xref="PDB- 3IUT " 
/db_xref ="PDB; 3KKU " 

/db_xref-"UniProtKB/Sviss-Prot ; P25779 " 

1 aavpywiikn swtaqwgedg yiriakgsnq clvkeeassa wggpgptpe pttttttsap 
61 gpspsyfvqm sctdaacivg cenvtlptgq cllttsgvsa ivtcgaetlt eevfftsthc 
121 sgpsvrssvp Inqcnrllrg sveffcgsss sgrladvdrq rryqpyhsrh rrl 



xre f = " Inter Pro : IPR000169 " 
'xre f ■= " In terPro : IPR000668 " 

xre f = " InterPro : IPR01312B " 
"xref ="InterPro: IPR013201" 



xref g"PDB; lEWP " 
'xref »" PDB rlF29" 



_xre f = " PDB : iF2 A " 

xref =" PDB I 

xref' 
'xref« 



1F2B " 
="PDB: 1F2C " 
."PDBi 



1ME3< 



xref =" PDB: 3KD3" 
xref-"PDB:3I06" 



http://www.ncbi.nIm.nih.gov/protein/CAA38278 
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Protein 

Transiations of Life 
Display Settings: GenPept 



cysteine proteinase [T>7Panosoma cruzi] 



GenBank: AAA30270.1 
FA^TA Gfaphtes 



QSJQl 
LOCUS 

DEFINITION 

ACCESSION 

VERSION 

DBSOURCE 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 

TITLE 



JOURNAL 

COMMENT 
FEATURES 

source 



Protein 
sig peptide 
Region 



Region 



mat peptide 



Site 



CDS 



ORIGIN 



AAA30270 218 aa linear INV 26-APR-1993. 

cysteine proteinase [Trypanosoma cruzi] . 

AAA30270 

AAA30270.1 GI:162335 

locus TRBTUL2TAN accession M69121 . 1 

Trypanosoma cruzi 

Trypanosotwa cruzi 

Eukaryota; Euglenozoa; Kinetoplastida; Trypanosoma tidae; 
Trypanosoma; Schizotrypemum. 
1 (residues 1 to 218) 

Campetella«0. , Henriksson, J. , Aslund«L.« Frasch;A.C.C. , 
Pettersson, J. and Cazzulo, J. J. 

The major cysteine proteinase (cruzipain) from Trypanosoma cruzi is 
encoded by genes organized in tandems located in different 
chromosomes 
Unpublished 

Method: conceptual translation. 
Location/Qualifiers 
1. .218 

/organi8m= "Trypanosoma cruzi" 
/strain="TUL 2" 
/db_xre f = " taxon ; 5653 " 
1. .218 

/products "cysteine proteinase" 
1. . 18 

/note* "pre -enzyme domain" 

38. . 93 

/ r e g i onname = " I nh i b i t or I 2 9 " 

/note="Cathepsin propeptide inhibitor domain (129); 
C107031" 

/db xref="CDD; 157580 " 
124. .>218 

/ r e g i on_name >»"Peptidas e_C 1 A " 

/note= "Peptidase CIA subfamily {meropS database 
nomenclature) ; composed of cysteine peptidases (CPs) 
similar to papain, including the mammalian CPs (cathepsins 
B, C, F, H, L, K, O, S, V, X and W) . Papain is an 
endopeptidase with specific substrate preferences...; 
cd02248" 

/ db_xre f =" CDD : 30292 " 
125 . . >218 

/products "cysteine proteinase" 
/notes "pre -enzyme domain" 
order (141, 14 7) 
/si te_type= "active" 
/db xref="CDD; 30292 " 
1. .218 

/coded_by="M6 9 12 1.1:1192. .>1845" 



// 



1 msgwaralsl aavlwmacl vpaataslha eetlasqfve fkqkhgrvye saaeerfrls 
61 vfrenlflar lhaaanphat fgvtpfsdlt reefrsr^m gaahfaaaqe rarvpvnvev 
121 vgapaavdwr argavtavkd qgqcgscwaf saignvecqw flaghpltnl seqmlvscdk 
181 tdsgcggglm nnafewivqe nngavyteds ypyasgeg 



http://www.ncbi.nlm.nih.gov/protein/AAA30270 
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Protein 

Translations of Life 



Display Settings: GenPept 

cysteine protease [Trypanosoma cruzi] 

GenBank: AAA30180.1 
FASTA Graphics 



Goto: 

LOCUS 

DEFINITION 

ACCESSION 

VERSION 

DBSOURCE 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
JOURNAL 

REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
PUBMED 
COMMENT 



FEATURES 

source 



AAA30180 165 aa 

cysteine protease [Trypanosoma cruzi] 
AAA30180 

AAA30180.1 GI: 162044 

locus TRBCYPAA accession M27305.1 



linear INV 26 -APR- 1993 



Method: 



Protein 



Region 



Site 



Site 



CDS 



ORIGIN 



Trypanosoma cruzi 

Trypanosoma cruzi 

Eukaryota; Buglenozoa; Kinetoplastida; Trypanosoma t i dae ; 
Trypanosoma; Schizotrypanum. 

1 (residues 1 to 165) 
Bakin,A.E. 
Unpublished 

2 (Bites) 

Eakin,A.E., Bouvier,J., Sakanari, J. A. , Craik.C.S. amd McKerrow, J.H. 
Amplification and sequencing of genomic DNA fragments encoding 
cysteine proteases from protozoan parasites 
Mol. Biochem. Parasitol. 39 (1) « 1-8 (1990) 

2406590 

[2] sites; for [1] . 

Draft entry and computer- readable sequence for [1] kindly submitted 
by A.B.Eakin, 24-AUG-1989. 

conceptual translation. 
Location/Qualifiers 
1. . 165 

/organism** "Trypanosoma cruzi" 
/db_xre f = " t axon : S693 » 
1. . 165 

/name = "cysteine protease" 
1. .165 

/ region_name ■ " Pept idase_C 1 A ■ 

/note=" Peptidase ClA subfamily (MEROPS dateibase 
nomenclature} ; composed of cysteine peptidases (CPs) 
similar to papain, including the mammalian CPs (cathepsins 

C, F, H, L, K, O, S, V, X and W) . Papain is an 
endopeptidase with specific substrate preferences...; 
cd02248" 

/ db_xre f =" CDD : 30292 - 
order (1,7, 143, 163) 
/ s i t e_t ype = " ac t i ve " 
/db xrefg'CDD; 30292 " 
order (4 9. . 50, 120, 141, 144) 
/ si te_t ype = " o t he r " 
/notes "S2 subsite" 
/ db_xre f CDD : 30292 ■ 
1 . . 165 

/coded_by=''M27305 . 1 :<1 . . >495" 



// 



1 qgqcgscwaf saignvsgqw flaghpltnl seqmlvscdk tdsgcsgglm nnafewivqe 
61 nnggvyteds ypyasgegis ppcttsghtv gatitghvel pqdeaqiaaw lavngpvava 
121 hasswmtytg gvmtscvseq Idhglllvgy ndsaavpywi iknsw 



http://www.ncbi.nlm.nih.gov/protein/AAA30180 
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P00749 (UROK^HUMAN)* Reviewed. UniProtKB/Swiss-Prot 

Last modified April 5, 2011. Version 158. 



Names and origin 


rrotein names 


HeCOmm&nu&u HSniQ. 

Urokinase-type plasminogen activator 

Short name=U-plasminogen activator 
Short namp=iiPA 

EC=3.4.21.73 

Cleaved into the followina 3 chains: 

1 . Urokinase-type plasminogen activator long chain 
A 

2. Urokinase-type plasminogen activator short chain 

3. Urokinase-type plasminogen activator chain B 


Gene names 


NameiPLAU 


Organism 


Homo sapiens (Human) [Complete proteome] 


Taxonomic identifier 


9606 [NCBI] 


Taxonomic lineage 

Protein attributes 


Eukaryota > Metazoa > Chordata > Craniate > Vertebrata > 
Euteleostomi > Mammalia > Eutheria > Euarchontoglires > 
Primates > Haplorrhini > Catarrhini > Hominidae > Homo 


Sequence length | 431 AA. 


Sequence status 


Complete. 


Sequence processing 


The displayed sequence is further processed into a mature 
form. 


Protein existence 

General annotation (Commei 


Evidence at protein level, 
nts) 


Function 


Specifically cleave the zymogen plasminogen to form the active 
enzyme plasmin. 


Catalytic activity 


Specific cleavage of Arg-|-Val bond in plasminogen to form 
plasmin. 


Subunit structure 


Found in high and low molecular mass forms. Each consists of 
two chains, A and B. The high molecular mass form contains a 
long chain A which is cleaved to yield a short chain A. Binds 
LRP1B; binding is followed by internalization and degradation. 

InterartR with MRHP Int^^rarfc: with PI Al JR 1 Ret. 18 11 Ref.19 I 


1 ■ 

Subcellular location j Secreted. 


Tissue specificity j Expressed in the prostate gland and prostate cancers. ^Q^-^Q 1 


Post-translational modification 


Phosphorylation of Ser-158 and Ser-323 abolishes proadhesive 
ability but does not interfere with receptor binding. 



http://www.uniprot.org/uniprot/P00749 
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Pharmaceutical use 



Sequence similarities 



Available under the name Abbokinase (Abbott). Used in 
Pulmonary Embolism (PE) to initiates fibrinolysis. Clinically 
used for therapy of thrombolytic disorders. 

Belongs to the peptidase S1 family. 

Contains 1 EGF-like domain. 

Contains 1 kringle domain. 

Contains 1 peptidase S1 domain. 



Ontologies 



Keywords 

Biological process 



Blood coagulation 
Fibrinolysis 



Gene Ontology (GO) 

Biological process 





1 Plasminogen activation 


Cellular component 


1 Secreted 


Coding sequence diversity 


Alternative splicing 
Polymorphism 


Domain 


EGF-like domain 

Kringle 

Signal 


Molecular function 


Hydrolase 

Protease 
Serine protease 


PTM 


Disulfide bond 
Glycoprotein 
Phosphoprotein 
Zymogen 


Technical temri 


3D-structure 
Complete proteome 
Direct protein sequencing 
Pharmaceutical 



I blood coagulation 

I Traceable author statement. Source: Reactome 
j chemotaxis 

j Traceable author statement. Source; Protinc 
; fibrinolysis 

i Traceable author statement. Source: Reactome 
proteolysis 

Traceable author statement. Source: Protinc 

regulation of cell adhesion mediated by integrin 
Inferred from direct assay. Source: BHF-UCL 

regulation of receptor activity 
Inferred from direct assay. Source: BHF-UCL 

regulation of smooth muscle cell migration 
! Inferred from direct assay. Source: BHF-UCL 

regulation of smooth muscle cell-matrix adhesion 



http://w\vw.uniprot.org/uniprot/P00749 4/1 2/201 1 
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Cellular component 



Inferred from direct assay. Source: BHF-UCL 
signal transduction 

Traceable author statement. Source: Protinc 



Molecular function 



cell surface 

Inferred from direct assay. Source: BHF-UCL 
extracellular space 

Inferred from direct assay. Source: BHF-UCL 
plasma membrane 

Inferred from experiment. Source: Reactome 



serlne-type endopeptidase activity 

Traceable author statement. Source: Reactome 



Complete GO annotation. 



Alternative products 



This entry describes 2 isoforms produced by alternative splicmg. [Align] [Select] 



isof orm 1 (identifier: P00749-1 ) 

This isoform has been chosen as the 'canonical' sequence. All positional infonmation in this entry 
refers to it This is also the sequence that appears in the downloadable versions of the entry. 

Isoform 2 (identifier: P00749-2) 

The sequence of this isoform differs from the canonical sequence as follows: 
1-29: MRALLARLLLCVLWSDSKGSNELHQVPS MVFHLRTRYEQA 



Sequence annotation (Features) 



Feature key 

Molecule processing 
Signal peptide 



Position(s) 



Length Description 



Graphicai view 



Fe^tu 




1 -20 



21 -431 



21 - 177 



20 I Ref.12 II Ref.14 | 



411 I Urokinase-type 

I plasminogen 
activator 



PRO 



i 



• Chain 



156-177 





! 

1 






Chain 


179- 



431 



157 I Urokinase-type 
plasminogen 
{ activator long chain 

? A 

22 I Urokinase-type 
plasminogen 
activator short 
chain A 



PRO 



I 



II 



PRO 



253 I Urokinase-type 
plasminogen 
activator chain B 



t-- 



PRO 



http://www.uniprot.org/uniprot/P00749 
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Regions 



H 


Domain 


27-63 


37 


EGF-like \ 1 1 








lJkjI T Idil 1 


70—1*51 


82 


Kringie \ 1 1 






M 


Domain 


179-424 


246 


Peptidase SI | 1 1 






m 


Region 

i 

< 


34-57 


24 

i 

i 


Binds urokinase 1 
plasminogen | 
activator surface 
receptor ^ 

I Bv simiiaritv i 


II 

1 






Site; 

□ 


Region ! 
Active site 


152-177 
224 


26 
1 


Connecting peptide ! 

Charge relay 
system 


II 

1 






□ 


Active site 


275 


1 


Charge relay 
system 


1 






"in" 


Active site 


376 


1 


Charge relay 
system 


1 






H 

Ami 

□ 


Site 

no acid modifications 
Modified residue 


177-178 
158 

L„ . . ... 


2 
1 


Cleavage; during 

zymogen activation 

Phosphoserine 

1 Ref.17 1 




1 

1 








Modified residue 


323 


1 


Phosphoserine 1 

1 ReM7 1 ; 


1 


□ 


Glycosylation 


38 


1 


O-linked (Fuc) \ I 

i Ref.12 1 








1 nation 

o 1 y c^uo y 1 diiui 1 


322 


1 


N-linked j 1 

(GlcNAc.) ! 


CA 




p''"'| 


Disulfide bond 


31 ^ 39 




- ri 






□ 


Disulfide bond 


33<^51 




i II 

1 ... ^ 




— 


□ 


Disulfide bond 


53 * — * 62 




; 1 








Disulfide bond 


70^ 151 






1 1 1 






Disulfide bond 


— — 

91 ^ 133 






1 1 




— 


□ 


Disulfide bond 


122^ 146 






II 




□ 


Disulfide bond 


168^299 




Interchain 
(between A and B 
chains) 


1 1 






□ 


Disulfide bond ' 209 ^ 225 


! 


II ! 




□ 


Disulfide bond 217^288 j ; II 


t 




□ 


Disulfide bond ' 31 3 <^ 382 ! 

> 1 




1 1 






□ i Disulfide bond \ 345 361 : 

1 1 ... _. ....1 . 


1 







http://www.uniprot.org/uniprot/P00749 
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Disulfide bond 



Natural variations 



372 400 



Alternative sequence 



I 



1 -29 



□ 



□ 



□ 



Natural variant 



Natural variant 



Natural variant 



Natural variant 



Experimental info 



Mutagenesis 




Mutagenesis 



□ 



Sequence conflict 



Sequence conflict 



Sequence conflict 



I 



15 



141 



214 



231 



158 



323 



150 



□ Sequence conflict 



151 



386 i 



430 i 



29 ! MRALL...HQVPS 

MVFHLRTRYEQA 
in isoform 2. 



1 [v-^L 
i [dbSN P:rs2227580] 



1 



1 



1 



[ dbSNP:rs222 7564] 

Ref.7 1 1 Ref.8 { 



Ref.11 Ref.27 



I Ref.29 1 1 Ref.30 | 



M 



[dbSN P:rs2227567] 



1 I S — > E: Abolishes 
phosphorylation, 
proadhesive 
function and ability 
to induce 
chemotactic 
response; when 
associated with E- 



1 



39!^ I Ref.17 I 



S -> E: Abolishes 
j phosphorylation, 
proadhesive 
function and ability 
to induce 
chemotactic 
response; when 
asso ciated w ith E- 

1fiR I Ref.17 I 



1 



D G in 
BAG60754. i ^e^-^ 



1 C -> W in 
1 CAA26535. 

1 G C in 

CAA26535. fSifTI 



1 j A V in 



I 



CAA26535. i 2 



http://www.uniprot.org/uniprot/P00749 
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Secondary structure 

1 . 

I Helix I Strand | Turn 

Details... 

Sequences 



Sequence Length Mass (Da) 



Isoform 1 [UniParc]. FASTA 431 48.507 

Last modified July 28, 2009. Version 2. 
Checksum: 62C72400BC231 1 5F 



j 1£ 20 30 4£ 50 60 
MRALIARLLL CVLWSDSKG SNEbHQVPSN CDCLNGGTCV SNKYFSNIHW CNCPKKFGGQ 

70 80 90 100 110 12 0 

HCEIDKSKTC YEGNGHFYRG KASTDTMGRP CLPWNSATVL QQTYHAHRSD ALQLGLGKHN 

i 
i 

} 130 140 150 160 170 180 

j YCRNPDNRRR PWCYVQVGLK PLVQECMVHD CADGKKPSSP PEELKFQCGQ KTLRPRFKII 

i 19^ 200 210 220 230 240^ 

I GGEFTTIENQ PWFAAIYRRH RGGSVTYVCG GSLISPCWVI SATHCFIDYP KKEDYIVYIjG 



RSRliNSNTQG EMKFEVENLI LHKDYSADTL AHHNDIALLK IRSKEGRCAQ PSRTIQTICL 

310 320 330 340 350 3G0 

PSMYNDPQFG TSCEITGFGK ENSTDYLYPE QLKMTWKIil SHRECQQPHY YGSEVTTKML 

210 380 390 4O0I 410 420^ 

CAADPQWKTD SCQGDSGGPL VCSLQGRMTL TGIVSWGRGC ALKDKPGVYT RVSHFLPWIR 



t 

i 430 
SHTKEENGLA L 



« Hide 
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tiniry nisiory 


Integrated into July 21, 1986 
UniProtKB/SwIss- 

Prot: 

Last sequence July 28, 2009 
update: 

Last modified: April 5. 201 1 

This is version 158 of tlie entry and version 2 of the sequence. 
[Complete history] 


Entry status 


Reviewed (UniProtKB/Swiss-Prot) 


Annotation program 


Chordata Protein Annotation Program 


Disclaimer 


Any medical or genetic information present in this entry is 
provided for research, educational and informational purposes 
only. It is not in any way intended to be used as a substitute for 
professional medical advice, diagnosis, treatment or care. 


Relevant documents 





Human chromosome 10 

Human chromosome 10: entries, gene names and cross-references to MIM 



Human entries with polymorphisms or disease mutations 

List of human entries with polymorphisms or disease mutations 



Human polymorphisms and disease mutations 

Index of human polymorphisms and disease mutations 

MIM cross-references 

Online Mendelian Inheritance in Man (MIM) cross-references in UniProtKB/Swiss-Prot 



PDB cross-references 

Index of Protein Data Bank (PDB) cross-references 
Peptidase families 

Classification of peptidase families and list of entries 



SIMILARITY comments 

Index of protein domains and families 



© 2002-2011 UniProt Consortium | License & Disclaimer | Contact 
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Protein 

Translations of Life 



Display Settings: GenPept 

pro-urokinase [Homo sapiens] 

GenBanlc AAA61 253.1 



Goto: 
LOCUS 

DEFINITION 

ACCESSION 

VERSION 

DBSOURCE 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 

TITLE 

JOURNAL 
COMMENT 



FEATURES 

source 



AAA61253 431 aa 

pro-urokinase IHotno sapiens] . 
AAA612S3 

AAA61253.1 GI: 340160 

locus HUMUKPM accession M15476.: 



linear PRI 14 -JAN- 19 95 



Protein 



si g _ peptide 



mat peptide 



Region 



Site 



Site 



Region 



Site 



Site 



Sice 



CDS 



ORIGIN 



Homo sapiens (human) 

Homo sapienB 

Eukaryota; Metazoa,- Chordata; Craniata; Vertebrata; Buteleostomi ; 
Mammalia; Eutheria; Buarchontoglires; Primates; Haplorrhini; 
Catarrhini; Hominidae; Homo. 
1 (residues l to 431) 

Holmes«w.E., Pennica,D., Blaber,M., Rey,M.w., Guenzler,w .A. , 
Stef fens, G.J. and Heyneker , H . L . 

Cloning and expression o£ the gene for pro-urokinase in Escherichia 
coli 

Biotechnology (N.Y.) 3, 923-929 (1985) 

Clean copy of sequence [Bio/Technology 3, 923-929 (1985)] kindly 
provided by G.Vehar 
(30-MAY-1986) . 

Method: conceptual translation. 
Loca t ion/Qual i f ier s 
1. .431 

/ organ ism* "Homo sapiens" 
/db_xre f = " taxon : 9606 " 
/mapa " I0q24 -qter " 
1. .431 

/ name = "pro-urokinase " 

1 . . 20 

/gene="PLAU" 

/note" "urokinase signal peptide; GOO-119-497" 
21. .431 
/gene="PLAU" 

/product '"urokinase; 000-119-497" 
67. . 152 

/ r eg i on_name = " KR " 

/note »"Kr ingle domain; Kringle domains are believed to 
play a role in binding mediators, such as peptides, other 
proteins, membranes, or phospholipids. They are autonomous 
structural domains, found in a varying number of copies, 
in blood clotting and.,.; cdOOlOS" 
/ db_xre f CDD : 29008 " 
79 

/ si te_type'» " other " 

/note= "putative domain interaction site" 
/db xref="CDD! 2 9008 " 
order (94,104, 132, 134, 143) 
/ si te_type" " other " 
/note="ligand binding site" 
/ db_xre f - " CDD : 29008 " 
179. .422 

/ region_name = "Tryp_SPc " 

/note= "Trypsin- like serine protease; Many of these are 
synthesized as inactive precursor zymogens that are 
cleaved during limited proteolysis to generate their 
active forms. Alignment contains also inactive enzymes 
that have substitutions of the catalytic...; cd00190" 
/db xref=*CDD; 29152 " 
179 

/site_type= "cleavage" 
/ db_xre f = "CDD : 29152 " 
order(224,275,376) 
/si te_type= " ac t ive " 
/ db_xre f =" CDD : 29152 " 
order(370,395,397) 
/si te_type= " other " 
/note» "substrate binding sites" 
/db_xre f - " CDD : 29152 " 
1. .431 

/geneowpLAU" 

/C0ded_by="MlS4 76.1:77. .1372" 
/ db_xr e f - " GDB : G 0 0 - I 1 9 - 4 9 7 ■ 



// 



1 mrallarlll cvlwsdskg snelhqvpsn cdclnggtcv snkyfsnihw cncpkkfggq 
61 hceidksktc yegnghfyrg kastdtmgrp clpwnsatvl qqtyhahrsd alqlglgkhn 
121 ycrnpdnrrr pwcyvqvglk plvqecmvhd cadgkkpssp peelkfqcgq ktlrprfkii 
181 ggefttienq pwfaaiyrrh rggsvtyvcg gslispcwvi sathcfidyp kkedyivylg 
241 rsrlnsntqg emkfevenli Ihkdysadtl ahhndiallk irskegrcaq psrtiqticl 
301 psmyndpqfg tsceitgfgk enstdylype qlkmtwkli shrecqqphy ygsevttlanl 
361 caadpqwktd scqgdsggpl vcslqgrmtl tgivswgrgc alkdkpgvyt rvshflpwir 
421 shtkeengla 1 



http://www.ncbi.nlm.nih.gov/protein/AAA6 1 253 
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Protein 

Translations of Life 



Display Settings: GenPept 

unnamed protein product [Homo sapiens] 



GenBank: CAA26535.1 

FASTA Graphtes 



Gcto: 
LOCUS 

DEFINITION 

ACCESSION 

VERSION 

DBSOURCE 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 

TITLE 

JOURNAL 
PDBMED 
COMMENT 
FEATURES 

source 



Protein 



CAA26535 

unnamed protein product 
CAA26535 

CAA26535.1 GI:35298 
embl accession X02760.1 



431 aa 

(Homo sapiens] 



linear PRI 30-MAR-1995 



(htiman) 



sig peptide 
Region 



Site 



Site 



Region 



Site 



Site 



Site 



CDS 



Homo sapiens 

Homo sapiena 

Bukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mcumnalia; Butheria; Euarchontoglires; Primates; Haplorrhini; 
Catarrhini; Hominidae; Homo. 
1 (residues 1 to 431) 

•Jacobs, P., Cravador,A., Loriau^R., Brockly, P., Colau,B., 

Chuchana,P., van El sen, A., Hersog^A. and Bollen,A. 

Molecular cloning, sequencing, and eaqpression in Escherichia coli 

of human preprourokinase cDNA 

DNA 4 (2), 139-146 (1985) 

3888571 

Data kindly reviewed (5-DBC-1985) } by A.Bollen. 
Locat ion/Qual i £ iers 
1. .431 

/organism="Homo sapiens" 
/db_xre f «» " taxon ; 9606 " 
1..431 

/names "unnamed protein product" 
1. .20 

/noteB"8ignal peptide (aa -20 to -1) " 

67 . . 149 

/ r e g i on^name = ** KR " 

/notes"Kringle domain,- Kringle domains are believed to 
play a role in binding mediators, such as peptides « other 
proteins, membranes, or phospholipids. They are autonomous 
structural domains, found in a varying number of copies, 
in blood clotting and...; cdOOlOS" 
/ db_xre f »" CDD : 29008 " 
79 

/ s i t e_ t ype ■ " o t he r " 

/note= "putative domain interaction site" 
/ db_xre f =" ODD : 29008 " 
order {94, 104,132, 134, 143) 
/ s i te_t ype »" other " 
/note="ligand binding site" 
/ db_xre f CDD : 29008 " 
179. .422 

/ r e g i on_name = " Tryp_S Pc " 

/note= "Trypsin- like serine protease; Many of these are 
synthesized as inactive precursor zymogens that are 
cleaved during limited proteolysis to generate their 
active forms. Alignment contains also inactive enzymes 
that have siibstitutions of the catalytic...; cd00190" 
/db xreff-"CDD; 29152 " 
179 

/ s i t e_t ype = " c 1 eavage " 
/db_xre f » " CDD : 29152 " 
order (224, 275,376} 
/si te_t ype o" active" 
/ db_xre f =" CDD ; 29152 - 
order(370,395,397) 
/si te_type=" other " 
/notes "substrate binding sites" 
/db_xre f » " CDD : 29152 " 
1. .431 

/coded_bye"X027 60. 1:41. . 1336" 
/notes "urokinase precursor" 
/db_xref»''GDB: 119497" 
/db_xr e f = " GOA : P00749 " 
/db_xre f = " HGNC : 9052 " 
/db_xre f » " Inter Pro : IPROOOOOl • 
/db_xre f = " InterPro ; IPR000742 " 
/db_xre f = " InterPro: I PROP 1254 " 
/db__xre f » " InterPro: I PRQO 13 14 " 
/db xref -"InterPro; IPR006210 " 
/db_xre f = " InterPro ; IPR009003 " 
/ db_xre f » " InterPro : IPR013032 " 
/db_xre f - • InterPro : IPR013S06 " 
/db_xre f = " InterPro : IPR018056 " 
/db 
/db 
/db 
/db 
/db 
/db 

/db^ 

/db xret»"PPB; 1F5L " 
/db_xre f - " PDB ; 1F92 " 
/db'"xref°"PPB; 1FV9 " 
/ db_x r e f = " PDB : 1GI7 " 
/db_xre£ 



xre f o " InterPro : IPR018114 " 
xref""PDB: 1C5W* 
xrefo"PDB; 1C5X " 
xre f = " PDB ; 
xref" "PDB: 
xref* "PDB I 
xref ="PDB ; 



1C5Y " 
1C5Z " 
lEJN " 
1F5K" 



'PDB: 1GI8" 



http://www.ncbi.nlm.nih.gov/protein/CAA26535 
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ORIGIN 



/db_xre £ = " PDB : 1GI9 " 
/db_xre £ » " PDB : 1GJ7 " 
/db xref-"PDB; IGJS * 
/ db_xr e f o " PDB : 1GJ9 " 
/ db_xr e f = " PDB : IGJA " 
/db_xre f » " PDB : IGJB " 
/db~xre f «» " PDB : 1C3JC " 
/ db"xr e f = " PDB : IGOD " 
/c3b_xre f = " PDB : IKDU •» 
/db_xre f - " PDB : ILMW " 
/db_xre f » " PDB : 103P " 
/c3b_xre f = " PDB : 105A " 
/db xref °"PDB; 105B " 
/db~xre f - " PDB : 105C " 
/db_xre f = " PDB iowd " 
/db xre £°" PDB; lows " 
/db xref-^PPB; lOWH " 
/db_xre f = " PDB : IQWI " 
/db~xre f » " PDB : lOWj " 
/db_xre f = " PDB : lOWK " 
/ db~xr e f » " PDB : 1SC8 " 
/db_xref «''PDB: ISQA " 
/db_xr e f = " PDB : ISQO " 
/db_xref ="PDB : ISQT" 
/db xref =" PDB; 1U6Q " 
/db_xre f = " PDB : lURK " 
/db_xre f = " PDB : 1VJ9 " 
/ db_xr e f » " PDB : IVJA " 
/db_xre f = " PDB : IWOZ " 
/db__xre f = " PDB : IWlQ " 
/db_xre f - " PDB : iwii " 
/db xref o" PDB: 1W12 " 
/db xrefo"PDB; lW13 " 
/db_xre f » " PDB : 1*04" 
/db_xre f - " PDB : 2FD6 • 
/ db_xr e f = " PDB : 2I9A " 
/db_xre f » " PDB : 2I9B " 
/db_xre f - " PDB : 2SWN " 
/db xref °" PDB; 208T " 
/db_xre f = " PDB : 208U " 
/db_ xref «" PDB; 203W " 
/ db~xr e f = " PDB : 2R2W " 
/db_ xref ="PDB: 2VIN " 
/ db_xr e f = " PDB : 2VIO " 
/db xref -"PDB; 2 VIP " 
/ db_xr e f - " PDB : 2VTQ " 
/db_xre f = " PDB : 2VIV " 
/db_xre f - " PDB : 2VIW " 
/db xref =" PDB; 2VNT " 
/db_xre f « " PDB : 3BT1 " 
/db_xre f = " PDB : 3BT2 " 
/ db_xre f - " PDB : 3IG6 " 
/db_xre f = " PDB ; 3KGP " 
/db_xre f = " pdb : 3khv " 
/db_xr e f - " PDB : 3Kib " 
/db xref ="PDB; 3M61 " 
/db xref =" PDB; 3MHW " 

/db_xref -"UniProtKB/Swiss- Prot : POOT^g" 

1 mrallarlll cvlwsdskg snelhqvpsn cdclnggtcv snkyfsnihw cncpkkfggq 

61 hceidksktc yegnghfyrg kastdtmgrp clpwnsatvl qqtyhahrsd alqlglgkhn 

121 ycrnpdnrrr pwcyvqvglk plvqecmvhd wadgkkpsBp peelkfqcgq ktlrprfkii 

181 ggefttienq pwfaaiyrrh rggsvtyvcg gslispcwvi sathcfidyp kkedyivylg 

241 rsrlnsntqg emkfevenli Ihkdysadtl ahhndiallk irskegrcaq psrtiqticl 

301 psmyndpqfg tsceitgfgk enstdylype qlkmtwkll shrecqqphy ygsevttkml 

3 61 caadpqwktd scqgdsggpl vcslqcrmtl tgivswgrgc alkdkpgvyt rvshflpwir 

421 shtkeenglv 1 



http://www.ncbi.nlm.nih.gov/protein/CAA26535 
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. Protein 

Translations of Life 



Oi^lay Settings: GanPepI 

preprourokinase [Homo sapiens] 

GenBanlc AAC07138.1 



csoto: 

LOCUS 

DEFINITION 

ACCESSION 

VERSION 

DBSOURCB 

KEYWORDS 

SODRCE 

ORGANISM 



REFERENCE 
AOTHORS 

TITLE 
JOURNAL 
PUBMED 

COMMENT 

FEATURES 

source 



Protein 

siq peptide 
mac peptide 

Region 



Site 



Site 



mat peptide 
mat peptide 
Reg i oil 



AAC97138 431 aa linear PRI 18-DEC'1998 

preprourokinase I Homo sapiens] . 

AAC97138 

AAC97138.1 GI:340158 

locus HUMUKMl accession K03226 . 1 

Homo sapiens (human) 

Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; 
Catarrhini; Hominidae; Homo. 
1 (residues 1 to 431) 

Nagai,M., Hiramatsu, R . , Kaneda^T. , Hayasiike,N. , Arimura.H., 
Nishida.M. and Suyama,T. 

Molecular cloning of cDNA coding for human preprourokinase 

Gene 36 (1-2), 183-188 (1985) 

2415429 

Method: conceptual translation. 
Location/Qualifiers 
1. .431 

/organisms "Homo sapiens" 
/db_xr e f «» " t axon : 9606 " 
/ chromosome s « i q « 
/map- " 1 Oq2 4 - qt e r " 
1. .431 

/name- "preprourokinase" 
1. .20 
21. .177 

/product -"urokinase A chain** 
67.. 152 

/ r eg i on_name ■ " KR " 

/note="Kringle domain; Kringle domains are believed to 
play a role in binding mediators, such as peptides, other 
proteins, membranes, or phospholipids. They are autonomous 
structural domains, found in a varying number of copies, 
in blood clotting and...; cdOOlOS" 
/db_xre f - " CDD : 29008 •» 
79 

/site_type= "other " 

/note- "putative domain interaction site" 
/db_xre f - " CDD I 29008 " 
order (94 , 104 , 132, 134, 143) 
/site_type= "other " 
/note-'ligand binding site" 
/db xrefo "ODD: 29008 " 
156. . 176 



Site 



Site 



Site 



CDS 



ORIGIN 



/product- "urokinase Al chain" 
179. .431 

/product -"urokinase B chain" 
179.. 422 

/ region_name = " Tryp_SPc " 

/note- "Trypsin- like serine protease; Many of these are 
synthesized as inactive precursor zymogens that are 
cleaved during limited proteolysis to generate their 
active forms. Alignment contains also inactive enzymes 
that have substitutions of the catalytic...; cd00190* 
/db_xref- " CDD : 29152 " 
179 

/ si te_type= "cleavage " 

/db_xref-"CDD; 29X52 " 

order (224, 275, 376) 

/si te_type= "active " 

/db_xre f - " CDD : 29152 * 

order (370, 3 95, 397) 

/ si te_type» " other • 

/notes "substrate binding sites" 

/db_xre f = " CDD : 29152 " 

1. ,431 

/gene- "PLAU" 

/coded_by-''K03 226 .1:81, .1376" 
/db xref ="GDB:G00-119-497" 



// 



1 mrallarlll cvlwsdskg snelhqvpsn cdclnggtcv snkyfsnihw cncpkkfggq 
61 hceidksktc yegnghfyrg kastdtmgrp clpwnsatvl qqtyhahrsd alqlglgkhn 
121 ycrnpdnrrr pwcyvqvglk plvqecmvhd cadgkkpssp peelkfqcgq ktlrprfkii 
181 ggefttienq pwfaaiyrrh rggsvtyvcg gslispcwvi sathcfidyp kkedyivylg 
241 rsrlnsntqg emkfevenli Ihkdysadtl ahhndiallk irskegrcaq psrtiqticl 
301 psmyndpqfg tsceitgfgk enstdylype ql)cmtwkli shrecqqphy ygsevttkml 
361 caadpqwktd scqgdsggpl vcslqgrmtl tgivs*igrgc alkdkpgvyt rvshflpwir 
421 shtkeengla 1 



http://www.ncbi.nIm.nih.gov/protein/AAC97138 



4/12/201 



Urokinase-type plasminog^jpctivator precursor - Mus musculus (Mo^j^ 
P06869 (UROK_MOUSE) ^ Reviewed, UniProtKB/Swiss-Prot 

Last modified April 5, 2011. Version 1 14. 
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Names and origin 


Protein names 


! 

Hecotn/venaea nstriQ, 
1 Urokinase-type plasminogen activator 

Short name=U-plasmlnogen activator 
Qhnrt nam^^iiPA 

EC=3.4.21.73 

Cleaved into the followina 3 chains: 

1 . Urokinase-type plasminogen activator iong cttain 
A 

2. Uroiiinase-type plasminogen activator short chain 
A 

3. Urokinase-type plasminogen activator chain B 


Gene names 


NameiPlau 


L/ryamsm 


mUS mUSCUIUS (MOUSe; 


Taxonomic identifier 


10090 [NCBI] 


Taxonomic lineage 


Eukaryota > Metazoa > Chordata > Craniata > Vertebrate > 
Euteleostomi > Mammalia > Eutheria > Euarchontoglires > Glires 

> Rodentia > Sciurognathi > Muroidea > Muridae > Murinae > Mus 

> Mus 


Protein attributes 




Sequence length 


433 AA. 


Sequence status j Complete. 


Sequence processing 


The displayed sequence is further processed into a mature 
form. 


Protein existence 


Evidence at protein level. 


General annotation (Comments) 


Function 


Specifically cleave the zymogen plasminogen to form the active 
enzyme plasmin. 


Catalytic activity 


Specific cleavage of Arg-|-Val bond in plasminogen to form 
plasmin. 


Subunit structure 


Found in high and low molecular mass forms. Each consists of 



Subcellular location 
Sequence similarities 



two chains, A and B. The high molecular mass form contains a 
long chain A which is cleaved to yield a short chain A. Binds 
I LRP1B; binding is followed by internalizatio n and degra dation. 
I Interacts with MRC2. Interacts with PLAUR I By similarity 1 

■ Secreted. 

j Belongs to the peptidase S1 family, 
j Contains 1 EGF-iike domain. 

i 

I Contains 1 kringle domain. 



http://ww\v.uniprot.org/uniprot/P06869 
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Ontologies 



Contains 1 peptidase S1 domain. 



Keywords 



Biological process 


Plasminogen activation 


Cellular component 


Secreted 


Domain 


EGF-like domain 

Kringle 

Signal 


Molecular function 


Hydrolase 
Protease 
Serine protease 


PTM 


Disulfide bond 
Zymogen 


Technical term 


3D-structure 


Gene Ontology (GO) 




Biological process 


proteolysis 

Inferred from electronic annotation. Source: InterPro 

regulation of cell proliferation 

Inferred from genetic interaction. Source: MGI 

response to hypoxia 

Inferred from mutant phenotype. Source: MGI 

smooth muscle cell migration 

Inferred from mutant phenotype. Source: MGI 


Cellular component 


extracellular region 

Inferred from electronic annotation. Source: UniProtKB-SubCell 


Molecular function 


serine-type endopeptidase activity 

Inferred from electronic annotation. Source: UniProtKB-KW 


Complete GO annotation... 


• 



Sequence annotation (Features) 



Feature key Position(s) Length j Description Graphical view 



Feature identifien 



Molecule processing 
Signal peptide 



1 -20 



20 



Potential I 



Chain 



21-433 



413 



Chain 



Urokinase- 
type 

plasminogen 
activator 



PRO 0000028322 



21 - 178 \ 158 



Urokinase- 
type 



PRO 0000028323 



http://www.uniprot.org/uniprot/P06869 
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Chain 


157-178 


22 


plasminogen 
activator 
long chain A 

1 By similarity | 

Urokinase- 
type 

plasminogen 
activator 
short chain 
A 

1 By similarity | 


1 


PRO_00000283i 


:4 


M 

Reg 


Chain 

ions 
Domain 


180-433 
28-64 


254 
37 


Urokinase- 
type 

plasminogen 
activator 
chain B 

By similarity j 

EGF-like 


1 

1 1 


PRO_00000283: 


15 




Domain 


71-152 


82 


Kringle 


1 1 








Domain 


180-426 


247 


Peptidase 
S1 


1 1 







Region 


35-68 


24 


Binds 

urokinase 

plasminogen 

activator 

surface 

receptor 

1 By similarity 






— 
— 


s 

Site! 

□ 


Region 
Active site 


153-179 
226 




27 

• 

1 


Connecting 
peptide 

Charge 
relay system 


^ 

1 


„ 


□ 


Active site 


277 


1 


Charge 
relay system 


1 


1 

1 

1. r ■., , n , 


ED 

Amii 


Active site 

no acid modifier 
Disulfide bond 


378 

itions 

32 40 


1 


Charge 
relay system 


, 1 1 I 1. >i> ■■ I.. 

1 

II 
1 


. 




LJ 


Disulfide bond 


34 ^ 52 




Rei.3 1 


11 


— . 






Disulfide bond 


54 63 






II 
II 






□ 


Disulfide bond 


71 ^ 152 

■"■ i-i— — 1 - — . 


j 

i 


Ref.3 1 


1 1 


1 


□ 


Disulfide bond 


92*- 134 




Ref.3 1 


1 1 






Disulfide bond 


123 147 




Ref.3 ' 


II 


1 





http://www.uniprot.org/uniprot/P06869 
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□ 


Disulfide bond 


169^ 301 




Interchain 
(between A 
and B 

chains) 

1 By simiiarity | 


1 1 


■ — 




F" " '1 
[_j 


Disulfide bond 


21 1 227 




: — . : 1 

By similarity | 


II 




l_J 


Disulfide bond 


219 290 




By Similarity 


1 1 








Disulfide bond 


315*->384 




By simiiarity j 


1 1 






B 


Disulfide bond 


347 ^ 363 




By similarity | 


II 






E 


Disulfide bond 


374 ^ 402 
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^Ijjp^tivator precursor - Oryctolagus cuniculj|||^ 



Q8MHY7 (UROK_RABIT)# Reviewed, UniProtKB/Swiss-Prot 

Last modified April 5, 2011. Version 54. 



Names and origin 



Protein names 



Recommended name: 
Urokinase-type plasminogen activator 

Short name=U-piasminogen activator 

Short name=uPA 

EC=3.4.21.73 

Cleaved into the foilowina 3 chains: 

1 . Urokinase'type plasminogen activator long chain 
A 

2 . Urokinase-type plasminogen activator short chain 
A 

3. Urokinase-type plasminogen activator chain B 







Gene names 


NamerPLAU 


Organism 


Oryctolagus cuniculus (Rabbit) 


Taxonomic Identifier 


9986 [NCBI] 


Taxonomic lineage 


Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > 
Euteleostomi > Mammalia > Eutheria > Euarchontoglires > Glires 
> Lagomorpha > Leporidae > Oryctolagus 


Protein attributes 




Sequence length 


433 AA. 


Sequence status 


Complete. 


Sequence processing 


The displayed sequence is further processed into a mature 
form. 


Protein existence 


Evidence at transcript level. 



General annotation (Comments) 



Function 1 Specifically cleave the zymogen plasminogen to form the active 

j enzyme plasmin. 

Catalytic activity I Specific cleavage of Arg-|-Val bond in plasminogen to form 





plasmin. 


- ■"" ' - »- — , I.,,, ....■„..lll ~^ 

Subunit structure 


Found in high and low molecular mass forms. Each consists of 
two chains, A and B. The high molecular mass form contains a 
long chain A which is cleaved to yield a short chain A. Binds 
LRP1B; binding is followed by internalization and degradation. 
Interacts with MRC2. Interacts with PLAl JR 1 By similarity |. 


Subcellular location 


.qAnretftd 1 By similarity I 


Post-translational modification 


Phosphorylation of Ser-325 abolishes proadhesive ability but . 

Hno« nnt intf»rFArA with reroptnr hinriing 1 By similarity I 


Sequence similarities 


Belongs to the peptidase SI family. 
Contains 1 EGF-like domain. 
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Contains 1 kringle domain. 
Contains 1 peptidase 81 domain. 
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Biological process 
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Inferred from electronic annotation. 


Source: InterPro 


Cellular component 
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extracellular region 

Inferred from electronic annotation. 


Source: UnlProtKB-SubCell 
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Inferred from electronic annotation. 


Source: UniProtKB--KW 
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BRENDA 
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Family and domain databases 
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Entry name 


UROK_RABIT 


Accession 


Primary (citable) accession number: Q8IViHY7 
Secondary accession number(s): Q8MIL0 


Entry history 


Integrated Into May 1,2007 
UniProtKB/Swiss- 

Prot: 

Last sequence October 1 . 2002 
update: 

Last modified: April 5. 201 1 

This is version 54 of the entry and version 1 of the sequence. 
[Complete history] 
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Reviewed (UniProtKB/Swiss-Prot) 
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Names and origin 



Protein names 



Recommended name: 
Urokinase-type plasminogen activator 

Short name=U-plasminogen activator 

Short name=uPA 
EC=3.4.21.73 

Cleaved into the following 3 chains: 

1 . Uroklnase'type plasminogen activator long chain 
A 

2. Urokinase-type plasminogen activator short chain 
A 

3. Uroklnase'type plasminogen activator chain B 



Gene names 
Organism 



Name:Plau 



Rattus norvegicus (Rat) 



Taxonomic Identifier 
Taxonomic lineage 



10116 [NCBI] 



Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > 
Euteleostomi > Mammalia > Eutheria > Euarchontoglires > Giires 
> Rodentia > Sciurognathi > Muroidea > Muridae > Murinae > 
Rattus 



Protein attributes 


Sequence length 


432 AA. 


Sequence status 


Complete. 


Sequence processing 


The displayed sequence is further processed into a mature 
form. 


Protein existence 


Evidence at protein level. 



General annotation (Comments) 



Function 



Catalytic activity 



Subunit structure 



Subcellular location 



Specifically cleave the zymogen plasminogen to form the active 
enzyme plasmin. 

Specific cleavage of Arg-|-Val bond in plasminogen to form 
plasmin. 

Found in high and low molecular mass forms. Each consists of 
two chains, A and B. The high molecular mass form contains a 
long chain A which is cleaved to yield a short chain A. Binds 
LRP1B; binding is followed by internalizatio n and degra dation. 
Interacts with MRC2. Interacts with PLAUR I By similarity 1 



Secreted I By similarity I. 



Sequence similarities 



Belongs to the peptidase SI family. 
Contains 1 EGF-like domain. 
Contains 1 kringle domain. 
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Contains 1 peptidase S1 domain. 


Ontologies 


• 


Keywords 




BiolOQical process 


Plasminoaen activation 


Ceilular component 


Secreted 


Domain 


EGF-iike domain 

Kringle 

Signal 


Molecular function 


Hvdrolase 
Protease 
oerine proiease 


PTM 


Disulfide bond 
Zymogen 


Gene Ontology (GO) 




Biological process 


angiogenesis 

Inferred from mutant phenotype. Source: ROD 
chemotaxis 

Traceable author statement. Source: RGD 
embryo implantation 

Inferred from expression pattern. Source: RGD 
proteolysis 

Traceable author statement. Source: RGD 
response to hyperoxia 

Inferred from expression pattern. Source: RGD 
signal transduction 

Traceable author statement. Source: RGD 

skeletal muscle tissue regeneration 

Inferred from mutant phenotype. Source: RGD 


Cellular component 


extracellular region 

Inferred from electronic annotation. Source: UniProtKB-SubCell 


Molecular function 


serine-type endopeptidase activity 

Inferred from electronic annotation. Source: UniProtKB-KW 



Complete GO annotation. 



Sequence annotation (Features) 



Feature key j Position(s) j Length j Description j Graphical view 



I Feature identlfie 



Molecule processing 

Signal peptide 



1 - 19 



19 



Potential 
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Q05589 (UROK_BOVIN)ii^ Reviewed, UniProtKB/Swiss-Prot 

Last modified April 5, 201 1 . Version 97. 



Names and origin 


nrotein names 


r<Qconifn&nci6a name. 
1 Urokinase-type plasminogen activator 

> 

Short name=U-ptasniinogen activator 
I Short nampsiiPA 

1 EC=3.4.21.73 

f 

i Cleaved into the followina 3 chains: 

! 1 . UrokinasO'type plasminogen activator long chain 
A 

2 . Urokinase-type plasminogen activator stiort chain 
A 

3. Urokinase-type plasminogen activator chain B 


Gene names 


NamePLAU 


Organism 


Bos taurus (Bovine) 


Taxonomic identifier 


9913 [NCBI] 


Taxonomic lineage 
Protein attributes 


Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > 
Euteleostomi > Mammalia > Eutheria > Laurasiatheria > 
Cetartiodactyla > Ruminantia > Pecora > Bovidae > Bovinae > 
Bos 


Sequence length j 433 AA. 


Sequence status j Complete. 


Sequence processing The displayed sequence is further processed into a mature 

j form. 


Protein existence j Evidence at transcript level. 

General annotation (Comments) 


Function 1 


Specifically cleave the zymogen plasminogen to form the active 
enzyme plasmin. 


Catalytic activity 


Specific cleavage of Arg-|-Val bond in plasminogen to form 
plasmin. 


1 III 1 J 

Subunit structure 


Found in high and low molecular mass forms. Each consists of 
two chains, A and B, The high molecular mass form contains a 
long chain A which is cleaved to yield a short chain A. Binds 
LRP1B; binding is followed by internalization and degradation. 

Intf^mct^ with MRn9 lnt«^rart.Q wifh PI Al JR | By simiiarity | 


Subcellular location 


Secreted. 


Induction 

Sequence similarities 


By retinoic acid. 

Belongs to the peptidase S1 family. 
Contains 1 EGF-like domain. 
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P16227 (UROK_PAPCY)* Reviewed. UniProtKB/Swiss-Prot 

Last modified January 11. 2011. Version 92. 



Names and origin 


r^roiein narnes 


rsGCOfflfUGnuGu nalTIG. 

Urokinase-type plasminogen activator 

Short name=U-plasminogen activator 

Short name=uPA 
EC=3.4.21.73 

Cleaved into the followina 3 chains: 

1 . Urokinase-type plasminogen activator iong chain 
A 

2. Uroiiinase-type plasminogen activator short chain 
A 

3. Urokinase-type plasminogen activator chain B 


Gene names 


NameiPLAU 


yCll 1191 1 1 


r^cipii/ 1101^6 pn all |j9 | it?iiow uoiijwii/ 


1 axonomic lueniiTier 


9000 [NOblJ 


Taxonomic lineage 


Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > 
Euteleostomi > Mammalia > Eutheria > Euarchontoglires > 
Primates > Haplorrhini > Catarrhini > Cercopithecidae > 
oercopithecinae > Papio 


Protein attributes 




Sequence length 

,^.._,„..^.,^,^,.^,„^^,,,^„,,,„„„ ^„„.^^^„ .„„^, .J. „ „„„. ■««,||,M,| * 


433 AA. 


: 

Sequence status 


Complete. 


Sequence processing 

i 


The displayed sequence is further processed into a mature 
form. 


Protein existence 


Evidence at transcript level. 


General annotation (Comments) 


Function 


Specifically cleave the zymogen plasminogen to form the active 
enzyme plasmin. 


Catalytic activity 

i 


Specific cleavage of Arg-|>Vat bond in plasminogen to form 
plasmin. 


Subunit structure \ 

f 


Found in high and low molecular mass fomis. Each consists of 
two chains, A and B. The high molecular mass form contains a 
long chain A which is cleaved to yield a short chain A. Binds 
LRP1B; binding is followed by internalization and degradation. 

Int^arartR with MRC? Interacts with Pl Al lR 1 By similarity | 


Subcellular location 


Secreted. 


Post-translational modification 
Sequence similarities 


Phosphorylation of Ser>157 and Ser-325 abolishes proadhesive 

ahility hut ririAS not int^rforc^ with rf^rAptnr hinHing 1 By similarity 1 

Belongs to the peptidase SI family. 
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Contains 1 EGF^Iike domain. 
Contains 1 knngie domain. 
Contains 1 peptidase S1 domain. 
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Gene Ontology (GO) 




Biological process 


proteolysis 

Inferred from electronic annotation. Source: InterPro 


Cellular component 
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Inferred from electronic annotation. Source: UniProtKB-SubCell 


Molecular function 


serine-type endopeptidase activity 

Inferred from electronic annotation. Source: UniProtKB-KW 


Complete GO annotation... 
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tntry status 
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P04185 (UROK_PIG)* Reviewed. UniProtKB/Swiss-Prot 

Last modified April 5, 201 1 . Version 94. 



Names and origin 


Protein names 


HecommenoQu naniB. 
Urokinase-type plasminogen activator 

Short name=U'plasininogen activator 
$^hort nAm^=iiPA 

wl IWI I 1 tot 1 1^ Urry 

EC=3.4.21.73 

Cleaved into the followina 3 chains: 

1 . UrokinasB'type plasminogen activator long chain 
A 

2. Urokinase-type plasminogen activator short chain 

3. Urokinase-type plasminogen activator chain B 


Gene names 


Name:PLAU 


Organism 


Sus scrofa (Pig) 


Taxonomic Identifier 


9823 [NCBI] 


Taxonomic lineage 

Protein attributes 


r- ^ ■ ~ - ■ 

Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > 

Euteleostomi > Mammalia > Eutheria > Laurasiatheria > 

Cetartiodactyia > Suina > Suidae > Sus 


Sequence length 


442 AA. 


■ 

Sequence status 


Complete. 


Sequence processing 


The displayed sequence is further processed into a mature 
form. 


Protein existence Evidence at transcript level. 
General annotation (Comments) 


Function 


Specifically cleave the zymogen plasminogen to form the active 
enzyme plasmin. 


Catalytic activity 


Specific cleavage of Arg-|-Val bond in plasminogen to form 
plasmin. 


Subunit structure 

i 


Found in high and low molecular mass forms. Each consists of 
two chains, A and B. The high molecular mass form contains a 
long chain A which is cleaved to yield a short chain A. Binds 
LRP1B; binding is followed by internalization and degradation. 

Intf^rants with IntArantfi with PI Al JR 1 By similarity | 


Subcellular location 


5^Rnrfited 1 By similarity I 


Sequence similarities 


Belongs to the peptidase S1 family. 
Contains 1 EGF-like domain. 
Contains 1 kringle domain. 
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Comparative Modeling Methods: Application to the 
Family of the Mammalian Serine Proteases 
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Computer Assisted Molecular Design Group, Pharmaceutical Products Division, Abbott Laboratories, Abbott Park, 
Illinois 60064 



ABSTRACT Comparative modeling meth- 
ods are described that can be used to construct 
a three-dimensional model structure of a new 
protein from knowledge of its sequence and of 
the experimental structures and sequences of 
other members of its homology family. The 
methods are illustrated with the mammalian 
serine protease family, for which seven experi- 
mental structures have been reported in the lit- 
erature, and the sequences for over 35 different 
protein members of the family are available. 
The strategy for modeling these proteins is pre* 
sented, and criteria are developed for determin- 
ing and assigning the reliability of the modeled 
structure. Criteria are described that are spe- 
cially designed to help detect cases in which it 
is likely that the local structure diverges signif- 
icantly from the usual conformation of the fam- 
ily- 

Key words; protein structure, computer model- 
ing, structure prediction, sequence 
homology, structure homology 

INTRODUCTION 

It has been apparent for close to two decades that 
proteins from very different sources and sometimes 
with rather diverse functions can have homologous 
sequences and consequently very similar three- 
dimensional structures.^ This fact has been the ba- 
sis for the development of comparative modeling 
methods,^"^ which permit extrapolation from the ex- 
perimentally determined structure for one or more 
members of a homologous family to a new member of 
this family whose sequence has been determined but 
whose structure is as yet unknown. A number of 
factors combine to increase greatly the application 
of comparative modeling techniques today. The 
large number of protein structures^ and the explod- 
ing number of protein sequences^ that are being re- 
ported in the literature and that are readily avail- 
able in computerized databases provide the basic 
structural and sequence data needed to apply the 
method. The proteins in which we are interested are 
often available in only small quantities, too little for 
structural studies unless the gene or mRNA is 
cloned or synthesized and expressed. Even when this 
latter effort is deemed worthwhile and is initiated, 



the comparative modeling studies can be performed 
in the meantime, providing an approximate view of 
the structure of the molecule until sufHcient protein 
can be obtained and the experimental structure can 
be determined. Such a model structure can be very 
useful in the interim to help plan and interpret 
biochemical^^ and mutagenesis^^ experiments, to 
probe functional properties,^^ and improve our un- 
derstanding of ligand or substrate binding.^^"*® 

The use of comparative methods involves extrap- 
olation from one or more known structures to con- 
struct a new structure. It is important to consider 
how accurately this function can be performed in 
assessing whether it is a useful and worthwhile ex- 
ercise. Certainly, comparisons that have been pub- 
lished between predicted model structures and the 
experimental structures determined later^®^^® indi- 
cate that the modeled structure is not likely to be 
completely accurate. This paper reports the methods 
that we have developed to perform comparative 
modeling and the criteria that we use to estimate 
the reliability of various parts of the structure and 
especially to identify potential problem areas that 
are likely to be particularly difUcult to predict cor- 
rectly. 

To describe properly and illustrate the modeling 
techniques, we will use the mammalian serine pro- 
tease family.^® Members of this family are ubiqui- 
tous in nature; they are present all along the evolu- 
tionary pathway from bacteria to humans. They 
play important roles in a wide variety of body func- 
tions, including blood coagulation, fibrinolysis, com- 
plement activation, fertilization, and digestion. 
Therefore, modeling the various members of this 
family can provide valuable new information about 
divers critical biological functions of the body. 

Comparative modeling works best when there are 
several experimental structures and known se- 
quences. Experimental structures for seven differ- 
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TABLE I. Experimentally Known Three- 
Dimensional Structures of Serine Proteases 







Resolution 


Refer- 


Protein 


Source 


(A) 


ence 


Chymotrypsin 


Bovine 


1.8 


24* 


Trypsin 


Bovine 


1.8 


25* 


Blastase 


Porcine 


1.8 


26 


Kallikrein 


Porcine 


1.8 


27 


Mast cell protease 


Rat 


1.8 


28 


S. griseus trypsin 


S. griseus 


1.7 


23 


Tonin 


Rat 


1,8 


29 



*Structures have been determined by several groups for each of 
these proteins. The reference shown corresponds to the coordi- 
nates used in this work. 



ent serine proteases* that can be considered mem- 
bers of this mammalian family have been reported 
in the literature and deposited in the Brookhaven 
Protein Structure Database.® These include chy- 
motrypsin,^** trypsin,^® elastase,^^ kallikrein,^^ rat 
mast cell,^® Streptomyces griseus trypsin-like 
protein,^^ and tonin^^ (see Table 1). Several of these 
have been solved independently in more than one 
laboratory either in the same form or in alternate 
forms of the molecule.*^*^^ In addition, structures 
have been reported for the precursor, zymogen, form 
of chymotrypsin^^*^* and trypsin.^®'^* To comple- 
ment these structures, amino acid sequences for 
over 35 different serine proteases have been re- 
ported (Table II), not coimting species variations. 
Thus this family presents an ideal system for devel- 
oping comparative modeling methods and at the 
same time applying them to proteins involved in im- 
portant and interesting biological functions. 

COMPARATIVE MODELING METHOD: THE 
"SPARE PARTS" ALGORITHM 

Comparative modeling requires extrapolation 
from known structures to produce a model of an un- 
known structure. Our ability to extrapolate accu- 
rately is, unfortunately, greatly limited by our still 
rudimentary knowledge and understanding of pro- 
tein structure and energetics. Consequently, the 
techniques that are used to extrapolate influence 
the resultant model critically and may introduce 
considerable error. The methods that we have devel- 
oped attempt to systematize the modeling process by 
combining all the known structures and sequences 
to help improve the accuracy of the extrapolation. 



♦Structures have also been reported for several serine pro* 
teases from bacterial sources.*'*""'" Although these are cer- 
tainly members of this family, they differ sii^ciently in their 
structures that comparative modeling methods are not devel- 
oped enough to permit modeling them from the mammalian 
structures without serious errors.^ Therefore, they are not 
considered in this work. 



Analysis of the Structural Properties 
of the Family 

The first steps in the modeling process are illus- 
trated schematically in Figure 1. Let us assume that 
we have experimentally determined three-dimen- 
sional structures for several members of the homol- 
ogous family of interest, e.g., structures "A," "B," 
and "C" in Figure la. The known structures are su- 
perimposed in three dimensions to obtain a maximal 
overlap of the structures (Fig. lb). Performing this 
superposition of the structures is sometimes not a 
trivial procedure. This is true because we want to 
overlap the molecules based on the parts that are 
the same and ignore the parts that are difTerent. The 
more different the molecules are from each other, 
the harder it is to find a unique best overlap of the 
common features of the structures. Several pro- 
grams have been written that do this analyti- 
cally.37-38 

Once superimposed, there are parts of the known 
structures that overlap very well, indicating that 
the structures are closely conserved in these regions 
(see bold areas in Fig. lb). We call these i>ortions 
"structurally conserved regions," or SCRs, and ex- 
pect that they will usually remain conserved in all 
members of this homologous family. These regions 
are usually composed of the secondary structure el- 
ements, the immediate active site, and other essen- 
tial structural framework residues of the molecule. 
Between these conserved elements are highly vari- 
able stretches that differ significantly from one 
member of the family to the next. These are called 
"variable regions," or VRs. They are almost always 
loops that lie on the external surface of the protein, 
and they contain all the additions and deletions be- 
tween different protein sequences. 

After the structures have been overlapped in 
three dimensions and parsed into SCRs and VRs, the 
next step is to align their amino acid sequences. For 
purposes of comparative modeling, the sequence 
alignment is done differently from the usual 
methods. '^^ Instead of relying on criteria such as 
amino acid identity or homology, we use strictly the 
three-dimensional overlap as the criterion. When 
the a-carbons of the respective residues in the over- 
lapped protein structures occupy the same place in 
three-dimensional space, then the residues are cor- 
responded in the sequence alignment (Fig. 2). Thus 
the alignment is primarily concerned with the 
SCRs, since these are the portions of the structures 
that are basically the same in all protein members of 
the family. The alignment in the VRs is often arbi- 
trary, unless two or more structures have similar 
conformations in a particular VR (for an example of 
this see the VR in the upper left of the structures 
"A" (sequence C-D-F-A) and "C" (sequence C-R-Y- V) 
in Figs, 1 and 2). 

The resultant sequence alignment is then scruti- 
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TABLE II. Sequences of the S erine Proteases* 

FVotein 

Chymotrypsin 
Trypsin 
Elastase 
Kallikrein 
Mast cell protease 
S. griseus trypsin 
Tonin 

Haptoglobin heavy chain 
Protein Z 
Protein C 

Nerve growth factor a chain 
Nerve growth factor 7 chain 
Blood clotting factor VII 
Blood clotting factor IX 
Blood clotting factor X 
Blood clotting factor XI 
Blood clotting factor XII 
Plasmin 

A{K)lipoprotein A 
Tissue plasminogen activator 
Urokinase 
Thrombin 

Complement factor B 
Complement factor 2 
Complement factor D 
Complement factor IR 
Complement factor IS 
Adipocyte protease 
Cathepsin G 
T-cell serine protease 
Hannuka factor 

Cytotoxic T lymphocyte protease 
Batroxobin 
EGF binding protein 
Drosophila snake locus 

'''The sequences are taken from the sequence database.^ 
"VFor those proteins for which a three-dimensional structure is 
available, the sequence Is taken from the Brockhaven struc- 
ture database.^ 



nized carefully to identify strong stretches or pat- 
terns of sequence homology that are characteristic 
of each SCR in the structure. For example, the se- 
quence "L-S/T-V-TT-I-ir," where is a charged or po- 
lar residue, is conserved in the second SCR in Figure 
2. Similarly, "G-I-A" is found in all members at the 
third SCR. However, in the fourth SCR, the situa- 
tion is less clear. A proline usually appears in the 
second position but may be replaced by a glycine. 
Most SCRs can be identified by a characteristic (not 
necessarily contiguous) sequence pattern. In some 
cases, there is a pattern of hydrophobic or hydro- 
philic residues rather than specific side chains. 
Careful analysis of the sequences and the corre- 
sponding three-dimensional structures will usually 
allow some pattern to be discerned. 

The identification of these sequence homology 
patterns is a crucial step in the modeling. As is 
shown below, it is essential for the correct alignment 
of a "new" sequence. If the characteristic sequence 
pattern is not present, then it is not clear how to 
align the "new" sequence and, consequently, how to 



L/Ode 


Source 


CHTt 


Bovine 


TRPt 


Bovine 


ELAt 


Porcine 


KALt 


Porcine 


MCPt 


Rat 


SGTt 


S. griseus 


TONt 


Rat 


HPH 


Human 


PRZ 


Bovine 


PRC 


Human 


NGA 


Human 


NGG 


Human 


VII 


Human 


FIX 


Human 


FAX 


Human 


FXI 


Human 


XII 


Human 


PLM 


Human 


ALP 


Human 


TPA 


Human 


UKH 


Human 


THR 


Human 


CFB 


Human 


CF2 


Human 


CFD 


Human 


CIR 


Human 


CIS 


Human 


ASP 


Mouse 


CAG 


Human 


TCL 


Mouse 


HAF 


Human 


CTL 


Mouse 


BTX 


Snake 


EBP 


Mouse 


DSN 


Drosophila 



construct the model structure. This is one of the rea- 
sons that the comparative modeling method cur- 
rently depends so heavily on the retention of se- 
quence homology. The above steps are performed 
once for a protein family and need be reexamined 
only as new experimental structures and their se- 
quences become available. 

Construction of a "New" Structure 

We are now ready to begin modeling a "new" pro- 
tein sequence of interest that is a clear member of 
this homologous family. The first step is to align this 
"new" sequence to the sequences of the SCRs using 
the previously identified characteristic sequence ho- 
mology patterns. It should be possible to align an 
appropriate portion of the "new" sequence to each 
SCR with these patterns. Thus, the "new" sequence 
L-T-V-K-I-E fits the pattern L-S/T-V-it-I-it de- 
scribed above and can be aligned to the SCR at po- 
sitions 7-12 in Figure 2 and similarly for the G-I-A 
sequence of the SCR at 16-18. There will occasion- 
ally be more than one possible alignment for a par- 
ticular SCR, especially if the sequence pattern is a 
weak one. Such an example might occur in the SCR 
corresponding to residues 1—2 (Fig. 2). In such cases, 
the different possible alignments would have to be 
considered. As each portion of the "new** sequence is 
aligned to an SCR, the rest of the positions in that 
SCR are filled with the adjacent "new" sequence 
without permitting any additions or deletions 
within the SCR.t 

The remaining residues make up the VRs. The 
"new" sequence in each VR is examined to see if it 
corresponds in length and residue character to one of 
the VRs of the known structures. For example, the 
third VR in the '*new" protein in Figures 1 and 2, 
residues 13-15, has the same sequence length and 
character as this VR in protein "B" (A-W-D-S-L in 
"new" vs. A-W-N-T-M in "B") and therefore has been 
aligned to it. On the other hand, none of the known 
structures has a sequence that corresponds to resi- 
dues 3—6 of the "new" protein in residue length. 

Once the "new" sequence is aligned, as shown in 
Figure 2, the model building can begin. For the 
SCRs, usually the main chain coordinates from any 
one of the known structures can be taken (Fig. Ic). 
The side chains are mutated to those of the "new" 
sequence wherever necessary. In our implementa- 
tion, the Xi angle for the "new" side chain is chosen 
to maximize overlap of this side chain on the old one. 
Further side chain x angles are not fitted at this 



t An exception would occur if there are too few residues be- 
tween this SCR and the neighboring SCR. In that case, the end 
residues of an SCR would be left with a deletion. For examples, 
see position 96 in PLM and ALP in Figure 4. 
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Fig. 1 . Schematic representation o1 the comparative modeling 
method, a: There are three proteins in this homotogous family 
whose structures are known. A, B, and C. Each protein is repre- 
sented with a characteristic dashed or dotted pattern throughout 
the figure (see key in d). b: The three proteins are superimposed 
showing that parts of the structure are conserved (SCRs in t>old 
lines) and parts are variable from one protein to the next (V/Rs, 
respective dashed and dotted lines), c: Steps in the construction 
of the schematic '*new" model structure are presented in this and 
the next two parts. The SCRs (bold lines) are constructed from the 
main chain coordinates of any one of the known structures, since 
they are all very close to the same. The side chains are mutated 



to the "new" sequence where necessary, d: The vark>us VR con- 
formations found in the known structures are considered for each 
VR of the "new" protein. (Compare the respective VRs shown 
here with those in a and b.) The ones that do not fit are rejected, 
as shown by the crossed arrows. The nx>st suitable is selected in 
each case. In some cases, other conformational search**^^^^ or 
energetics'*®'*^ methods must be employed, since no suitable con- 
formation can be found for that VR among the known structures 
(see VR in upper left corner with the sequence C-F-N-L-Q for an 
example in which a different, new conformation is necessary), e: 
The composite structure shtowing the source of the respective 
"spare parts" selected for the model structure. 
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Rg. 2. Sequence alignment for the set of proteins in the sche- 
matic homologous family correspornJing to Figure 1. The align- 
ment Is performed based solely on the overlap of the three-dimen- 
sional structures and not by sequence alignment methods. The 
t>oxes delineate the SCRs as determined from the three-dimen- 
sional structure overlap. The sequence of a "new" protein is 
aligned based on the characteristic patterns of sequence homol- 
ogy found for the known structures and their sequences (see text). 
The single letter amirio acid codes are given in Figure 4. 
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stage, because the nature of the different side chains 
usually diverges beyond this point. An automatic 
angle-scanning energy-optimizing routine is usually 
not employed, because we find that it often selects 
an unsuitable conformation based on minor or insig- 
nificant differences in energy. However, incorpora- 
tion of recently compiled and reported rotamer 
libraries'*" may improve our ability to automate the 
side chain conformation selection process. 

For those VRs for which an appropriate example 
structvure can be selected from among the known 
structures during the alignment process descnbed 
above, the main chain fragment is taken directly 
from the VR of that known structure, and again the 
side chains are mutated to fit the "new" sequence 
(see examples in Fig. Id and e). Remaining VRs, 
such as the VR at positions 3—6 in Figures 1 and 2, 
have no good example to build upon among the 
known structures. Therefore, they have to be con- 
structed by more complex conformation search 
methods'*^"'*^ and energetic considerations.'*®''*'^ We 
have also used the protein database as a source of 
tentative starting conformations for loops of this 
type. The two ends of the particular loop that lie in 
the adjacent SCRs are taken, and the Brookhaven 
Protein Structure Database^ is searched for struc- 
tural fragments that closely match the conformation 
of these ends (specifically, the a-carbon positions) 
and have the appropriate number of residues be- 
tween the ends. Thus, for example, in the VR at 
positions 3—6 of the "new" sequence, the structures 
of the two ends corresponding to residues "A-D-A" 
(positions —1 to 2) and "L-T-V- , . (positions 7 to 
9. . .) are selected and typically ten structures that 
match these two ends best [lowest root mean square 
(rms) deviations] and that have five residues in be- 
tween are chosen and examined. Obviously, any 
such conformation that would collide with the re- 
mainder of the constructed "new" structure is elim- 
inated. Similarly, if the conformation does not pack 
well or buries a single charged group, it is rejected. 
In this way, one or a small number of initial confor- 
mations can be selected for this loop. Clearly this 
latter method does not produce an exhaustive list; 
however, it does provide loop conformations that are 
known to appear in other proteins. This method has 
previously been described by Kraulis and Jones^^ 
for constructing protein structures from fragments 
using nuclear magnetic resonance (NMR) data or 
using crystallographically generated electron den- 
sity maps. 

It is important to emphasize that it is the initial 
spatial alignment of the three-dimensional struc- 
tures performed above (Fig. lb) that allows this con- 
venient clipping of fragments or "spare parts" from 
the various known structures to construct a compos- 
ite model structure of the "new" protein. Thus over- 
lapping of the structures is essential for two crucial 
steps: for the original alignment of the sequences 



and for clipping together fragments in the subse- 
quent construction of each "new" model protein 
structure. Note that, the more experimentally 
known structures available, the more likely the 
boundaries of the SCRs will be well-defined and the 
greater the likelihood that a suitable example 
known structure, i.e., spare part, will be found for 
each "new" VR among the known structures. 

Because the different portions of the "new" con- 
structed model structure arise from quite different 
sources (Fig. le), we can assign appropriate qualita- 
tive reliability confidence levels to the respective 
parts of the structure. Clearly, the conformations in 
the SCRs are the most reliable, especially when sev- 
eral known structures are available and have been 
compared in detail. This is true only if the respective 
SCRs of the "new" sequence retain the characteristic 
homologous sequence patterns in those SCRs. When 
they do not, and this is illustrated below, it should 
be regarded as a red flag warning that something 
different may be happening in this region of the par- 
ticular "new" protein. In such cases, great care must 
be taken in constructing the "new" structure and the 
confidence level in this portion of the molecule re- 
duced appropriately. For the VRs, when a good 
model structure appears for the respective VR 
among the known structures, then a fair confidence 
level can be assigned. This level is not as high as in 
the SCRs because of the greater inherent variability 
of these loop regions. The actual confidence level for 
each such VR would depend on the size of the loop 
(larger loops have more degrees of freedom and are 
therefore less reliably determined), how good the se- 
quence homology is to the known structure selected 
to model this VR, and how well the chosen confor- 
mation fits and packs onto the rest of the "new" 
structure. The confidence is lowest for those VRs 
with no good model loop among the known struc- 
tures. When conformational methods or database 
search techniques have to be employed, ex- 
perience^^"^® shows that we are not able to predict 
the conformation reliably in many of these cases. 

The resultant structure is then examined in detail 
to identify possible serious errors. For example, the 
inner cores of the structure are checked to be sure 
that no inappropriate charges have been buried. The 
modeling method makes it unlikely that there are 
serious steric contacts between main chain atoms in 
the model. This is because the main chain coordi- 
nates in the SCRs were taken from the known struc- 
tures. Only if the bad contact was present in the 
parent known structures will it be found in the 
model. In the VRs, absence of main chain overlap 
was a major criterion for acceptance of a conforma- 
tion. Therefore, in practice, bad steric contacts can 
usually be relieved by side chain rotations; occasion- 
ally by having to select an alternative conformation 
for a VR. The model structure can then be intro- 
duced into an energy-minimization program^®*"*' to 
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Fig, 3. a-Carbon piots showing the superposition of the seven 
experimentalty known serine protease structures (see Table I). 
Note the SCRs, where the seven structures are virtually the same, 
and the VRs, where the structural divergence is considerable. 



The l<ey to the plots is as follows: CHT, green; TRP, solid red; 
ELA. solid blue; KAL. solid cyan; MCP, dotted green; TON. dotted 
red; SGT, dotted blue. 



relieve any remaining minor steric contacts and to 
optimize the bond angles and torsional angles. Typ- 
ically, we begin with restrained or template-forced'*^ 
minimization, forcing the atomic coordinates to re- 
main close to their initial positions. This allows bad 
contacts to relax without introducing large artifac- 
tual distortions into the structure. After several 
hundred cycles, the constraints can be progressively 
removed and the energy of the molecule minimized. 

Experience with comparative molecular mod- 
eling*^^"^^ has shown that these models are never 
completely accurate, especially in the conformations 
selected for the VRs. Therefore, any uses of the 
model structure and predictions that are made based 
on it should take into consideration this fact and the 
local reliability confidence levels discussed above. 
We always regard such a model as a working hy- 
pothesis; as new data emerge, whether from struc- 
tural, spectroscopic, or biochemical sources, the 
model is modified, corrected, improved, and refined 
to fit the new data. 

RESULTS AND DISCUSSION 
Initial Alignment of Known Serine 
Protease Structures 

Our analysis of the serine proteases began some 
years ago with the superposition of the then three 
known structures of chymotrypsin, trypsin, and 
elastase.^'"^ Since then, coordinates for four more 



structures have appeared in the literature and data 
bank": kallikrein,^^ mast cell protease,^® Streptomy- 
ces griseus trypsin-like protease,^ and tonin.^^ The 
overlap of these additional structures was performed 
by first superimposing the added experimental 
structure onto the rest using the a-carbon positions 
of residues 195, 57, and 102,$ The superposition was 
refined, when necessary, using repeated cycles of 
least squares fit on related residue a-carbon posi- 
tions between the protein to be fit and chymotryp- 
sin. Only residues whose positions were conserved in 
the two structures were fitted in the calculation; 
that is, any residue was eliminated from the fitting 
if the deviation in a-carbon positions was more than 
2 A on the first cycle and then more than 1.5 A on 
subsequent cycles. Figure 3 shows the final overlap 
obtained for these seven structures. Virtually all the 
SCR positions identified in the previous analysis of 
the first three proteins^ remain structurally con- 
served. The repertoire of conformations, i.e. ''spare 
parts," found for each of the VRs is increased with 
the larger number of structures, as expected (Table 
III). 



tThe residue numbering given for the serine proteases fol- 
lows that of the chymotrypsinogen molecule throughout this 
paper, in the text, in the tables, and in the figures. 
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TABLE III. Sequence Lengths of the Different VRs in the Serine Proteases 



VR 



VR 


oo— 41 






1 At 


1 Od 1 oo 


14b— loi 


loo— 1 /y 


ioo— loy 


OAQ QAA 

zUo — ^Uo 


Zl /— 


CHT 


7 


4 


9 


5 


9 


4 


14 


3 


4 


8 


TRP 


5 


3 


9 


5 


7 


6 


14 


5 


0 


8 


ELA 


10 


5 


9 


7 


9 


5 


16 


4 


4 


10 


KAL 


6 


3 


9 


9 


7 


8 


14 


5 


0 


9 


MCP 


10 


3 


9 


5 


9 


5 


13 


5 


0 


6 


SGT 


2 


8 


7 


3 


7 


5 


15 


6 


5 


8 


TON 


4 


3 


9 


16 


7 


6 


14 


5 


0 


9 


HPH 




16 


0 


1 


8 


4 


31 


6 

x^ 


6 


7 


PRZ 


7 


1 


4 


5 


11 


2 

x^ 


13 


3 


4 


6 


PRC 


7 

f 


5 


9 


5 


13 

X \^ 


10 

* x^ 


14 


6 

x^ 


4 


8 


NGA 


6 


3 


9 


16 

X- 


7 


6 


14 


5 

x^ 


0 


9 


NGG 


6 


3 


9 


12 

Xi Ah 


7 


6 


14 


5 


0 


9 


VII 


6 


8 


9 


5 


12 


5 


19 


5 


4 


8 


FIX 

& x^^ 


6 


5 


9 


7 


11 

X X 


5 


14 

* • 


5 


4 


8 


FAX 


7 


5 


8 


5 


11 

iX X- 


5 


14 


5 


4 


8 


FXI 

& ^^x 


9 


8 


9 


5 


9 


5 


15 


5 


3 


8 


XII 


5 


8 


9 


5 


9 


6 


16 


5 


7 


8 


PLM 


7 


8 


9 


-1 


9 


4 


16 


5 


4 


8 


ALP 


7 
• 


8 


9 


— 1 


9 


4 


7 


5 


4 


8 


TPA 




Q 
o 


Q 




a 

fj 




16 

J. \i 


11 

X X 


4 


8 


UKH 


11 


8 


9 


7 


9 


6 


16 


5 


4 


8 


THR 


8 


13 


10 


6 


12 


11 


14 


8 


6 


8 


CFB 


10 


8 


1 


14 


25 


3 


27 


7 


4 


14 


CF2 


6 


8 


5 


14 


25 


0 


28 


4 


4 


21 


CFD 


6 


7 


10 


5 


11 


5 


10 


3 


0 


9 


cm 


4 


12 


8 


8 


9 


3 


19 


5 


6 


6 


CIS 


3 


5 


10 


12 


11 


4 


23 


4 


7 


5 


ASP 


6 


8 


9 


5 


11 


5 


16 


3 


0 


9 


CAG 


9 


3 


9 


5 


9 


4 


14 


5 


0 


6 


TCL 


6 


4 


8 


5 


9 


5 


18. 


5 


0 


9 


HAF 


6 


4 


9 


5 


9 


6 


18 


5 


0 


10 


CTL 


9 


3 


9 


5 


9 


5 


15 


5 


0 


6 


BTX 


6 


3 


13 


2 


7 


6 


13 


5 


0 


9 


EBP 


6 


3 


9 


16 


7 


6 


14 


5 


0 


9 


DSN 


15 


5 


10 


5 


6 


5 


20 


4 


7 


8 


Min. 


2 


1 


0 


-1 


7 


0 


7 


3 


0 


5 


Max. 


15 


16 


13 


16 


25 


11 


31 


11 


7 


21 


Range 


13 


15 


13 


17 


18 


11 


24 


8 


7 


16 



Based on the above superposition of structures, 
the seven sequences were aligned as described above 
(Fig. 4). Note the wide variation in the length of the 
different sequences in each of the VRs (Table III). 
Using this sequence alignment, we attempted to 
identify the critical conserved stretches of amino 
acid sequence. These are characteristic of each SCR 
and are essential for aligning "new" sequences to 
the known ones» which is the first step in modeling. 
The patterns that were chosen are noted in Figure 4 
at the bottom of the sequence list in the row labeled 
"CON." In some cases, there was almost complete 
conservation of a single amino acid. In other cases, 
only a hydrophobic side chain (typical of an internal 
position) seems to be required. 

Because the identification of these stretches is 
such an important part of the modeling process, it is 
worthwhile describing in more detail how this is 
performed. In most cases, it is straightforward; the 
conserved sequence is readily apparent. For exam- 
ple, the characteristic lA^-lA^-G-G at positions 16— 



19 (see Fig. 4) defines the first SCR very clearly. 
Similarly, the remarkably conserved S/T/A-G-W-G 
sequence of residues 139—142 is highly characteris- 
tic of this SCR. On the other hand, the SCRs at 63- 
71 and 81-96 have no such clearly conserved se- 
quence, and it is often very difficult to align a "new" 
sequence to the known ones in these SCRs unambig- 
uously. 

To understand better why this latter group of 
SCRs has so little sequence homology, the structures 
of these two regions were examined in detail (Fig. 5). 
It is immediately apparent that these SCRs lie on 
the surface of the molecule, and virtually all the 
residues point out into solvent and thus can and do 
vary with little consequent effect on the rest of the 
structure. However, in each SCR, there are some 
side chains that point into the interior of the mole- 
cule and thus are more conserved. In the SCR at 
63-71, this includes the Val (or comparable ali- 
phatic residue) at 66 and the aliphatic side chain at 
68. The Gly at position 69 appears to be almost com- 
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Fig. 5. The location is shown for some of the SCRs that are 
difficult to align in the serine proteases because of only weak 
characteristic patterns of homology. This is usually due to the 
SCR lying on the surface of the molecule. This figure presents the 
Ca plot of CHT (dotted lines), with the location shown of the 
respective SCRs that are difficult to align: 63-71 and 81-96 (solid 
lines). Note that these SCRs lie on the surface of the molecule and 



that most of their side chains point out irYto solvent and thus are 
free to vary more rapidly than internal residues. In the SCR at 
63-71, conserved residues 66 and 68 are the only internal ones 
and are always hydrophobic; 69 is always a Gly. because it forms 
a turn. In the SCRs at 81-96, internal hydrophobic residues are 
83, 85, 89, and 94. Partial conservation is observed at positions 
91 and 92, with a His-Pro. 



pletely conserved, probably because it forms a turn 
and adopts a conformation (<() = --60'', ^ ~ ~25°) 
that is permitted only for Gly (Fig, 5). Similarly, 
small hydrophobic side chains occur at positions 83 
and 85 in the SCR at residues 81-86 (Fig. 5). More 
varied but always hydrophobic residues appear at 
89, and an aromatic one is typical at 94, both of 
which point into the molecule. In addition, there is 
also a tendency for a His-Pro at positions 91—92 (Fig. 
6); however, a number of sequences have only one or 
none of these two residues (Fig. 4). 



CNO 5S 60 65 70 75 80 CHO 

> < ><-> <- 

CHT AAHCGVT TSDVWAGEFDOGSSSE-KIQK CHT 

TRP AAHCYKS GIQVRLGQDNINV-VEGNQQF TRP 

ELA AAHCVDRE LTFRVWGEHNLNQ-NNGTEQY ELA 

KAL AAHCK NDNYEVWLGRHNLFE-NENTAQF KAL 

MCP AAHCKG REITVILGAHDVRK-AESTQQK MCP 

SGT AAHCVSGS GNNTSITATGGWDLQS GAAVK SGT 

TOM AAH rVSM Mvnxn.triftNMLgK-DEPFAOR IQM 

HPHold TAKNLFLN— — HSENATAKDIA'»PTLTLY===VGKKQL HPHold 

HPHnew TAKNLFLNHSENATAKDIAPTLTLyVGKK==""»-— — =QL HPHnew 

> < ><-> <- 

CON AAHC V \G CON 



Fig. 6. The oid^ '* and new sequence alignments for HPH, rel- 
ative to those of the known three-dimensional structures, for the 
region around the SCR 63-71 . Note that six residues have been 
moved from the VR after position 71 to the VR prior to position 63. 
Nomenclature and labeling are as in Figure 4. 



Fig. 4. Sequence alignment for many of the known serine pro- 
tease sequences. The source of these various sequences and the 
definitions of the protein name codes used in this Figure are given 
in Table II. The first seven proteins (above the line) are those with 
known experimental three-dimensional structures (see Table I). 
Their sequences have been aligned based on the superposition of 
the three-dimensional structures as described in the text. The 
remaining proteins were aligned using the characteristic se- 
quence matching patterns (see text). When the sequence align- 
ment is uncertain, the respective sequence is shown in italics. The 
lUPAC-lUB convention standard single letter amino acid code, 
used in this figure, is as follows: A - Ala, C = Cys. D = Asp. E 
= Glu. F = Phe. G = Gly, H = His. I = lie. K = Lys. L = Leu, 
M = Met. N = Asn. P = Pro, Q = Gin. R - Arg. S = Ser. T = 
Thr, V = Val, W = Trp, Y = Tyr. An asterisk indicates that the 
sequence continues past the last residue shown in the figure. 
Positions of relative deletions in the sequer)ces are denoted by a 



dash in the known structures and by a double dash in the se- 
quences aligned using the characteristic homology patterns. The 
line labeled "CNO" gives the chymotrypslnogen residue numt>er- 
ing used throughout this paper. The symbols delineate the 
SCRs. The "CON" line lists the conserved, characteristic se- 
quence patterns used to align the "new" sequences (see text). 
They are coded as follows: upper case, almost completely con- 
sented side chain; lower case, high frequency of this amino acid 
at this position: X. nonpolar residues, typically A, V, L, 1. M; u, 
denotes a polar residue, typically S, T, Q. N plus the charged 
residues; o. denotes S or T, (may be substituted by A occaston- 
ally); <J>. denotes aromatic, usually Y or F, sometimes W; -»-, de- 
notes a positive residue such as R. K. or H; denotes a negative 
residue such as D or E. The line labeled "S-S" gives the position 
of the half cystine residue that is disulfide bridged to the half 
cystine at the label position. A question mark is placed at positions 
where it is not clear to which residue a disulfide bridge is formed. 
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Fig. 7. Ca plots for the old (dotted lines) and new (solid lines) 
structures of HPH based on the respective alignments shown in 
Figure 6. The differences appear around the SCR at positions 
63-71. Because the VR from 59 to 62 In the new HPH alignment 
IS six residues longer than in the previous alignment (Fig. 6) and 
at least eight residues longer than any of the currently known 
structures in this loop (see Figs. 4 and 6. Table III), the confor- 
mation that has been drawn for this VR (dashed lines) is arbitrary. 
It has been included only to give a sense of the relatively large 
size of this loop. The difftcuities associated with finding the correct 



conformation for such targe additions are described in the text. 
The VR at residues 72-80 is highly truncated, seven residues 
shorter, in the new structure. Compare the conformations of the 
known structures for the VRs at 59-62 and 72-80 (Fig. 3) with 
these loops in HPH. Whereas the Ca positions (and main chain 
coordinates) for residues 63-71 are the same in this figure for the 
old and new structures of HPH. the side chains at each residue 
position are, of course, different in the two alignments (Fig. 6), 
giving different overall structures for this part of the molecule as 
well. 



Alignment of "New" Serine 
Protease Sequences 

Identification of the characteristic sequence con- 
servation patterns for each SCR allows the process 
of aligning "new" sequences to begin. Figure 4 
shows the alignment of a large number of serine 
protease sequences to those of the known structures. 
The proteins included are taken from a very wide 
variety of functions and species. It is worth noting 
that three of these proteins, although clearly homol- 
ogous to the rest and therefore full members of the 
family, are no longer functional serine proteases. 
These are the heavy chain of haptoglobin (HPH), 
protein Z (PRZ), and the a-chain of nerve gn^owth 
factor (NGA), For all these proteins, the character- 
istic pattern for each SCR was located and matched. 
Then the remaining positions in that SCR were 
filled, without permitting any additions or deletions. 

One of the results that emerges immediately from 
this alignment (Fig. 4) is the strong reinforcement of 
the characteristic patterns in all these "new" se- 
quences. The same basic patterns appear in virtu- 
ally every protein, yet, in almost every pattern, 
there are individual protein sequences that have 
exceptions. Usually the exceptions are minor devia- 
tions from the theme of the conservation. For exam- 
ple, the chosen four-residue stretch from 139 to 142, 
S/T/A-G-W-G, is present in 26 of 35 protein se- 
quences reported in Figure 4. The Trp is replaced by 
a Phe or a Tyr, both of which are t3rpically accept- 



able replacements for a Trp, in four other proteins. 
In the same way, the characteristic sequence C-G-G 
almost always appears at positions 42—44, but some- 
times one of the Glys becomes an Ala, and occasion- 
ally the Cys is replaced by an alternative small side 
chain. It is clear that, with perhaps the occasional 
exception of the difficult SCRs between 63—71 and 
81—96 discussed above, all the sequences shown can 
be aligned relatively trivially and unambiguously to 
the respective characteristic conserved sequences in 
the SCRs (Fig. 4). 

The close examination of the SCRs at 63—71 and 
81—96 described in the previous section (Fig. 5), to- 
gether with the strong reinforcement of the charac- 
teristic sequence patterns in these SCRs (Fig. 4), has 
led to the realignment of the HPH sequence in this 
region. This results in a large change in the three- 
dimensional model structure for this protein from 
that previously proposed.^'"* The old alignment (Fig. 
6) was forced by the need to avoid placing the 
charged Asp 65 in position 66, where it would be 
buried in the hydrophobic core of the N- terminal 
3-barrel. Pro was placed at position 69 as a replace- 
ment for a Gly that would permit a turn. Realization 
that the aliphatic residues at positions 66 and 68 
and the almost complete conservation of a Gly at 
position 69 (Figs. 4 and 5) make up the characteris- 
tic sequence pattern for this SCR led to the realign- 
ment of the HPH sequence relative to those of the 
known structures as shown in Figure 6. As a result 
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Rg. 8. Ca plot for CHT is shown indicating ttie sites of VRs 
with large additions (dashed lines) and with large deletions (dotted 
lines) in CFB. The unusual residues described in the text are 
represented by solid lines. These residues include 16-20, 43. 
139-142, and 154-156. As can be seen, the unusual residues 
are localized in one region of the molecule arourKi the normal site 

of the new alignment, the HPH sequence has a very 
large addition in the VR at 59—62 and a very short 
loop in the VR at 72-80. Now, however, the require- 
ments of aliphatic residues at 66 and 68 and Gly at 
position 69 are completely satisfied. The difference 
between the old and new structures for HPH is il- 
lustrated in Figure 7, which demonstrates clearly 
that changes in the alignment can result in large 
changes to the derived structures. 

From an evolutionary perspective, deviation from 
the typical pattern of sequence conservation occurs 
most often in those members of the family that are 
no longer serine proteases in function. For example, 
HPH and PRZ are no longer serine proteases in 
function. Both have lost the active site Ser 195 (it is 
an Ala), and His 57 is also replaced by a Lys in HPH. 
Nevertheless, both proteins retain most of the char- 
acteristic sequence homology patterns, yet they de- 
viate more than the typical sequence. Examples of 
this are the immediate area around the Ser 195 or 
the SCR at 63—71. The Streptomyces griseus trypsin- 
like protein also has a weaker homology pattern 
than the rest, as has been previously noted.^ Even 
further diverged are the other bacterial serine 

20-22. 

proteases. These latter are not considered in this 
paper because of the considerably greater difficulty 
in predicting their structures correctly by these 
methods (see footnote * on page 318). 



of the N-terminus of the protease peptide chain, residue 16. Res- 
idues 16 and 17 should not be in their usual positions, because 16 
is not N-terminal in CFB or in CF2. Consequently, they really 
should be placed in the approximate position expected from our 
l<now)edge of the zymogen structures'"^ where the cleavage at 
16 has not yet occurred. 

Analysis of the Variable Regions 

Once the sequences that correspond to the SCRs 
are identified and aligned, attention can be focussed 
on the remaining residues forming the VRs. Based 
on the alignment in Figure 4, the sequence lengths 
deduced for the various VRs are summarized in Ta- 
ble III. It is clear from Table III that an amazing 
degree of variation occurs in the length of these 
loops and therefore in their conformations (Fig. 3). 

In previous studies,** we classified the modeling of 
the VRs into five different classes. 1) A known struc- 
ture has the same-length VR and the same residue 
character. In this case, a model structure is con- 
structed for the "new" VR by clipping in the respec- 
tive VR from the known structure. In such cases, the 
confidence level in this part of the structure can be 
reasonably good, especially if it is a short VR. One 
example occurs among the known structures of the 
serine proteases at residues 97—101, where the VRs 
of chymotrjT)sin (CHT), trypsin (TRP), and the mast 
cell protease (MCP) are all the same length and 
have the same conformation (see this loop in Fig. 3). 
2) Different lengths appear in this VR among the 
known structures, but they all have the same struc- 
tural motif, e.g., a ^-bend connecting two antiparal- 
lel p-strands. The application of a common struc- 
tural motif to the "new" VR conformation can be 
illustrated by examining the VR at positions 35—41 
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in the known structures (Fig. 3). In each case, the 
known structure consists of two antiparallel p- 
stands ending in a p-bend» this despite the fact that 
the loops are very different in length, varying from 
two to ten residues (Table III). Thus, it would be 
sensible to model this VR in a "new" structure, at 
least initially, as a p-bend. Unfortunately, this con- 
sideration may not always be sufficient to model the 
structure accurately enough. For example, we ini- 
tially modeled this VR in the MCP, which has ten 
residues, after the conformation of elastase (ELA) in 
this VR, which also has ten residues.** When the 
crystal structure emerged subsequently,^® this loop 
was indeed an antiparallel pair of p-strands ending 
in a p-bend; however, the strands were bent differ- 
ently, leading to a quite different conformation for 
this loop (Fig. 3). This shows that very careful anal- 
ysis must go into the construction of even this class 
of loops, especially when the loop to be built is large 
and has many potential degrees of freedom. 

In the next two cases of VRs, the modeling is sig- 
nificantly less reliable. 3) The closest known VR 
conformation has only a small (i.e., one or two resi- 
dues) relative addition or deletion to the VR that is 
being constructed. The addition or deletion of even 
one residue can lead to considerable changes in the 
overall conformation of the VR. Examples of this can 
be seen in Figure 3, where the VR encompassing 
residues 185-189 is shown. The range of lengths 
among the known structures is not large for this VR. 
The shortest is CHT, with three residues; ELA has 
four; TRP, KAL, MCP, and TON all have five; and 
SGT has six. As can be seen, the differences between 
the loops differing by only one residue are consider- 
able, requiring in each case significant rearrange- 
ment of the loop conformation. 4) The conformations 
of the VRs in the known structures vary consider- 
ably, even when the sequence length is the same. 
There are some loops that have different conforma- 
tions even when the residue lengths are the same. In 
many cases, it is not easy to identify why the differ- 
ent conformations are occurring. Figure 3 shows an 
example of such a VR in the 72-80 region of the 
serine proteases. In cases such as these, construction 
of the VR is very difficult since we do not know 
which known structure to use as the example from 
which to model. 

In both the latter cases, a conformational 
search^^""** is needed, in principle, to find the correct 
conformation. Such efforts are very time-consuming 
and are currently relatively unreliable. ^'^^ It is pri- 
marily for loops of these two classes that the Protein 
Data Bank^ is searched for possible tentative con- 
formations for these loops, as was described above. 

The last class of VRs is in some ways the most 
dif^cult to construct but perhaps is in other ways 
easier than previously expected. 5) A very large ad- 
dition occurs in the VR of the "new" structure. Loops 
that have large additions are relegated to this class. 
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Examples of this abound in Figure 4, e.g., a 17 res- 
idue addition in the 166-179 loop of HPH or the 11 
residue addition in the 97-101 VR of TON relative 
to CHT. These additions are so large that they pre- 
clude the use of even the most current and advanced 
conformational search methods and energetics anal- 
ysis. Therefore, in the past,'* we adopted the practice 
of not constructing structures for these VRs at alL 
However, the recently determined experimental 
structures of KAL^'^ and TON^® show the conforma- 
tions of two sequences in the VR 97—101 with rela- 
tively large additions (Figs. 3 and 4, Table III). In 
both cases, the loops are disordered in the crystal 
structures and therefore most probably in solution 
as well. In the case of KAL, this loop is even cleaved 
in vivo by an exogenous protease in many of the 
molecules. If this observation of disorder is true, in 
general, for large additions in the VRs, then it will 
greatly simplify the modeling of these regions, since 
there will be no unique structure that has to be de- 
termined. 

Sequence Exceptions Warn of Problems 

The importance of the characteristic sequence ho- 
mology to the reliability of the comparative model- 
ing goes beyond the above-described critical task of 
aligning the sequence of a "new" protein accurately. 
Specific exceptions to or violations of the character- 
istic sequences can signal a novel functional prop- 
erty of the protein. Larger discrepancies can be im- 
portant indicators that the usual extrapolation 
methods of comparative modeling outlined above 
are not valid for this particular protein in this spe- 
cific region. 

For this purpose, it is important to distinguish 
between cases of sequence divergence and violations 
of the characteristic pattern. Some of the proteins 
whose sequences are shown in Figure 4, such as 
HPH, PRZ, and NGA, are no longer serine proteases 
even though their degree of sequence identity and 
homology leave no doubt about their being full 
members of this serine protease structural family. 
Examination of the characteristic sequence patterns 
in these proteins shows that they deviate the most 
from the normal patterns. Nevertheless, clear indi- 
cation of the pattern usually persists and permits 
proper, unambiguous alignment of the sequence. 
For example, the immediate sequence around the 
active site His 57 and Ser 195 in HPH and PRZ 
differs more, especially in PRZ, from the usual pat- 
tern than any other sequence. However, enough of 
the pattern remains (even in HPH, where the His 
and Ser themselves are no longer present) to be 
clearly recognized and used for sequence alignment. 

In other cases, the pattern has faded, and it is 
difHcult to assign the actual alignment. However, 
the possibilities that are present could probably be 
accommodated within the rubric of the SCRs of that 
part of the structure. An example of this appears in 
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the DSN protein in the SCR from 63 to 71. The usual 
Gly residue that invariably appears at position 69 is 
absent. This leads to an ambiguity regarding how to 
align the sequence. However, at least one of the pos- 
sibilities, the one shown in Figure 4, is likely to fit 
into the usual serine protease structure at this po- 
sition with only minor deviations. 

On the other hand, there are examples of larger 
changes, including the complete absence of one or 
more characteristic sequence patterns. This can be 
illustrated by the SCR at 134-146. This SCR has the 
highly conserved, very characteristic S/T/A-G-W-G 
sequence at 139-142, as noted above. Virtually ev- 
ery serine protease has this sequence, with a rare 
change from Trp to Tyr or Phe and a very rare 
change of the second Gly to something else (see PRZ 
for an example of divergence as discussed in the pre- 
vious paragraphs). Note, however, the sequence for 
this SCR in CFB and CF2 (Fig. 4). In these cases, the 
characteristic residues are completely missing. 
Clearly something is happening to the structure 
here that is unusual. The absence of these residues 
creates a serious problem in properly aligning these 
two sequences in this SCR. There is no obvious pat- 
tern of residues that would correspond to the S/T/ 
A-G-W-G sequence at 139-142 among the residues 
that are in this region of the sequence. It is possible, 
in principle, to place the sequence A-L/H-F-V at 
these positions, thereby replacing the conserved Trp 
with a Phe. However, this leaves internal positions 
136 and especially 138 with charged residues such 
as Asp and Lys, where nonpolar residues are char- 
acteristic and are called for by the structure. The 
sequences have been tentatively aligned as shown in 
Figure 4 to satisfy the requirements that internal 
residues be nonpolar, or at least uncharged. The un- 
certain alignments are shown in italics in Figure 4. 

If we look further at these two sequences, CFB 
and CF2, we find another anomalous site. The clas- 
sic, characteristic pattern for positions 42—44 is C- 
G-G. Examination of the sequences in Figure 4 
shows that substitution of one of the Gly residues by 
Ala occurs occasionally. The startling introduction 
of a much larger Met side chain at position 43 in 
CFB and an even larger and charged Arg in CF2 
indicates that once again something strange is hap- 
pening in these proteins. Examination of the loca- 
tion of these changes in the serine protease three- 
dimensional structure (Fig. 8) shows that they are 
immediately adjacent to each other in the structure 
even though they are quite distant in the sequence. 

A further anomalous sequence occurs at positions 
16-19 of these two sequences. Activation of a serine 
protease from its zymogen protein involves the 
cleavage of the peptide chain prior to residue 16 to 
generate a free amino terminus at this point in the 
molecule.^^*^^ The earliest serine protease crystal 
structure solutions^* showed that this amino termi- 
nus formed an internal salt bridge with the side 



chain of Asp 194. The homology observed in the res- 
idues at positions 16—19 is likely to be diagnostic of 
this activation step in all these enzymes. Thus it is 
interesting that several of the sequences in Figure 4 
show major deviations from the characteristic se- 
quence pattern for these four residues. All the serine 
proteases have lie or Val (or Leu) at positions 16 and 
17 and usually Gly at 18 and 19. Exceptions to this 
pattern are found in PRZ and NGA, neither of which 
is a true protease and thus presumably no longer 
retains an activation step. The two proteins CFB 
and CF2 are also striking in the absence of any ho- 
mology to the characteristic pattern for this stretch 
of residues or even between the two themselves. It is 
known that, unlike the other serine proteases, these 
two proteins are clipped some 200 residues N-ter- 
minal to residue 16.^^*®'* Thus the activation mech- 
anism for these proteins must be different from that 
of the other serine proteases. Figure 8 shows that 
the normal location of the N-terminal residues 16 
and 17 is also close to the two unusual sequences 
described above for the three-dimensional structure. 

Thus significant violations of three spatially ad- 
jacent characteristic sequence patterns appear in 
these two proteins, coupled with functional signifi- 
cance expressed in the absence of the normal mech- 
anism of zymogen activation. It is interesting to note 
that CFB and CF2 occupy parallel functional roles 
in the alternative and classical pathways of the com- 
plement system, respectively.®^ They form the pro- 
teolytic subunits of the respective C3 convertases in 
the two pathways and, upon binding of C3b sub- 
units, change their specificity to become C5 conver- 
tases. Taken together, these observations suggest 
that some unusual three-dimensional structure is 
occurring in this particular region of the molecule 
involving the activation site and the immediately 
adjacent active and specificity sites. AVhat is difficult 
to predict or determine is just how different these 
structures actually are from the typical serine pro- 
tease structural theme in this region. The excellent 
preservation of the characteristic sequence homol- 
ogy patterns everywhere else in the CFB and CF2 
sequences (Fig. 4) suggests strongly that the re- 
maining parts of the molecule are close to their nor- 
mal conformation. However, there is no such reas- 
surance for this part of the molecule. 

As one begins to construct this part of the CFB 
molecule onto the typical SCR framework of the 
serine proteases, several problems are encountered 
very rapidly. The introduction of side chains at po- 
sitions 140 and 43 in place of Gly, in such close prox- 
imity, causes a steric collision of these two groups. 
In constructing the N-terminal portion of the chain, 
the conformation for residues 16—19 must be taken 
from the zymogen structure of chymotrypsin- 
ogen,^^*^^ since these residues cannot reside in the 
usual N-terminal He 16 binding pocket. This leaves 
the problem of what to do with Asp 194, the other 
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half of the normal salt bridge to the amino terminus 
at He 16. Both possible conformations of the aspar- 
tate, that of the zymogen and of the mature enzyme, 
must be considered based on zymogen crystal 
studies. ^^'^^ Unfortunately, neither conformation 
seems possible without a significant change in the 
local CFB structure. In the zymogen conformation, 
the side chains of Met 43 and Ala 140 (both usually 
glycines) prevent the Asp 194 side chain from occu- 
pying its usual place. In the mature enzyme form, it 
is the side chain of Phe 142 (also typically a glycine) 
that is too close to permit the usual conformer (see 
Fig. 8). 

The resulting changes that need to be made to 
accommodate these unusual side chains are difficult 
to predict. Does the structure undergo a large num- 
ber of small changes that results in the accommoda- 
tion of the above groups in close to the normal con- 
formation, or is there a significant change in this 
portion of the molecule that gives a very different 
conformation around Asp 194 and residues 139- 
142? It is not possible to answer this question using 
the energetics methods currently available. Such 
problems may have to wait either for experimental 
structure determinations or for improved under- 
standing of protein structure and energetics. There- 
fore, it is particularly important that comparative 
modeling methods, as described in this paper, allow 
us to recognize such regions of the molecule that are 
likely to be particularly difficult to construct reli- 
ably. 
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The t3^e II transmembrane multidomain serine pro- 
teinase MT-SPl/matriptase is highly expressed in many 
human cancer-derived cell Unes and has been impli- 
cated in extracellular matrix re-modeling, tumor 
growth, and metastasis. We have expressed the catalytic 
domain of MT-SPl and solved the crystal structures of 
complexes with benzamidine at 1.3 A and bovine pancre- 
atic trypsin inhibitor at 2.9 A. MT-SPl exhibits a trypsin- 
like serine proteinase fold, featuring a unique nine-res- 
idue 60-insertion loop that influences interactions with 
protein substrates. The structure discloses a trypsin- 
like SI pocket, a small hydrophobic S2 subsite, and an 
open negatively charged S4 cavity that favors the bind- 
ing of basic P3/P4 residues. A complementary charge 
pattern on the surface opposite the active site cleft sug- 
gests a distinct docking of the preceding low density 
lipoprotein receptor class A domain. The benzamidine 
crystals possess a freely accessible active site and are 
hence well suited for soaking small molecules, facilitat- 
ing the improvement of inhibitors. The crystal structure 
of the MT-SPl complex with bovine pancreatic trypsin 
inhibitor serves as a model for hepatocyte growth factor 
activator inhibitor 1, the physiological inhibitor of MT- 
SPl, and suggests determinants for the substrate 
specificity. 



The activity of proteolytic enzymes is required at multiple 
stages during the growth, invasion, and progression of human 
tumors (for a review, see Ref. 1). For example, these complex 
processes entail extensive re-modeling of the extracellular ma- 
trix as well as the activation of latent growth factors and 
pro-angiogenic proteins. Consequently, the high level expres- 
sion of particular proteinases often correlates with poor patient 
survival for several different cancers (see, for instance, Ref 2). 
Among these cancer-associated enzymes are serine proteinases 
such as urokinase-type plasminogen activator (uPA),^ elastase. 



♦ This work was supported by Grant SFB469 from the University of 
Munich, by Grant ERBFMRXCT98 from the Training and Mobility 
Program of the European Union, and by the Fonds der Chemischen 
Industrie. The costs of publication of this article were defrayed in part 
by the payment of page charges. This article must therefore be hereby 
marked "advertisement" in accordance with 18 U»S.C. Section 1734 
solely to indicate this fact. 

The atomic coordinates and structure factors (code lEAX (Bz-MT- 
SPI) and lEAW (BPTI-MT-SPl)) have been deposited in the Protein 
Data Bank, Research Collaboratory for Structural Bioinformatics, Rut- 
gers University, New Brunswick, NJ (http://www.rcsb.org/). 

11 To whom correspondence should be addressed. Tel.: 49-89-8578- 
2676; Fax: 49-89-8678-3516; E-mail: bode@biochem.mpg.de. 

^ The abbreviations used are: uPA, urokinase-type plasminogen acti- 
vator; sc-uPA, single-chain urokinase-type plasminogen activator; 



plasmin, and cathepsin G; matrix metalloproteinases including 
gelatinases, interstitial collagenases, stromelysins, matrilysin, 
and membrane-type metalloproteinases; and lysosomal cys- 
teine proteinases such as cathepsin B. Recently, several mem- 
bers of an important, emerging subfamily of serine proteinases, 
the t3rpe II transmembrane serine proteinases (TTSPs; re- 
viewed in Ref. 3), have also been implicated in tumor growth 
and progression. 

MT-SPl (matriptase/TADG-15/suppressor of tumorigenicity 
14; EC 3.4.21) was first isolated by Shi et aL (4) as a novel 
proteinase that was expressed by human breast cancer cells. 
The enzyme was initially assigned as a gelatinase, because of 
its gelatinolytic properties and gelatinase-like molecular 
weight. However, Isolation and sequencing of the cDNA re- 
vealed a 683-residue multidomain proteinase with a C-termi- 
nal serine proteinase domain. The enzyme was then named 
matriptase to emphasize its matrix degrading properties and 
trypsin-like specificity (5). Independently, Takeuchi and co- 
workers (6) cloned and characterized a t3rpe-II membrane- 
bound trypsin-like serine proteinase from a human prostatic 
cancer cell line, which they called membrane-type serine pro- 
teinase 1, MT-SPl. This 855-residue proteinase contained two 
tandem repeats of the complement component Clr/s domain 
(CUB, derived from complement factor/lR-urchin embryonic 
growth factor/bone morphogenetic protein) and four tandem 
repeats of the low density lipoprotein receptor (LDLR) class A 
domain between the N-terminal transmembrane sigpnal anchor 
and the C-terminal catalytic domain (5, 6). Because the 
matriptase sequence reported by Lin turned out to be part of 
the translated MT-SPl cDNA sequence, matriptase is likely to 
be a form of MT-SPl produced by ectodomain shedding (7). 
Alternatively, the two cDNAs may result from alternative 
splicing. MT-SPl is highly expressed in prostate, breast, and 
colorectal cancers in vitro and in vivo (8), and inhibition of this 
enzyme suppresses both primary tumor growth and metastasis 
in a rat model of prostate cancer (5, 6). A mouse homologue was 
cloned by another group and called epithin (9). 

The substrate specificity of MT-SPl has been mapped using 
a positional scanning synthetic combinatorial library and sub- 
strate phage display (10). The preferred cleavage sequences 
contained Arg/Lys at P4 and basic residues or Gin at P3, small 
residues at P2, Arg or Lys at PI, and Ala at PI'. This specificity 
profile corresponds well to the cleavage sequences of recognized 
surface localized protein substrates of MT-SPl such as the 
proteinase-activated receptor-2 (PAR2), single-chain uPA 
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(sc-uPA), the proform of MT-SPl, and the hepatocyte growth 
(scattering) factor, which have been shown in vitro and/or in 
vivo to be efficiently activated by MT-SPl (10, 11). 

Although human breast cancer cells produce MT-SPl pri- 
marily as the free enzyme, in human milk and normal tissues 
the enzyme is found in complex with an inhibitor called hepa- 
tocyte growth factor activator inhibitor 1 (HAI-1; Ref. 12). This 
membrane-bound inhibitor was originally isolated from human 
stomach carcinoma cells (13) as a 478 residue glycoprotein 
containing two Kunitz-type domains separated by an LDLR 
domain and followed by a transmembrane segment, but has 
been subsequently detected in several tissues. Soluble, presum- 
ably proteolytically cleaved forms of HAI-1, lacking the C- 
terminal hydrophobic domain, have also been reported (14). In 
addition to HAI-1, a smaller inhibitor (HAI-2) has been iden- 
tified and characterized, which lacks the LDLR domain sepa- 
rating the two Ktmitz-tjTpe modules in HAI-1. Site-directed 
mutagenesis studies suggested that the first Kunitz domain of 
HAI-2 is responsible for the inhibitory activity toward hepato- 
cyte growth factor activator (15). 

We have expressed and purified the catalytic domain of 
human MT-SPl (MT-SPl(cd)), and we have crystallized and 
solved the high resolution x-ray crystal structure of this en- 
zyme in the presence of benzamidine (Bz). Because recombi- 
nant HAI-1 was not available, we also determined the struc- 
ture of the MT-SPl complex with the Kunitz-type bovine 
pancreatic trypsin inhibitor (BPTI), which shares a 36% se- 
quence identity with the first Kunitz domain of HAI-1 and is a 
nanomolar range inhibitor of MT-SPl. These crystal structures 
provide important new insights into the molecular determi- 
nants of the unique specificity of MT-SPl not obtainable fi*om 
modeling (16), and give hints about the interaction of this 
proteinase with the physiological inhibitor HAI-1. This infor- 
mation is expected to facilitate the design of potent, selective 
small molecule inhibitors of MT-SPl that may yield lead com- 
pounds for the development of novel anti-cancer agents. Be- 
cause of the high accessibility of the substrate binding site, the 
MT-SPl(cd) crystals are well suited for soaking of small mole- 
cule inhibitors facilitating the fiirther elaboration of initial lead 
compounds. 

EXPERIMENTAL PROCEDURES 

Cloning and Purification — ^The human prostate adenocarcinoma cell 
line, PC-3, was purchased from ATCC (CRL-1435). PC-3 cells were 
lysed in Trizol reagent (Invitrogen, Carlsbad, CA) and total RNA was 
isolated according to the manufacturer's protocol. Poly(A)"^ RNAs were 
purified using oligo(dT) beads (Oligotex; Qiagen» Valencia, CA) and 
subsequently converted to single-stranded cDNAs by reverse transcrip- 
tion using ProSTAR first-strand reverse transcriptase-PCR kit 
(Stratagene, La JoUa, CA) and Superscript II RNase H~ reverse tran- 
scriptase (Invitrogen). Single-stranded cDNAs from PC-3 cell RNA were 
subjected to PCR with sense and antisense degenerate oligonucleotide 
primers (sense primer, 5'-TGGRT(I)VT(I)WS(I)GC(I)RC(I)CAYTG-3'; 
antisense primer, 5'-(I)GG(I)CC(I)CC(I)SWRTC(I)CCyT(I)RCA(I)G- 
HRTC-3', where R = A or G; V = G, A, or C; W = A or T; S =^ G or C; 
Y = C or T; and H = A, T, or C). The primer sequences corresponded to 
two highly conserved regplons in all chymotrypsin-like serine proteina- 
ses. PCR products were purified using a gel extraction kit (QIAquick gel 
extraction kit; Qiagen), ligated into pCR2.1-TOPO (Invitrogen), and 
transformed into Escherichia coli TOPIO cells (Invitrogen). To obtain 
additional MT-SPl cDNA sequences, both rapid amplification of cDNA 
ends and gene-specific amplification reactions were performed. A 
human prostate Marathon-Ready cDNA (CLONTECH, Palo Alto, CA) 
was used to isolate part of the cDNA encoding MT-SPl. The 3' region of 
MT-SPl cDNA was successfully obtained by a 3 '-rapid amplification of 
cDNA ends reaction using a gene-specific primer, 5'-CACCCCTTCTT- 
CAATGACTTCACCTTCG-3'. The 5' end of the MT-SPl proteinase 
domain was obtained by a PCR amplification reaction using two MT- 
SPl-specific primers. 5'-TACCTCTCCTACGACTCC-3' for the sense 
primer and 5'-GAGGTTCTCGCAGGTGGTCTGGTTG-3' for the anti- 
sense primer. These fragments were subcloned into pCR2.1-TOPO. 



After trfinsformation into E. coli cells, the insert DNAs were charac- 
terized by Southern blot analysis (using the internal cDNA fragment as 
probe) and by DNA sequence analj^is. To obtain a cDNA encoding the 
entire proteinase domain of MT-SPl, an end-to-end PCR amplification 
using the gene-specific primers 5'-TCTCTCGAGAAAAGAGTTGTTGG- 
GGGCACGGATGCGGATGAG-3' for the 5' end and 5'-ATTCGCGGC- 
CGCCTATACCCCAGTGTTCTCTTTGATCCA-3' for the 3' end. An 
800-bp DNA fragment was amplified, purified, digested with Xhol and 
Notly and subcloned into the Pichia pastoris expression vector, 
pPIC9KX. Transformation was performed in a Bio-Rad GenePulser II 
(voltage = 1500 V, capacity = 50 microfarads, and resistance = 200 
ohms). The screening of transformed Pichia clones for MT-SPl expres- 
sion was performed by testing clones with Spectrozyme t-PA (CH3SO4- 
D-HHT-Gly-Arg-pNA.HCl; American Diagnostica). 

The production of multimilligram amounts of MT-SPl was carried 
out by fermentation in a BioFlo 3000 fermentor (New Brunswick Sci- 
entific, NJ) using a SMD1168/pPIC9K:MT-SPl Sac SCI clone. The 
medium was inoculated with an overnight culture of the P. pastoris 
transformant. Cells and cell debris were removed by centrifugation, the 
supernatant was concentrated, £md the buffer was exchanged into 60 
mM Tris-HCl, 50 mM NaCl, 0.05% Tween 80, pH 8.0 (buffer A). The 
concentrated MT-SPl-containing solution was applied onto a 150- ml 
benzamidine column equilibrated with buffer A, and the column was 
washed with 50 mM Tris-HCl, 1.0 m NaCl. 0.05% Tween 80, pH 8.0 
(buffer B), and eluted with 50 mM Tris-HCl, 1.0 m L-arginine, 0.05% 
Tween 80, pH 8.0 (buffer C). Fractions containing MT-SPl activity were 
pooled and concentrated. The buffer was exchanged into 50 mM 
Na2HP04, 125 mM NaCl. pH 5.5 (buffer D), and the partially purified 
MT-SPl was passed through a Q-Sepharose Fast Flow HiTrap column 
(Amersham Biosciences, Inc.) pre-equilibrated with buffer D. The flow- 
through was collected, and the protein concentration was determined 
by measurement of A.2tjo (using an extinction coefficient of 2.012 mg^ 
Aaao)- Purified MT-SPl was deglycosylated with endoglycosidase H 
(ProZyme, 5 units/ml) and further purified on a Akta Explorer system 
using a 7-ml Source 15Q anion exchange column (Amersham Bio- 
sciences, Inc.). The protein was eluted in a buffer containing 50 mM 
HEPES, pH 6.5, with a 0-0.33 m NaCl gradient. Fractions containing 
protein were pooled, and benzamidine was added to a final concentra- 
tion of 10 mM. The protein purity was examined by SDS-PAGE, and the 
protein concentration was determined by measurement ofA^so- 

Crystallization, Structure Determination, and Crystallographic Re- 
finement — Plate-like crystals of the Bz-MT-SPl complex were grown 
from 0.1 M Tris-HCl, pH 8.0, 1.5 M ammonium sulfate, 3% ethanol at 
18 "C using the hanging drop vapor diffusion technique. These crystals 
belong to the orthorhombic space group diffract x-rays to beyond 
1.3-A resolution, and have one molecule in the asymmetric unit. The 
Bz-MT-SPl crystals were transferred to 0.1 m Tris-HCl, pH 8.0, 1.5 M 
ammonium sulfate, 23% glycerol. A complete native data set to 1.3-A 
resolution was collected from a single crystal under a nitrogen stream 
at 100 K using synchrotron radiation and a CCD system (MAR Re- 
search, Hamburg, Germany) at DESY, Hamburg, Germany. 

These data were evaluated with the MOSFLM package (43) and 
loaded and scaled using SCALA firom the CCP4 program suite (17). For 
the determination of the orientation and position of the MT-SPl mole- 
cules in the crystals, rotational and translational searches were per- 
formed with AMoRe (18) using data from 20- to 3. 6- A resolution and a 
modified enteropeptidase search model (19) with all nonidentical resi- 
dues reduced to Ala. A unique solution was found with a correlation 
factor of 39.4% and an/2-factor of 46.2%; the corresponding values of the 
next best solution were 12.8 and 56.3%, respectively. Crystallographic 
refinement was done in several cycles consisting of model building 
performed with MAIN (20) and conjugate gradient minimization and 
simulated annealing using CNS (21). The target parameters of Engh 
and Huber (22) were used. This procedure converged rapidly, yielding a 
model with excellent parameters (see Table I). In the final model build- 
ing/refinement cycles, water molecules were inserted at stereochemi- 
cally reasonable sites, and individual restrained atomic B-values were 
refined. 5.1% of all reflections were omitted from the refinement to 
calculate the/Z^^; the final R and Rf^ are 18.4 and 19.3%, respectively, 
for all data to 1.3 A. The whole main chain of the MT-SPl catalytic 
domain is in appropriate electron density. Only a few side chains 
projecting out into solution are partially undefined in the electron 
density; the occupancy of all undefined atoms was set to zero. Modeling 
was performed interactively using MAIN; these models were energy 
refined with CNS. 

Crystals of the complex were grown from 0.1 m Hepes, pH 6.5, 20% 
polyethylene glycol 4000, 2% CsCl, at 18 "C using the hanging drop 
vapor diffusion technique. These crystals belong to the triclinic space 



2162 



Crystal Structures ofMT-SPl 



Table I 
Crystallographic data 



Inhibitor 


6z 


BPTI 


Data collection 






Space group 


C222 


PI 


Molecules in asymmetric unit 


1 


2 


Cell constants 






a 


66-92 A 


• 

47.10 A 


b 


141.60 A 


54.23 A 


c 


51.94 A 


67.82 A 


a 


90.00 


107.62' 




90.00' 


96.86* 


7 


90.00 


103.36 


Limiting resolution 


1.30 A 


2.93 A 


Emerge, overall 


8.6% 


12.0% 


^mcrao* outermost shell 


29.4% 


34.3% 


Unique reflections 


58,805 


12,200 


Completeness, overall 


96.2% 


94.4% 


Completeness, outermost shell 


93.5% 


93.1% 


Non-hydrogen protem atoms 


1864 


4618 


Heterogen atoms 


14 


0 


Solvent atoms 


381 


80 


Refinement statistics 






Test set size 


5.1% 


8.1% 


Kesolution range 


18.0 -1.30 A 


12.0 -2.93 A 


Completeness, overall 


96.2% 


94.4% 


Completeness, outermost shell 


91.7% 


93.1% 


R 


18.4% 


19.9% 


R, outermost shell 


23,7% 


30.0% 




19.3% 


27.9% 


Rrrxx^ outermost shell 


25.0% 


40.8% 


Root mean square standard 






deviations 






Bond length 


0.010 A 


0.007 A 


Bond angles 


1.6* 


1.4* 


Average B-value/S.D. 


13.7 A* 


14.4 A'* 


Ramachandran plot 






Most favored region 


89.3% 


74.3% 


Favored region 


10.7% 


23.5% 


Generously allowed region 




2.3% 



group Pi, diffract x-rays to beyond 2.9-A resolution, and have two 
molecules in the asymmetric unit. After transfer of one crystal to its 
mother solution containing 20% glycerol, a complete native data set to 
2.93-A resolution was collected from a single crystal under a nitrogen 
stream at 100 K using an Image Plate system (MAR Research). The 
data were processed as described above, with a combined model of the 
BPTI-trjrpsin complex (23) and our coordinates as replacement input. 
The procedure converged rapidly, leading to R and Rf^^ of 19.9 and 
26.9%, respectively. 8,1% of the reflections were omitted from the re- 
finement for the calculation of the Rtroe- 

RESULTS 

Overall Structure of the MT-SPl Catalytic Domain — Our 
cloning, expression, and purification procedure described un- 
der "Experimental Procedures" yielded multi milligram quanti- 
ties of highly purified MT-SPl(cd), which in the presence of Bz 
formed crystals diffracting to beyond 1.3-A resolution. MT- 
SPl(cd) resembles an oblate ellipsoid with diameters of 35 and 
50 A. Similar to other trypsin-like serine proteinases, the chain 
is folded into two adjacent six-stranded /3-barrels strapped 
together by three trans-domain segments. The surface contains 
several turn structures, a S^o'l^elix (residues 60(1) to 64, using 
the ch3rmotrypsinogen numbering, see Figs. lA and 2), and two 
a-helices (segments 164-172 and 235-243). The catalytic triad 
is located along the junction of the barrels, whereas the active 
site cleft runs perpendicular to this junction. 

Recombinant MT-SPl(cd) consists of the B-chain of mature 
MT-SPl. The chain starts with Val^® (corresponding to 
Val®^'^(g) in the generic MT-SPl sequence numbering; Ref. 6). 
Cys^^^, which would be disulfide-linked with Cys^ of the A- 
chain in the full-length MT-SPl molecule, is an unpaired sur- 
face located cysteine residue in this construct. The MT-SPl 
B-chain contains three disulfide bridges (Cys^^-Cys^®, Cys^®*- 



Cys^®^, and Cys^^^-Cys^^°) that are also present in most other 
trypsin-like serine proteinases. However, based on the electron 
density of the 1.3-A Bz-MT-SPl(cd) structure, the disulfide 
bridge Cys'^^-Cys^ is present in only about half of the crystal- 
lized protein molecules. In the other half, these cysteines 
clearly exist in the reduced form, with Cys^ Sy located in the 
same position as in the Cys^^-Cj^^ disulfide, and Cys"*^ Sy 
rotated toward the interior of the molecule between Tyr^^ Oy 
and the Leu^ side chain, thereby avoiding steric hindrance 
with the adjacent Cys^ thiol group. A similar partial opening 
of the C3^'*^-Cys®® disulfide bridge has recently been found in a 
high resolution structure of the recombinant human uPA cat- 
alytic domain (24). 

The a-ammonium group of the N-terminal Val^^ of MT-SPl 
forms the highly conserved internal salt bridge with the side 
chain carboxylate of Asp^^'*, stabilizing the substrate binding 
site and the active site in the catalytically active conformation. 
In contrast to most (ch3rmo)trypsin-like proteinases, the whole 
C-terminal region of MT-SPl, including the last residue, 
Val^^"*, is fully defined by electron density. After the conserved 
C-terminal or-helix, the MT-SPl polypeptide makes a sharp 
turn at Gly^'*^ and forms an as yet unobserved surface-located 
salt bridge with the Arg^^^ g^anidyl group via its C-terminal 
carboxylate group. 

An optimal superposition with several related serine protein- 
ases reveals highest topological similarity of MT-SPl(cd) with 
the catalytic domain of another membrane-ts^pe serine protein- 
ase, enteropeptidase/enterokinase (Fig. LB). 222 C^ atoms of 
topologically equivalent residues are found within a 2.0-A dis- 
tance, corresponding to an root mean square deviation of 0.70 
A, with 109 of these topologically equivalent residues being 
identical. The next best fit, with a 0.73-A root mean square 
deviation for 212 C« atoms, is observed with bovine trj^sin 
(25), followed by bovine ch3niiotr3rpsin (26) and human throm- 
bin (27). The topological equivalence with chyinotrypsin(ogen) 
formed the basis for the sequence alignment and the chymot- 
rypsinogen numbering of the MT-SPl catalytic domain used in 
this paper (Fig. 2). This alignment requires a nine-residue 
insertion between residues Ile®° and Pro®^, single-residue in- 
sertions behind residues Gly^®*, Glu^®®, Ala^°^ and Ala^^^, and 
single-residue deletions at positions 149 and 218. Inserted res- 
idues are marked by suffixes following the residue number of 
the preceding conunon residue. 

Loops Surrounding the Active Site Cleft — The narrow sub- 
strate specificity of MT-SPl arises from unique structural de- 
terminants of its active site cleft that is shaped in part by the 
surrounding surface loops. In the following brief description, 
these loops (defined according to the residue number of their 
central residue) will be addressed in an anticlockwise manner 
with respect to the standard orientation displayed in Fig. lA. 

To the east of the cataljdic triad, the rigid 37 loop projects out 
of the molecular surface of MT-SPl. This loop contains two of 
the "zjmiogen triad" residues, Ser^^ and His'*'^, which, together 
with Asp^^**, would stabilize the inactive zymogen-like confor- 
mation of the proenzyme (28, 29). Around Gln^®, this loop 
deviates markedly from the path followed in most other chy- 
motrypsin-like proteinases (Fig. LB). This conformation seems 
to be mainly stabilized by the Gln^ side chain, which is held in 
an exposed position via hydrogen bonds made through its ter- 
minal carboxamide group with Tyr^(G) Oy and with the 
ir-electron system of the Phe^(E) phenyl ring. 

The most striking feature of MT-SPl is the unusually large 
60 insertion loop, which is of the same length (nine additional 
residues) and exhibits a similar /3-hairpin conformation as the 
corresponding loop in thrombin (27), but is oriented differently 
(Fig. LB). The eight residues of MT-SPl between Tyr^® and 
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Fig. 1. Ribbon stereo plots of MT- 
SPl(cd) shown in standard orienta- 
tion. Figure was prepared with SETOR 
(38). A, the fold, secondary structure ele- 
ments, and loops of MT-SPl. Helices and 
/3-strand8 are given as red helices and 
blue strands. The active site residues, 
Asp^^^, and the disulfide bridges (yellow) 
are shown as stick models. B, MT-SPl 
{red) superimptosed with the enteropepti- 
dase catalytic domain (green; Ref 19), bo- 
vine chymotrypsin A (yellow; Ref 39), and 
the heavy chain of human a-thrombin 
(blue; Ref 27). The active site residues of 
MT-SPl, Asp^^°, and the bound benzami- 
dine are shown as stick models. 
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Fig. 2. Structure-based amino acid sequence alignment of the human MT-SPl catalytic domain with human enteropeptidase (40), 
human plasmin, bovine chymotrypsinogen A, bovine trypsin, and human thrombin. The numbers given correspond to the chymotrypsino- 
gen numbering. /3-Sheets and cr-helices in MT-SPl are indicated by arrows and cylinders, respectively. Figure was made with ALSCRIPT (41). 



Ser^(H) form a protruding irregular j3-hairpin loop, which is 
stabilized through internal main chain-side chain hydrogen 
bonds made by the Asp®^(A) and Asp^(B) carboxylate groups. 
In thrombin, the rigid 60 loop partially occludes the active site 
and hence contributes to the narrow substrate specificity (27). 
In MT-SPl, the loop is rotated away from the active site cre- 
ating a more open cavity than in thrombin. Unlike the basic 
side chains of Lys®°(F) in thrombin or Arg®°(G) in enteropepti- 



dase (19), the Arg^°(F) side chain of MT-SPl is directed away 
from the active site cleft and located between the side chains of 
Asp®°(A) and Asp^(I). 

To the north of the active site, the 99 loop of MT-SPl pro- 
trudes from the molecular surface and forms a ''roof on top of 
the active site cleft. Noteworthy is the clustering of aromatic 
side chains around this loop. The benzyl side chain of Phe^, 
which protects the hydrogen bonding interaction between the 
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Fig. 3. Stereo section of the Bz-MT-SPl structure around the SI pocket, superimposed with the final 2F„b, - electron 

density map contoured at l.Oo-. The MT-SPl model is slightly rotated compared with the standard orientation shown in Fig. 1. Figure was made 
with MAIN (20). 



catalytic residues Asp^°^ and His^"^ from the solvent, delimits 
the 82 subsite. Unique to MT-SPl is the well defined benzyl 
side chain of Phe®^, which extends away from the apex of the 99 
loop partially shielding the circularly arranged carbonyl groups 
of Asp^, Phe^"^, and Thr^^ (see Fig. 3), 

The "autolysis" or 145 loop of MT-SPl is one residue shorter 
and less exposed than the corresponding segment of chymo- 
trypsin. This loop encircles the extended side chain of Ifis^^^ 
and forms the southern boundary or **floor" of the active site 
cleft (Fig. 4). The presence of Gly^®^ in this loop is noteworthy, 
because this would allow for the accommodation of bulkier P2' 
side chains of peptide substrates. 

In the pancreatic serine proteinases, the 70—80 loop forms 
the calcium binding site, with the carboscylates of Glu*^^ and 
Glu^ coordinating the calcium ion. In MT-SPl, both positions 
are occupied by aliphatic residues (Leu''^^ and Val®°), hence 
rendering the catalytic domain calcium-independent. Together 
with the proximal side chain parts of Arg*^^ and Ala'^'^(A), the 
hydrophobic side chains at positions 70 and 80 clamp both loop 
ends together in a manner similar to that for the ion metal in 
the calcium-containing serine proteinases. 

Active Site and Substrate Binding Sites of MT-SPl~The 
residues of the active site triad, Ser^®^, His^'', Asp^^^, and other 
catalytic elements such as the oxyanion hole created by the 
main chain nitrogens of Gly^®^ and Ser^®^ are arranged in the 
active site cleft exactly as in trypsin and chjonotrypsin. The 
specificity pocket 81, which opens to the west of Ser^^^, is 
bordered by segments Val^^^-GIy^^o, Ser^®°-Ser^®^, Pro^^^^ 
Tyr228, and the Cys^^^-Cys^^o disulfide bridge. The side chains 
of all MT-SPl residues lining the interior of this pocket with 
their side chains are virtually superimposable with the corre- 
sponding residues in trypsin. This applies not only to Asp^®* at 
the bottom of the pocket, determining the specificity for basic 
residues, but also to Ser^®°, making Lys PI residues equally 
acceptable as Arg, and residues forming the inner wall (Gly^®, 
Tyr^®, and VaP^^). The phenyl group of the bound benzami- 
dine molecule is sandwiched between the parallel peptide 
groups Trp2ifi.Gly216 and Cys^^^-Gln^®^ whereas its amidino 
group opposes the Asp^®® carboxylate at the bottom of the 
pocket forming a two-O/two-N salt bridge, with the distal ni- 




FiG. 4. An Arg-Gln-Ala-Arg i Val-Val hexapeptide substrate 
with the sequence of the autoactivation site of pro-MT-SPl, 
modeled In canonical conformation against the MT-SPl active 
site, based on protein inhibitor-trypsin complexes (30). MT-SPl 
is represented by its solid surface, with the colors indicating positive 
and negative electrostatic potential contoured from 15 kT/e (intense 
blue) to -15 kT/e (intense red). Figure was made with GRASP (42). 

trogen atom additionally hydrogen-bonded to Gly^^^ O and to a 
buried solvent molecule. 

An extended S2/S4 pocket based on the indole moiety of 
rpjp2i5 -g located above the SI pocket. This pocket is delimited 
to the east by the flat side of the His^^ imidazole ring and to the 
north by the flat side of the Phe^ benzyl moiety, and shaped to 
accept small hydrophobic P2 residues (Fig. 4). The adjacent 84 
pocket is bordered by the carbonyl groups of Asp®®, Phe®'', and 
Thr^®, which are partiaUy shielded firom solvent by the benzyl 
group of Phe^, a residue rarely present at this position in 
(chymo)trypsin-like proteinases. Because of the presence of the 
TT-electron system of Phe^ and the nearby Asp^, this S4 pocket 
is well suited to accommodate positively charged side chains. It 
is noteworthy that the benzyl group of Phe^ is fiiUy defined in 
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Fig. 5. Section of the complex formed between BPTI (blue color) and MT-SPl (orange) around the contact interface, superimposed 
with parts of the Bz-MT-SPl structure (red). The active site residues and Asp^^^ {green), the benzyl groups of Phe^^ and Phe***, and Asp®®(B) 
and Arg^(C) from the 60 loop of both MT-SPl structures, as well as Pro'^(I). Lys^'^d), Ala'M). Arg^'^d), Ile^^d), and Arg^^d) of BPTI are shown 
as stick models. The large rotation of the Phe**** side chain and the adaptation of the 60 loop upon binding of BPTI are clearly visible. The view is 
slightly tilted in comparison with Fig. 1. Figure was made with SETOR (38). 



Bz-MT-SPl, in striking contrast to the equivalent Arg^^ resi- 
due in enteropeptidase (19). The hydrophobic S17S3' cavity to 
the east of Ser^^^ is centered on the Cys'*^-Cys^® disulfide 
bridge, and bordered by the isobutyl side chain of Ile'*^ and the 
aromatic side chains of Tyr®^(G) and Trp^. These positions can 
thus be preferentially filled with large hydrophobic PI' or P3' 
side chains, but also the adjacent S2' pocket, lined by Gln^®^ 
and His^*^, gives space for hydrophobic residues (Pig. 4). 

Interaction with Kunitz-type Inhibitors — Because the proto- 
typic Kunitz-type inhibitor BPTI is closely related to the first 
Kunitz domain of HAI-1 at the sequence level, we crystallized 
the complex of BPTI with MT-SPl and solved its structure at 
2.9-A resolution (Figs. 5 and 6). BPTI docks into the concave 
substrate binding surface of MT-SPl via the reactive site loop 
(Thr^Hl) to ne^®(I), with the Lys^^d) i Ala^^d) scissile bond) 
and the secondary binding segment (Gly^®(I) to Arg^(I)), sim- 
ilar to the prototypical trypsin complex (23). The reactive site 
loop of BPTI runs anti-parallel to MT-SPl segments Ser^^'*- 
Asp2" and His^^-Be^S with the Pl-Lys^^(I) side chain extend- 
ing into the SI pocket and interacting via its €-ammonium 
group with the Asp^®® carboxylate. Because of the main chain 
kink at P3-Pro^'^(I), which is tj^iical for Kunitz-type Inhibitors, 
the P3 pyrrolidine ring nestles into the S4 depression. 

The Phe®^ side chain, if positioned as observed in Bz-MT- 
SPl, would clash with the Cys^*(I)-Cys^®(I) disulfide bridge. In 
the BPTI complex, this clash is avoided by a rotation of the 
Phe^ benzyl group away fi*om this site, in this way enlarging 
the S2 cavity and making it accessible for the disulfide bridge 
(Fig. 5). The Arg^^(I) side chain extends toward the 99 loop of 
MT-SPl, with its guanidyl group stacking between the Phe^^ 
and the Phe^ benzyl moieties and hydrogen bonding to the 
Phe^'^ carbonyl oxygen much more favorably than in the BPTI- 
trypsin complex, because of the additional shielding by the 
Phe®^ side chain. On the primed side of the reactive center loop, 
the Pl'-Ala^^(I) and the PS'-Be^^d) side chains form a hydro- 
phobic knob that interacts with the hydrophobic S17S3' cavity. 

The characteristic protruding 60 loop of MT-SPl provides a 
large extra surface to make a number of favorable new contacts 
with BPTI, and hence with any other canonicaUy bound Kunitz 
domain (Fig. 5). A comparison with Bz-MT-SPl shows that 
BPTI induces some conformational changes in the 60 loop, to 
allow for instance the formation of a salt bridge between 




Fig. 6. Docking of BPTI (yellow) and of the (covalently linked) 
fourth LDLR domain (green) to opposite sites of the MT-SPl 
catalytic domain. Solid surface representations, with the colors of the 
MT-SPl component showing negative (red) and positive (blue) surface 
potential (as above). Some surface residues and the thrombin-like anion 
binding exosite I are indicated by labels. The complex is rotated about 
around a vertical axis for 90" compared with the standard orientation in 
Fig, 1. Figure was made with WEBLABVIEWER (www.msi.com). 

Asp^(B) (MT-SPl) and Arg^^d), and of charged hydrogen 
bonds between Arg®°(C) and the carbonyl groups of Lys'*^(I) 
and Asn^(I). The well defined 60 loop of MT-SPl thus exhibits 
some capability to adapt to the rigid Kunitz-type inhibitor. In 
this respect it strongly differs from the also exposed but much 
more rigid thrombin 60 loop, which delimits the S2 cavity and 
normally prevents access of bulkier substrates and protein 
inhibitors (30). The 60 loop of MT-SPl, in contrast, does not 
impair BPTI binding, but instead strengthens it by forming a 
number of additional favorable interactions (Figs. 5 and 6). 

DISCUSSION 

The catal3^ic domain of human MT-SPl reveals the overall 
fold of a (chymo)trypsin-like serine proteinase, but displays 
unique properties such as the hydrophobic/acidic S2/S4 sub- 
sites and an exposed 60 loop, which considerably affect its 
substrate recognition and binding properties. The MT-SPl 
polypeptide fold deviates notably from that of the serine pro- 
teinase tryptase, however, which exhibits six novel surface 
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loops through which the tryptase monomers form the intermo- 
lecular interactions that stabilize the unusual tetramer struc- 
ture (31). Thus, MT-SPl is not closely related to tryptase, and 
the name '"matriptase" is therefore potentially confusing. How- 
ever, because enterokinase and hepsin are members of the 
^T-SP* family that were discovered and described before 
matriptase, assignment of the name MT-SPl to matriptase 
may also produce controversy. One potential solution to these 
nomenclature issues would be to adopt either the ''MT-SP" or 
the TTSP (3) terminology that was suggested previously by 
others, and to assign numbers to individual family members 
based on their date of discovery. 

MT-SPl can cleave selected synthetic substrates as effec- 
tively as trypsin, but exhibits a significantly more restricted 
specificity than trypsin (10). This may reflect, on the one hand, 
the near identity of the SI specificity pockets of these two 
enzymes, presumably allowing both tight binding of substrates 
and optimal presentation of their scissile peptide bonds to the 
enzyme active site, and, on the other hand, clear structural 
distinctions between the extended binding subsites of the two 
proteinases. With respect to shape and chemical composition, 
the SI pocket of MT-SPl (and trypsin) provides good comple- 
mentarity for Lys as well as arginine PI residues. The efficient 
accommodation of Pl-Lys residues, which has been demon- 
strated experimentally (5, 10), is facilitated by Ser^^, whose 
side chain provides an additional hydrogen bond acceptor to 
stabilize the buried a-ammonium group. Pl-Arg residues are 
accommodated equally well, because of the overall better space 
filling by the guanidinium group (30). 

The hydrophobic S2 groove of MT-SPl is shaped to accom- 
modate small to medium-sized hydrophobic side chains of P2- 
L-amino acids, with a wide hydrophobic exit toward the bulk 
solvent that would expose longer and more polar side chains. In 
addition, the rotation of the Phe^ benzyl group upon BPTI 
binding suggests that the S2 subsite of MT-SPl is not rigid. 
This observation is consistent with experimental findings fi-om 
positional scanning (10), which indicated that MT-SPl accepts 
a broad range of amino acids in the P2 position. 

Craik and co-workers (10) have reported previously that 
MT-SPl exhibits a strong preference for peptide substrates 
that contain Arg or Lys at P4 position, and a certain preference 
for either Gin or basic residues at P3. This interesting speci- 
ficity may be mediated by electrostatic interactions with the 
acidic side chains of Asp-217 and/or Asp-96 (see Fig. 4), which 
could favorably pre-orient specific basic peptide substrates as 
they approach the enzyme active site cleft. Because canonically 
binding substrates align antiparallel to the Ser^^'^-Asp^^^ seg- 
ment in an extended conformation, the side chains of P4-Arg or 
Lys residues presumably extend toward the 99 loop with their 
guanidyl or ammonium groups not reaching the Asp®® carbox- 
ylate directly but forming a charged hydrogen bond with Phe^'^ 
O, In addition, they can favorably interact with the overall 
negative potential created by the 96-, 97-, and 98-carbonyls, in 
full agreement with the preference of MT-SPl for basic P4 
residues. This MT-SPl S4 subsite is reminiscent of the corre- 
sponding region of coagulation factor Xa, which has also been 
shown to function as a cation binding site (32). The Arg^®(I) 
side chain of BPTI (see Fig. 5), although provided by the 
secondary binding segment of the inhibitor, is a nice example 
of such interactions; in the BPTI-MT-SPl complex, the hy- 
drogen bond between the Arg^^(I) guanidyl and the Phe^^ O is 
shielded from solvent by the unique Phe®^ benzyl side chain 
(see Figs. 4 and 5), which would significantly strengthen 
these interactions. 

In canonically boimd peptide substrates, the side chain of a 
P3 residue projects out of the active site cleft, where it can 



hydrogen-bond the carboxamide group of Gln^ . It is also 
possible, however, that a bound substrate could adopt a 
"kinked" conformation at the P3 position, as seen for Kunitz- 
type inhibitors (Fig. 5). In this alternative conformation, the P3 
side chain extends into the S4 subsite, i,e, toward the 99 loop. 
To allow the guanidyl or the ammonium group of a P3-Arg or 
Lys residue, boimd to MT-SPl in the alternative conformation, 
to form direct hydrogen bonds with Phe®^ O, the substrate 
main chain around P3 would have to rotate and thereby 
weaken considerably the inter-main chain hydrogen bonds to 
Qjy2i6 either conformation, however, a basic P3 side chain 
would interact favorably with the negative potential of the 
MT-SPl S4 pocket (Fig. 4). Long range electrostatic interac- 
tions between side chain charges and complementary surface 
potentials are often found in protein -protein complexes. For 
instance, in the thrombin complexes with hirudin and other 
protein inhibitors, the removal of the charges of acidic residues 
not involved in direct salt bridges has been shown to affect the 
electrostatic interactions strongly (33, 34). The relatively low 
probability of the simultaneous occurrence of Arg/Lys residues 
at P3 and P4 in good MT-SPl substrates (10) would then be 
explained by mutual charge compensation and exclusion from 
the same (S4) site. 

The specificity of MT-SPl differs substantially from that of 
trypsin, because MT-SPl does not indiscriminately cleave pep- 
tide substrates at accessible Lys or Arg residues, but instead 
requires recognition of additional residues surrounding the 
scissile peptide bond. This requirement for recognition of an 
extended primaiy sequence for efficient catalysis of substrates 
suggests that MT-SPl is a relatively specific proteinase that 
may play a regulatory role (5, 6). Recognition of an extended 
primary sequence appears to be also required for efficient 
cleavage of macromolecular substrates by MT-SPl. Efficient 
auto- activation of MT-SPl entails recognition and cleavage of 
an Arg-Gln-Ala-Arg P4-P1 target sequence. MT-SPl can also 
efficiently activate the proteinase-activated receptor-2 (PAR2), 
sc-uPA (10), and the hepatocjrte growth factor/scatter factor 
(11). These extracellular surface-localized proteins display the 
P4 to PI target sequences Ser-Lys-Gly-Arg, Pro-Arg-Phe-Lys, 
and Lys-Gln-Gly-Arg, respectively, which match closely the 
MT-SPl cleavage specificity requirements observed for small 
peptidic substrates (10). Another indication of the substrate 
specificity of MT-SPl is that the enzyme does not activate 
proteins closely related to these substrates, such as PAR-1, 
PAR-3, PAR-4, and plasminogen, that do not display target 
sequences matching the extended MT-SPl specificity near the 
scissile bond. 

Because MT-SPl has been foimd co-localized with sc-uPA in 
several cell types, it has been suggested that MT-SPl may be a 
physiologically relevant activator of this proteinase zymogen 
(10). uPA plays an important role in angiogenesis and/or tumor 
progression and can also activate both plasminogen and matrix 
metalloproteinases (35). The latter, in turn, play important 
roles in matrix degradation and re-modeling, events that are 
required both during angiogenesis and tumor invasion and 
metastasis. In addition, MT-SPl can directly activate hepato- 
cyte growth factor, a protein that promotes cell growth as well 
as angiogenesis; therefore, it may play both direct and indirect 
roles in cell growth and migration and, when improperly reg- 
ulated, may contribute to tumor angiogenesis, growth, and 
progression. 

In normal tissues, the proteol3^ic activity of proteinases is 
carefully controlled localizing their action in time and space. As 
mentioned above, HAI-1, a type-II transmembrane protein con- 
taining two Kunitz-type domains, appears to be the primary 
physiological inhibitor of MT-SPl (12). Based on the structure 
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of our BPTI-MT-SPl complex, we have modeled the complex 
between MT-SPl and the first Kunitz domain of HAI-1, which 
is 36% identical to BPTI and appears more likely to exhibit 
high affinity for the MT-SPl active site than the second HAI-1 
Kunitz domain. This model suggests that a large number of 
favorable interactions could form between the first HAI-1 
Kunitz domain and MT-SPl, both between the Gly^^(I)- 
Arg^3(i).cysi4(i).^gi5(i) I Gly*«(I)-Ser"(I)-Phe^^(I) reactive 

site loop (using the BPTI nomenclature, with Arg^^(I) i Gly^®(I) 
representing the scissile bond) and the active site cleft, and also 
at secondary interaction sites such as made by the 60 loop. In 
the primed side, the first HAI-1 Kunitz domain possesses an 
uncommon P3'-Phe^®(I), which could, because of the small Pl'- 
Gly residue, be nicely packed in the large hydrophobic S17S3' 
pocket of MT-SPl (Fig, 4). The side chain of the P3-equivalent 
Arg^^(I) is expected to extend into the S4 subsite, where its 
guanidyl group would make favorable electrostatic interactions 
with the 99 loop carbonyls, similar to those observed for 
Arg^^d) in the BPTI complex (see Fig. 5). 

The putative G\y^\D-l^u^\iyCyB^\l)-Lys^\l) i Glu^^d)- 
Ser^^(I)-Ile^®(I) reactive site loop of the second Kunitz domain 
of HAI-1, in contrast, does not match the reported substrate 
specificity of MT-SPl. The second HAI-1 domain does, however, 
possess a number of negatively charged residues in its C- 
terminal a-helix that could form favorable electrostatic inter- 
actions with basic surface residues of MT-SPl that map in or 
near the region corresponding to the anion binding exosite I of 
thrombin (such as Arg'^^, Arg^, Arg®^, Lys^^**; see Fig. 6). Such 
an additional exosite binding of the second HAI-1 domain ac- 
companying the interaction of the first domain with the MT- 
SPl active site would considerably increase the affinity and 
specificity of HAI-1 for MT-SPl. A similar cooperative action 
with consequent increase in affinity and specificity has been 
previously observed in the complex between thrombin and or- 
nithodorin, a two-domain Kunitz-type inhibitor derived ft-om 
the blood-sucking tick Ornithodoros moubata (36), where the 
first and the second Kunitz domains interact (noncanonically, 
however) with the active site and electrostatically with exosite 
I, respectively, of thrombin. 

The role of the four LDLR domains that precede the catal3rtic 
domain remains unclear, but they have been implicated in 
mediating interactions with other membrane or membrane- 
associated proteins (5, 6). The LDLR (4) domain of MT-SPl was 
modeled (Fig. 6) based on the structure of the fifth low density 
lipoprotein class A binding domain of LDLR (37), which shares 
a conserved disulfide bonding pattern and a 45% amino acid 
identity with LDLR (4). In full-length MT-SPl. the last cys- 
teine residue (Cys^'*(g)) of LDLR (4) is separated by only one 
amino acid (Asp®°^(g)) from the first residue (Oys^°®(g) = Cys^) 
of the catalytic domain, which, in turn, forms an intradomain 
disulfide bridge with Cys^^^. This implicates close proximity of 
both modules. A careftil inspection of the electrostatic poten- 
tials suggests a distinct rotational orientation of LDLR (4) and 
the catalytic domain relative to each other, which would create 
both favorable interactions between electrostatic potentials of 
the two domains and a good steric fit of the two complementary 
surfaces. Our docking experiment predicts four interdomain 
salt bridges (Glu^^3(g)-Arg2o«, Lys^^2(g).^pi25^ Asp«»^(g)- 
Arg^^®, and Asp®^®(g)-Lys®°^(g)) and charged hydrogen bonds 
fix)m Lys=^® and Arg^^g to the 589(g)-590(g) backbone. The 
LDLR (4) domain of MT-SPl carries all four acidic side chains 
(of Asp^^(g), Asp^®'*(g). Asp^(g), and Glu«°Hg)) engaged in 
calcium coordination in the fifth low density lipoprotein-A do- 
main of LDLR (37). The suggested association of LDLR (4) with 
the catalytic domain does not directly involve this putative 
calcium binding site, which would be located close to the inter- 



face, however. The location of the LDLR (4) domain on the 
proteinase surface opposite to the active site cleft allows ir- 
restricted substrate or inhibitor binding to MT-SPl. 

It has only recently been appreciated that TTSPs represent 
an important, emerging subfamily of the (chymo)trypsin en- 
zjnme family. With the exception of enterokinase, an important 
digestive enzyme with a unique substrate specificity, the bio- 
logical roles of individual members of this recently discovered 
enzyme subfamily have not yet been unambiguously estab- 
lished. The first review of this new class was published earlier 
this year and discussed seven human TTSPs (3). As described 
by these authors, the msyority of these human proteinases have 
been associated either with tumor cells and/or cell growth. 
Because both cell growth and tumor progression (including 
angiogenesis) are expected to require localized activation of 
growth factors and degradation and/or remodeling of the extra- 
cellular matrix, it seems likely that these processes require cell 
associated, proteoljrtic activities. Other membrane-associated 
proteinases such as the disintegration and metalloproteinase 
(ADAM), matrix metalloproteinases, and uPA have previously 
been implicated in these processes, and it is conceivable that 
specific members of the TTSPs will also contribute to these key 
processes. The development of highly potent and selective in- 
hibitors of individual TTSPs, therefore, may represent an ex- 
citing new strategy to discover compounds with anti-angiogenic 
or antitumor activities. 

MT-SPl has been implicated in the progression of prostate 
cancer in a rat model, and our solution of the high resolution 
structures of the proteinase domain, in complex with benzami- 
dine or BPTI, is an important first step toward the design of 
potent selective inhibitors. Because the proteinase active site is 
fully solvent-exposed in the Bz-MT-SPl crystals, as demon- 
strated by initial soaking experiments (data not shown), small 
molecule inhibitors can be soaked into these crystals, thereby 
facilitating rapid progress in structure-based drug design ef- 
forts. In addition, the structures reported here can be used to 
model the structure of other cancer-associated type II trans- 
membrane serine proteinases, and may therefore facilitate ef- 
forts to find potent, specific inhibitors of those important pro- 
teinases as well. First homology-based inhibitor design studies 
have been undertaken by Enyedy et aL (16), based on a throm- 
bin model, which led to the development of selective bis-benza- 
midine inhibitors. This work showed that there is sufficient 
structural difference between MT-SPl and closely related ser- 
ine proteinases for the development of potent and selective 
inhibitors. Small molecule inhibitors of TTSPs can be used not 
only to discover the biological and pathological roles of specific 
members of this intriguing new enzyme subfamily but, because 
of the strong association of these enzymes with tumor cells, can 
also be tested as potential lead anticancer compounds. 
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Thrombin is a multifunctional serine proteinase that plays a key role in coagulation while exhibiting several other 
key cellular bioregulatory functions. The X-ray crystal structure of human a-thrombin was determined in its com- 
plex with the specific thrombin inhibitor D-Phe-Pro-Arg chloromethylketone (PPACK) using Patterson search 
methods and a search model derived from trypsinlike proteinases of known spatial structure (Bode, W., Mayr, 
L, Baumann. U., Huber, R., Stone, S.R., & Hofsteenge, J., 1989. EMBO J. 8, 3467-3475). The crystallographic 
refmement of the PPACK-thrombin model has now been completed at an R value of 0.156 (8 to 1 .92 A); in par- 
ticular, the amino- and the carboxy-termini of the thrombin A-chain are now defined and all side-chain atoms 
localized; only proline 37 was found to be in a cis-peptidyl conformation. 

The thrombin B-chain exhibits the characteristic polypeptide fold of trypsinlike serine proteinases; 195 residues 
occupy topologically equivalent positions with residues in bovine trypsin and 190 with those in bovine chymo- 
trypsin with a root-mean-square (r.m.s.) deviation of 0.8 A for their cr-carbon atoms. Most of the inserted resi- 
dues constitute novel surface loops. A chymotrypsinogen numbering is suggested for thrombin based on the 
topological equivalences. The thrombin A-chain is arranged in a boomeranglike shape against the B-chain glob- 
ule opposite to the active site; it resembles somewhat the propeptide of chymotrypsin(ogen) and is similarly not 
involved in substrate and inhibitor binding. 

Thrombin possesses an exceptionally large proportion of charged residues. The negatively and positively charged 
residues are not distributed uniformly over the whole molecule, but are clustered to form a sandwichlike electro- 
static potential; in particular, two extended patches of mainly positively charged residues occur close to the car- 
boxy-terminal B-chain helix (forming the presumed heparin-binding site) and on the surface of loop segment 70-80 
(the f1brin[ogen] secondary binding exosite), respectively; the negatively charged residues are more clustered in 
the ringlike region between both poles, particularly around the active site. Several of the charged residues are in- 
volved in salt bridges; most are on the surface, but 10 charged protein groups form completely buried salt bridges 
and clusters. These electrostatic interactions play a particularly important role in the intrachain stabilization of 
the A-chain, in the coherence between the A- and the B-chain, and in the surface structure of the fibrin(ogen) sec- 
ondary binding exosite (loop segment 67-80). 

The most remarkable feature at the thrombin surface is the prominent canyonlike active-site cleft mainly shaped 
by two characteristic insertion loops around Trp 60D and Trp 148. The deep and narrow active-site cleft in gen- 
eral explains the narrow speciflcity of thrombin for distinct macromolecular substrates and inhibitors. Compar- 
isons with other crystal structures of human and bovine thrombin recently determined using this PPACK-thrombin 
model indicate that the first loop around Trp 60D is relatively rigid, whereas the opposite loop around Trp 148 
can attain different conformations depending on complexation state and crystalline environment. 

The active-site residues and the entrance to the specificity pocket are partially occluded in thrombin (much more 
than in the other serine proteinases) by this distinctive Trp 60D loop. The specificity pocket of thrombin resem- 
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bles that of bovine trypsin but is designed to prefer arginine over lysine residues at PI . D-Phe II and Pro 21 of 
the bound PPACK inhibitor fit neatly to a novel hydrophobic cleft (the aryl-binding site) and to the cavitylike 
hydrophobic S2 subsite; the D-configuration of Phe 1! is beneficial for binding as it allows the PPACK amino- 
terminus to form hydrogen bonds to Gly 216 in addition. Some small arginine and benzamidine-derived synthetic 
inhibitors owe their particularly high thrombin specificity and affinity to their exceptional steric fit to these novel 
hydrophobic cavities close to the thrombin active site (Bode, W., Turk, D., & Sturzebecher, J., 1990, Eun 7. Bio- 
chem, 193, 175-182). 

The active-site cleft levels off in the primed direction and continues over the molecular surface of the throm- 
bin loop Lys 70-Glu 80 (itself structurally similar to the calcium loop in trypsin, with the distal nitrogen of Lys 
70 replacing the calcium of trypsin). This site of a strong positive electrostatic surface potential probably repre- 
sents the secondary site for interaction with the a-chain of fibrinogen and fibrin (the fibrin[ogen] secondary bind- 
ing exosite) and accommodates the carboxy-terminal acidic tail part of hirudin (Rydel, T. J. , Ravichandran, K.G., 
Tulinsky. A.. Bode, W., Huber, R., Roitsch. C. & Fenton, J.W,, II, 1990, Science 249, 277-280). Segment Arg 
187-Gly 188- Asp 189, which could represent a thrombin adhesion site for cellular interactions with platelets, fi- 
broblasts, and endothelial cells, is mainly buried in a-thrombin; its adhesive role would thus appear to require 
some prior unfolding. 

Most of the well-characterized sites of proteolytic cleavage leading to the degradation products i?-, 7-, and e- 
thrombin of diminished or lost clotting activity are situated in exposed mobile loops of a-thrombin. None of these 
segments is in a canonical conformation that would allow association with the substrate-binding site of a cleav- 
ing serine proteinase without large conformational changes. The cleavage of the Arg 77A-Asn 78 scissile peptide 
bond (leading to /3-thrombin) presumably results in the unfolding of the 70-80 loop, exposing the salt bridge-con- 
nected residues buried in a-thrombin to the solvent; the concomitant disruption of the surface of the fibrin(ogen) 
secondary binding exosite would explain the loss of binding capacity (and thus catalytic activity) toward fibrino- 
gen. The Arg 67-Ile 68 peptide bond is completely buried beneath this 70-80 loop surface and thus only suscep- 
tible to an attacking proteinase after prior cleavage and exposure of this loop, as observed. 

Keywords: electrostatic interactions; protein crystallography; serine proteinase; thrombin; thrombosis 



a-Thrombin' is a glycosylated trypsinlike serine protein- 
ase generated in the penultimate step of the blood coag- 
ulation cascade from the circulating plasma protein 
prothrombin. Upon autocatalytic and Factor Xa cleav- 
age the functional two-chain molecule a-thrombin is gen- 
erated. In the case of the human species this cv-thrombin 
consists of the 36-residue A-chain and the 259-residue B- 
chain (Butkowski et aL, 1977; Thompson et ah, 1977; De- 
gen et al., 1983). The two chains are connected covalently 
by a disulfide bridge. The B-chain has been shown to be 
homologous to the catalytic domains of other pancreatic 
and coagulation/fibrinolytic trypsinlike proteinases (Jack- 
son & Nemerson, 1980). Upon further autolytic or pro- 
teolytic cleavage, more species (in particular jS- and 
7-thrombin) are generated that retain some activity 
against small synthetic substrates but have lost most or 
all clotting activity (Lundblad et al., 1979; Elion et al., 
1986; Hofsteenge et al., 1988). 

Thrombin is a multifunctional protein; it plays a cen- 
tral role in thrombosis and hemostasis but is also impli- 



Pl. P2, P3, etc., and PI', P2' designate subsirate/inhibitor resi- 
dues amino- and carboxy-terminal to the scissile peptide bond, respec- 
tively, and SI, S2, S3, etc., and SI', S2' the corresponding subsites of 
the cognate proteinase (Schechter & Berger, 1967). For numbering of 
the thrombin amino acid residues, the chymotrypsinogen nomenclature 
introduced by Bode et al. (1989b) is used. D-Phe-Pro-Arg chlorometh- 
ylkeione (PPACK) residues are designated by a suffix 1, hirudin resi- 
dues by a suffix HI behind the residue number. 

Deposition: The PPACK-thrombin have been deposited at the 
Brookhaven Protein Data Bank (Bernstein et al., 1977). 



cated in wound healing and various disease processes (see 
Fenton, 1988). a-Thrombin converts fibrinogen into fi- 
brin, which consequently aggregates and forms the inter- 
connecting network of thrombi. Furthermore, thrombin 
is able to activate several coagulation and plasma factors, 
such as Factors V, VIII, XIII, and protein C. The com- 
plex of thrombin with thrombomodulin exhibits en- 
hanced reactivity toward protein C (Esmon et al., 1982) 
but has lost all other procoagulant activity (Esmon et ah, 
1983; Hofsteenge & Stone, 1987). Thrombin is effectively 
inhibited by only a very few endogenous protein inhibitors, 
such as a2-macroglobulin and the serpins antithrombin 
III (Rosenberg & Damus, 1973), heparin cofactor II 
(Toilefsen et al., 1982), and protease nexin I (Cunning- 
ham & Farrell, 1986). The interaction with these serpins 
is considerably enhanced by the acidic glycosaminoglycan 
heparin (Li et aL, 1976; Bjork & Lindahl, 1982; Olson & 
Shore, 1982; Wallace et al., 1989). In addition, thrombin 
binds very tightly to and is selectively inhibited by hiru- 
din, a protein isolated from the medicinal leech (Wals- 
mann & Mark ward t, 1981; Stone & Hofsteenge, 1986; 
Dodt et al., 1988). 

Thrombin acts on a variety of cells (see Jackson & 
Nemerson, 1980). Some of these cellular interactions re- 
quire proteolytically active forms of thrombin, whereas 
others also function with active-site-blocked thrombin 
species (see, e.g., Carney et al., 1986; Fenton, 1986; 
McGowan & Detwiler, 1986; Shuman, 1986; Vu et al., 
1991). Thrombin induces platelet aggregation and stim- 
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ulates platelet secretion. It causes mitogenesis in fibro- 
blasts (Cunningham & Farrell, 1986; Glenn et al., 1988) 
and macrophagelike cells (Bar-Shavit et al., 1986), ex- 
hibits chemotactic properties (Bar-Shavit et ah, 1983. 
1984, 1986; Bizios et al., 1986), and binds to endothe- 
lial cells (Prescott et al., 1990; Bar-Shavit et al., 1991) 
and to subendothelial extracellular matrix (Bar-Shavit 
et al,, 1989). 

Thrombin cleaves a variety of proteins for which the 
amino acid sequence around the scissile peptide bond dif- 
fers considerably (BlombSck et a!., 1977; Scheraga, 
1977). Its specificity is primarily trypsinlike, i.e., it 
cleaves behind arginine and lysine residues, with a clear 
preference for Arg-Xaa bonds (Liem & Scheraga, 1974; 
Lottenberg et al., 1983). It is, however, much more selec- 
tive than trypsin toward macromolecular substrates in 
that it cleaves many fewer peptide bonds of permitted se- 
quence; in fibrinogen, for example, only two Arg-Xaa 
bonds out of a total of 181 Arg/Lys-Xaa bonds are at- 
tacked (Blomback et al., 1967; Hogg & Blomback, 1978). 
Thrombin is particularly active toward peptidic substrates 
and inhibitors with a proline at the P2 position (Kettner 
& Shaw, 1981); a D-phenylalanine residue al P3 makes 
such peptides even more reactive toward thrombin. 
Rather reactive and selective chloromethyl, aldehyde, and 
boroarginine inhibitors have been developed with the 
peptidyl moiety D-Phe-Pro-Arg (Bajusz et al., 1978; Kett- 
ner & Shaw, 1979; Kettner et al., 1991); various synthetic 
thrombin inhibitors have been found and characterized 
based on arginine and benzamidine derivatives or on het- 
erocyclic compounds (Okamoto et al., 1981; Sturzebecher 
et al., 1983; Kam et al., 1988). 

In contrast to most other serine proteinases, the throm- 
bin specificity toward fibrinogen is not determined by 
subsites surrounding the active residues alone. For effi- 
cient cleavage of fibrinogen (as well as of isolated Aa- 
chains) the availability and integrity of a thrombin exo- 
site quite distant from the catalytic residues is important 
(Hageman & Scheraga, 1974; Liem & Scheraga, 1974; 
Blomback et al., 1977; Van Nispen et al., 1977; Marsh 
et al., 1985). This fi brin(ogen)-recognizing exosite, attrib- 
uted to a rather basic surface area of thrombin around its 
jS-cleavage site has also been implicated in binding of 
other proteins such as fibrin monomers, fibrin E-domain, 
and its NDSK-fragment, thrombomodulin, and hirudin, 
and of negatively charged cell surfaces, glass, and cation- 
exchange resins (Liu et al., 1979; Fenton et al., 1981, 
1988; Berliner et al., 1985; Kaminski & McDonagh, 1987; 
Kaczmarek & McDonagh, 1988; Vali & Scheraga, 1988). 

An apolar binding site close to the catalytic center has 
been inferred from binding and proflavine displacement 
studies (Thompson, 1976; Berliner & Shen, 1977). This 
site has been suggested as accounting for thrombin spec- 
ificity with tripeptide substrates (Sonder & Fenton, 1984) 
and for the accommodation of large hydrophobic resi- 
dues amino-terminal of the fibrinogen Aa-cleavage site 
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(Meinwald et al., 1980; Marsh et al., 1985; Ni et al., 
1989a.b). 

Several dysfunctional genetic variants of human 
(pro)thrombin have been found; in three of them the site 
of mutation has been localized (Miyata et al., 1987; Hen- 
riksen&Mann, 1988, 1989). Recently, several recombinant 
thrombin mutants also have been prepared (LeBonniec & 
Esmon, 1991; LeBonniec et al., 1991; Wu et al., 1991). 

Three-dimensional models have been proposed for the 
thrombin B-chain (Magnusson et al., 1975; Greer, 1981; 
Bing et al., 1986; Sugawara et al., 1986; Toma & Suzuki, 
1989) based on the crystal structures of bovine trypsin 
and chymotrypsin. These models provided a general im- 
pression of the arrangement of sites involved in the var- 
ious interactions of thrombin and were of some use in 
explaining certain structure-function relationships and in 
developing inhibitors (Maraganore et al., 1990). An ad- 
equate understanding of the specific and characteristic 
functions of thrombin and the accurate model-based de- 
sign of thrombin inhibitors requires, however, knowledge 
of an experimentally determined structure. 

We have recently communicated the X-ray crystal 
structure of D-Phe-Pro-Arg chloromethylketone (PPACK 
[Kettner & Shaw, 1979]) inhibited human a-thrombin and 
presented a concise description of the most important 
features of thrombin, in particular its unique and prom- 
inent canyonlike active-site cleft (Bode et al., 1989b). This 
PPACK-thrombin model is now fully refined; the rela- 
tively fiexible amino- and carboxy-termini of the A-chain 
have been determined, the locations of a few amino acid 
side chains have been revised, and more solvent molecules 
have been added in order to improve further the phases. 

Using the refined coordinates of this PPACK-throm- 
bin structure as a search and initial phasing model, the 
X-ray crystal structures of human a-thrombin complexes 
of hirudin (Grutter et al., 1990; Rydel et al., 1990, 1991) 
and fibrinopeptide A/hirugen (Stubbs et al., 1992) and of 
bovine thrombin complexes (Martin et a!., 1992) have 
been elucidated. Recently, we established the exact bind- 
ing geometry of some arginine- and benzamidine-based 
synthetic inhibitors toward bovine thrombin (Brandstet- 
ler et al., 1992) and confirmed earlier modeling results 
inferred from trypsin binding (Bode et al.. 1990; Chow 
et al., 1990; Turk et al., 1991). 

In the following, the entire course of the structure anal- 
ysis of PPACK-thrombin and the completely refined 
model are fully described; sites of particular interest, such 
as the A-B-chain interaction and the distinct substrate- 
binding subsites including the interactions with substrates 
and inhibitors are presented in more detail; the unique 
electrostatic properties of thrombin that play an impor- 
tant role in the molecular stability and in several molec- 
ular functions are described; the thrombin structure is 
compared with that of related proteinases; and finally, 
the various functions of thrombin are discussed with re- 
spect to its structure. 



Crystal structure of human thrombin 

Results and discussion 

Overall structure 

The thrombin molecule can be described as a prolate el- 
lipsoid of approximate dimensions 45 A x 45 A x 50 A. 
Its A- and B-chains arc not organized in separate do- 
mains, but form a single contiguous body with a highly 
furrowed surface (see, e.g., Fig. 1). Similar to the other 
trypsinlike serine proteinases, thrombin consists essen- 
tially of two interacting six-stranded barrellike domains 
(linked by four transdomain "straps"), of five helical seg- 
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ments and one helical turn, and of various surface- 
located turn structures (Fig. 2; Kinemage 1). Three of 
these straps and three of these helices are also present in 
other known serine proteinase structures; the A-chain has 
a counterpart in chymotrypsin(ogen). The catalytic resi- 
dues, in particular Ser 195, His 57, and Asp 102 (using 
the chymotrypsinogen nomenclature introduced by Bode 
et al, [1989b]; see Table 3) are located at the junction be- 
tween both barrels; a prominent active-site cleft (Fig. 1) 
stretches perpendicular to this junction (described in 
more detail below). 




Fig. 1. Front view toward the thrombin molecule (yellow), displayed together with its Connolly dot surface (calculated with 
a probe of radius 1 .4 A). The active-siie cleft runs from left to right across the molecular surface; the bound PPACK molecule 
(violet) is bound. Only the front parts of the thrombin molecule and of its molecular surface are displayed. A bound substrate 
polypeptide chain would run front left to right. Tlie surface **hole** close to the ceiuer corresponds to the entrance to the speci- 
ficity pocket. The 60 insertion loop, in particular Trp 60D, ts partially occluding the active site. The fibrinogen secondary binding 
exosite is to the right. 





Fig. 2. Ribbon plot of the human a-thrombin polypeptide chain; 3-strands, helices, and turns are represented by twisted arrows, 
helical ribbons, and ropes. The view is on the active-site cleft made by the B-chain (yellow); the A-chain runs in the back of 
the molecule (violet). This figure has been produced with a modified version of the program RIBBON kindly provided by John 
Priestle (Priesile. 1988). 
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The secondary structure of the A- and the B-chain of 
the final thrombin model is illustrated schematically in 
Figure 3. Each barrellike domain appears as six antipar- 
allel running strands. The size and extent of each jS-sheet 
is remarkably similar to those observed in other trypsin- 
like serine proteinases. The polypeptide strands involved 
in barrel formation always fold back on the molecular 
surface (see Fig. 2); some of these surface-located throm- 
bin loops are particularly extended in comparison to 
other related serine proteinases. In each barrel, four of 
the five turns are hairpin loops, all of them of a rather 
open, nonregular conformation. Some of these open 
loops contain local classical 1-4 tight turns (Table 1). 

The six helical segments occurring in thrombin are 
listed in Table 2 together with their helix type and length 
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defined according to different helix criteria. The one-turn 
helix around His 57, the intermediate helix around Cys 
168, and the long carboxy-terminal helix (of a mixed 
type) are similar to those found in other serine protein- 
ases; the A-chain helix, the short 3|o-heIix between Tyr 
60A and Asp 60E (better described as two interlaced type- 
111 turns; see Table 1), and the regular two-turn a-helix 
around Ala 129 are additional segments or parts of inser- 
tion loops and seem to be characteristic of thrombin. 

Comparison with related serine 
proteinase structures 

The a-carbon structure of human a-thrombin optimally 
superimposed on that of bovine chymotrypsinogen is 




Rg. 3. Secondary structure of human a-ihrombin. Inter-main-chain hydrogen bonds arc selected according lo criteria (with 
£ < -0.7 kcal/mol) given by Kabsch and Sander (1983). The sequence nomenclature is that derived from equivalency with 
chynioirypsin(ogcn)/irypsin (Bode et al., 1989b) as also used in Table 3. Thrombin residues that arc topologically equivalent 
lo chymotrypsin residues are emphasized by thick circles; thrombin residues, which in addition possess identical amino-acids, 
are denoted by filled circles. 
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Table 1. Tight 1-4 turns in human a-thrombin 



Hydrogen bond 



Residues 




1 




Position 

2 3 


4 

. — 


Turn type 


1—4 Hvdrosen-bond 
length 
(A) 


Energy 
(kcal/mol)" 


Remarks 




lA-3 


Asp 


Cys 


Gly 


Leu 


.. 

II 


3.45 


■ 

-1.3 


Gly in position 3 


4-7 


Arg 


Pro 


Leu 


Phe 


I-lll 


3.01 


-1.5 


Pro in position 2 


8-11 


Glu 


Lys 


Lys 


Ser 


I 


2.89 


-1.9 




14-14C 


Asp 


Lys 


Thr 


Glu 


I 


3.21 


-1.6 




I4G-14J 


Leu 


Glu 


Ser 


Tyr 


III 


2.80 


-2.1 




23-26 


Glu 


lie 


Gly 


Met 


II 


3.40 


-1.3 


Gly in position 3 


48-51 


Ser 


Asp 


Arg 


Trp 


I 


3.46 


-0.7 


Very weak hydrogen bond 


60A-60D 


Tyr 


Pro 


Pro 


Trp 


III 


3.20 


. -1.5 


Part of 3jo-heHx 


60B-60E 


Pro 


Pro 


Trp 


Asp 


I 


2.74 


-2.8 


Pro not in position 3 


601-63 


Thr 


Glu 


Asn 


Asp 


I 


2.97 


-1.8 




77-79 


Glu 


Arg 


Asn 


He ' 


I 


3.41 


-0.8 


Very weak hydrogen bond 


91-94 


His 


Pro 


Arg 


Tyr 


I 


3.39 


-1.0 


Pro not in position 3 


131-134 


Gin 


Ala 


Gly 


Tyr 


II 


2.84 


-2.4 


Gly in position 3 


164-167 


Glu 


Arg 


Pro 


Val 


III 


3.07 


-0.9 




168-171 


Cys 


Lys 


Asp 


Ser 


I 


3.11 


-1.6 




177-180 


Thr 


Asp 


Asn 


Met 


1 


3.12 


-1.5 




185-186B 


Lys 


Pro 


Asp 


Glu 


1 


3.01 


-2.2 


Pro not in position 3 


191-194 


Cys 


Glu 


Gly 


Asp 


II 


3.34 


-1.5 


Gly in position 3 


221-224 


Arg 


Asp 


Gly 


Lys 


11 


3.18 


-1.7 


Gly in position 3 



" According to Kabsch and Sander (1983). 



shown in Figure 4. One hundred ninety amino acid resi- 
dues of the thrombin B-chain are topologically equivalent 
(within a root-mean-square [r.m.s.] deviation of 0.78 A) 
to residues in bovine chymotrypsin (Cohen et al., 1981; 
Blevins & Tulinsky, 1985; Tsukada & Blow, 1985); in ad- 
dition, 6 residues of the thrombin A-chain are equivalent 
to the 6 amino-terminal residues of the chymotryp- 
sin(ogen) propeptide. The 190 equivalent B<:hain residues 
and another 40 residues were assigned the sequence num- 
bers of the topologically equivalent chymotrypsinogen 
residues (Hartley & Kauffman, 1966; Meloun et ah, 
1966). In addition to secondary structural elements, Fig- 
ure 3 defines those human cv-thrombin amino acid resi- 



dues topologically equivalent to chymotrypsin residues 
(see also Kinemage 2); filled circles further represent those 
84 with identical amino acids. 

The thrombin B-chain has 195 a-carbon atoms equiv- 
alent to the bovine trypsin structure (see Fig, 5) with an 
r.m.s. deviation of 0.8 A (Bode & Schwager, 1975a); at 
a few sites the thrombin structure is closer to that of tryp- 
sin than to that of chymotrypsin, i.e., (1) thrombin seg- 
ment 70-80, which resembles the calcium-binding loop of 
trypsin (Bode & Schwager, 1975a,b), (2) Gly 184-Tyr 
184A, and (3) thrombin loop segment 217-224. 

Here, the generally used chymotrypsinogen numbering 
of bovine trypsin (Hartley & Shotton, 1971) was used. 



Table 2. Helical segments in human a-thrombin 



Helix extent (number of residues) according to Richardson and Richardson (1990) 



Helix type 


Hydrogen bonding 


Conformation 


a-Carbon position 


3.6,3 


Glu 14C loScr 141 (7) 


Glu 14C to Ser 141 (7) 


Thr 14B to Tyr 14J (9) 


3io 


Ala 55 to Leu 38 (S) 


Ala 56 to Leu 59 (5) 


Ala 56 to Tyr 60A (6) 


3 II) 


Tyr 60A to Asp 60E (S) 


Pro 60B to Pro 60C (2) 


Tyr 60A to Trp 60D (4) 


3.6i3 


Asp 125 to Uu 129C(8) 


Arg 126 to Scr 129B (6) 


Asp 125 to Glu 131 (10) 


3.6,3 


Glu 164 to Asp 170 (7) 


Arg 165 to Lys 169 (5) 


Glu 164 to Ser 171 (8) | 




Cys 168 to Thr 172 (5) 


Lys 169 to Ser 171 (3) 


Lys 169 to Thr 172(4) ) 


3 Id 


Val 231 to Lys 235 (5) 


Val 231 to Trp 237 (7) 


His 230 to Leu 234 (5) \ 


3.6,3 


Leu 234 to Asp 243 (10) 


He 238 to lie 242 (5) 


Leu 234 to He 242 (9) j 


3,0 


Val 241 to Glu 244 (4) 


Val 241 to Glu 244 (4) 


Val 241 to Glu 244 (4) J 
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47GLU 




47GLU 




Fig. 4. a-carbon structure of the A-chain 
(dashed connections) and the B-chain (thick 
connections) of human a-thrombin super- 
imposed with molecule 1 of bovine chymo- 
trypsinogen (thin connections [Wang et al., 
1985J) after minimizing the r.m.s, deviation 
of the topologically equivalent a-carbon 
atoms. The PPACK molecule is shown with 
all nonhydrogen atoms, and the thrombin 
chain termini are labeled. 



I70I.U 



:47GLU 




Fig. 5. «-carbon structure of the A-chain 
(dashed connections) and the B-chain of 
human a-thrombin (thick connections) su- 
perimposed with bovine trypsin (Bode et al., 
1976). The PPACK molecule is overlaid; 
standard view as in Figure 1 . 



The residual insertion residues in the thrombin B-chain 
were marked by letter suffixes; at a few of the junctions 
between conserved and variable segments, the position- 
ing of the insertions was necessarily somewhat arbitrary. 

For numbering of the A-chain, residue numbers I and 
15 were assigned to the (chymoirypsin-homologous) cys- 
teine and to the carboxy-terminal arginine residue, re- 
spectively. The A-chain residues preceding Cys 1 were 
designated by a 1 and a letter suffix in alphabetic order 
in the reverse chain direction; the A-chain residues fol- 
lowing Cys 1 were designated by numbers up to 14, fol- 
lowed by 14 and letter suffixes in alphabetic order. 

The chymotrypsinogen sequence numbering of human 
thrombin is shown in Table 3 based on these topological 
equivalences with chymotrypsin and trypsin (see Bode 
et al., 1989b). The A-chain runs from Thr IH to Arg 15; 
the B-chain starts with lie 16 and ends up with Glu 247. 
A nomenclature based on the topological equivalence of 
the amino acid residues is an enormous advantage, as 
functional properties already known for the related tryp- 
sinlike enzymes can be attributed to distinct sites of the 



thrombin structure. Table 3 also contains the sequences 
of bovine chymotrypsinogen (Hartley & Kauffman, 1966; 
Meloun et al., 1966), bovine trypsinogen (Walsh & Neu- 
rath, 1964; Mikes et al., 1966), and bovine thrombin 
(Magnusson et al,. 1975; MacGilJivray & Davie, 1984). 
All key residues of thrombin are also conserved in the cor- 
responding enzymes from mouse (Friezner-Degen et al., 
1990) and rat (Dihanich & Monard, 1990). 

According to this topological alignment, 86 (of the 
equivalent) amino acid residues of the thrombin B-chain 
are identical with residues in bovine trypsin, and 79 are 
identical with residues in bovine chymotrypsin (whose 
propeptide possesses, in addition, two residues identical 
to the thrombin A-chain). For comparison, the two diges- 
tive enzymes have 99 identical residues between them. 

Of the other vertebrate serine proteinases of known 
spatial structure, porcine pancreatic kallikrein (Bode et al., 
1983) and porcine pancreatic elasiase (Meyer et al,, 1988) 
exhibit a similar topological equivalence to the thrombin 
B-chain; however, their r.m.s. deviation is slightly larger 
and the number of identical residues slightly smaller (Ta- 
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Table 3. Sequence alignment of bovine thrombin (BTHR), human thrombin (HTHR), bovine trypsin (BTRY), and 
bovine chymotrypsin (BCHY) according to topological equivalences; thrombin chymotrypsinogen numbering (THRO), 

chymotrypsinogen numbering (CHYG)^ 



THRO 


IH 


IG 


IF 


IE 


ID 


IC 


IB 


lA 


1 


2 


3 


4 


5 


6 


7 


BTHR 


T 


F 


G 


A 


G 


£ 


A 


D 


C 


G 


L 


R 


P 


L 


F 


HTHR 


T 


F 


. G 


S 


G 


E 


A 


D 


c 


G 


L 


R 


P 


L 


F 


BTRY 
































BCHY 


















c 


G 


V 


P 


A 


1 


Q 


CHYG 


















1 


2 


3 


4 


5 


6 


7 


THRO 


8 


9 


10 


11 


12 


13 


14 


14A 


14B 


14C 


14D 


14E 


14F 


14G 


14H 


BTHR 


E 


K 


K 


Q 


V 


Q 


D 


Q 


T 


E 


K 


E 


L 


F 


£ 


HTHR 


E 


K 


K 


s 


L 


E 


D 


K 


T 


E 


R 


E 


L 


L 


E 


BTRY 
































BCHY 


P 


V 


L 


s 


G 


L 


S 


















CHYG 


8 


9 


10 


11 


12 


13 


14 


















THRO 


141 


14J 


14K 


14L 


14M 


IS 




















BTHR 


S 


Y 


I 


E 


G 


R 




















HTHR 


S 


Y 


I 


D 


G 


R 




















BTRY 


V 


D 


D 


D 


D 


K 





















BCHY ----- R 
CHYG ----- 15 
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^ Residues topologically equivalent with human thrombin are boxed. 



ble 4), The corresponding numbers (Table 4) for rat mast 
cell proteinase II (Remington et al., 1988) and human leu- 
kocyte elastase (Bode et al,, 1986), two cellular serine pro- 
teinases, indicate a more distant structural relationship to 
thrombin. 

This ranking order due to topological equivalence 
agrees approximately with the order obtained from a se- 
quence alignment of these serine proteinases with the 



thrombin B-chain (Table 4), Table 4 shows furthermore 
the alignment score and the degree of similarity to the 
thrombin B-chain for the reactive domains of several co- 
agulation factors based on sequence alignment alone. Ac- 
cording to such data, the human thrombin B-chain is 
most similar to the catalytic domains of protein C and 
Factor Xa of human origin; its similarity with coagula- 
tion Factors IXa and XIa is only slightly lower, in agree- 



Table 4. Topological equivalence and sequence identity of human thrombin B-chain with the reactive 
domains of vertebrate serine proteinases. Sequence alignment scores for other coagulation factors^ 



Topologically equivalent 
residues 



Identical residues 
(from sequence alignment) 





Number 


r.m.s. deviation 
(A) 


Number 


Vq Total 
(enzyme) 


Alignment 
(standard de^ 


BTRY 


195 


0.82 


88 


39.5 


27 


BCHY 


190 


0.78 


85 


37.0 


24 


PPKK 


193 


0.92 


70 


30.2 


19 


PPE 


192 


1.02 


62 


25.8 


12 


RMCP 


180 


0.89 


56 


25.0 


13 


HLE 


176 


0.90 


61 


28.0 


12 


(BTRY-BCHY) 


190 


0.76 


96 


43,0 


35 


Protein C 






100 


40.0 


41 


FXa (reactive domain) 






97 


38.2 


40 


FIXa (reactive domain) 






87 


37.0 


37 


FXIa (reactive domain) 






88 


37.0 


34 



" BTRY, bovine trypsin (Bode & Schwager. 1975a); BCHY, bovine chymotrypsin A (Cohen et al., 1981); PPKK, porcine 
pancreatic kallikrein (Bode et al.. 1983); RMCP, rat mast cell proteinase 11 (Remington et al., 1988); PPE. porcine pancreatic 
elasiase (Meyer et a!.. 1988); HLE, human leukocyte elastase (Bode et al.. 1986). 
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ment with previous investigations of others (see, e.g., 
Patthy, 1985). 

The polypeptide alignment (Table 3) obtained for 
thrombin and chymotrypsin(ogen) due to topological 
equivalence is in good agreement with the alignment 
scheme proposed by Hartley and Shotton (1971). The to- 
pologically equivalent regions of the thrombin B-chain 
(Table 3) contain the (structurally) conserved regions pre- 
viously identified by Greer (1981) and Furie et al. (1982) 
on the basis of amino acid sequence homology or from 
comparison of the three-dimensional models of serine 
proteinases, respectively. The sites and numbers of inser- 
tions and deletions predicted by Bing et al. (1986) on the 
basis of a chymotrypsin-trypsin hybrid model are largely 
in accord with our experimental results. Their proposed 
alignment turned out to be incorrect only around Glu 80; 
as shown below, a single insertion at Arg 11 A is found 
compared with the predicted deletion and insertion at po- 
sitions 76 and 84, respectively. Some spatial properties of 
thrombin such as the positively charged anion-binding 
exosite and the clustering of aromatic residues near the 
active site were correctly predicted (Bing et al., 1986). It 
had been suggested that one or both large insertion loops 
might fold (partially) over the entrance to the catalytic 
center, in this way blocking the approach of large macro* 
molecular substrates or inhibitors (Sugawara et al., 1986). 
Not foreseen was the role of He 174 (which is not part of 
an insertion segment, see below) or some other structural 
details largely responsible for thrombin specificity. 

Comparison with related thrombin structures 

The availability of different crystal structures of throm- 
bin from different species allows us to recognize intrin- 
sic structural properties and to separate them from 
crystal-packing effects. Comparison of the a-carbon trac- 
ing of PPACK-thrombin with that of the hirudin-throm- 
bin complex (Rydel et al., 1991; see Fig. 6) shows both 
thrombin models to be to a large extent identical; two- 
thirds of all a-carbon atoms (i.e., 2QO largely internal res- 
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idues) differ by only 0.26 A (r.m.s. deviation). There are, 
however, some very significant displacements and confor- 
mational differences, in particular in the ami no-terminal 
segment (up to Ala IB) and in the carboxy-terminal seg- 
ment (from lie 14K onward) of the A-chain, in the inser- 
tion loop around Trp 148, and at the carboxy-terminus 
of the B-chain from Gin 244 onward. 

The displacements of the residues around Trp 148 are 
the result of a large conformational change of the corre- 
sponding loop, mainly through a rigid-body rotation of 
the loop domain around Glu 146 Ca-C and Gly 1 50 N-Ca 
(see Fig. 17 and below). The central residues of this loop 
are displaced by up to 10 A (Rydel et al., 1991). Without 
rearrangements, the hirudin chain as it is observed in the 
hirudin-thrombin complex would collide with the Trp 148 
loop in its PPACK-thrombin conformation (in particu- 
lar, the side chains of Trp 148 and Thr 4IH [see Fig. 17]). 
It should be mentioned that this loop segment in both 
thrombin structures exhibits enhanced, but by no means 
extreme, B-values (see Fig. 28). 

Significant deviations (up to almost 1.0 A) are further- 
more observed at the central residues of the insertion loop 
around Trp 60D (which juxtaposes the Trp 148 loop 
across the active-site cleft). These displacements result, 
however, from expansion of the S2 cavity (see below) en- 
closed by this exposed loop rather than from conforma- 
tional changes. Again, the B-values of this loop are not 
exceptionally high in either structure. 

Further significant, but more local, positional changes 
between PPACK-thrombin and hirudin-thrombin are 
observed at Glu 97A (which forms the ceiling of the hy- 
drophobic pocket) and at Arg 77 A (in the fibrin [ogen] 
secondary binding exosite); both residues are parts of 
quite exposed loops in the substrate-binding area and are 
rather flexible; thus their conformation might easily be 
affected by the differently bound inhibitors. 

Comparison of the human PPACK-thrombin model 
with the two independent molecules (Martin, Kunjum- 
men, Kumar, Bode, Huber, & Edwards, unpubl.) in the 
bovine thrombin structure (with 87% identical residues 



THR1H 




THRIH 



247 




U247 



Fig. 6. a-carbon structure of PPACK- 
human or-thrombin (thick conneaions) 
optimally superimposed with the throm* 
bin (chin connections) and the hirudin 
component (dashed connections) of the 
thrombin-hirudin complex (Rydel et al., 
1991). Standard view as in Figure 1. 



Crystal structure of human thrombin 

[compare with Table 3]) yields a similar pattern of poly- 
peptide chain displacements: the free amino- and the free 
carboxy-terminal end of the A-chain, the carboxy-termi- 
nus of the B-chain, and the Trp 148 loop (which is only 
weakly defined presumably due to partial cleavage) show 
very large displacements, mainly as the result of differ- 
ent crystal contacts. Omission of these segments (i.e., of 
21 amino acid residues) yields an r.m.s. deviation of only 

0. 45 A for a-carbon atoms. It is noteworthy that the 60« 
insertion loop is similarly folded in the two bovine throm- 
bin molecules as in PPACK-thrombin. 

Charged groups and sail bridges 

Human thrombin has a particularly high proportion of 
polar amino acids. Only about 45*^^o of its total residues 
are nonpolar (compared with, for example, 51% in bo- 
vine chymotrypsin and 63% in the much more "sticky" 
human leukocyte elasiase); more than 70% of all residues 
on the thrombin surface are hydrophilic in nature. More- 
over, human a-thrombin possesses an exceptionally large 
number of charged residues compared with the pancre- 
atic irypsinlike enzymes. The A- and the B-chain contain 
9 and 31 acidic residues and 6 and 37 basic residues (and 
an additional 4 histidines), respectively (Fig. 7). In con- 
trast, bovine trypsinQgen and bovine chymotrypsinogen 
contain only 12 or 14 addle, and 17 or 18 basic residues. 
In addition, human a-thrombin has two terminal amino, 
two terminal carboxy, and two neuraminic acid groups, 
the latter terminating the single Asn 60G-linked sugar 
chain of thrombin (Nilsson et ah, 1983). 

The calculated charges and electrostatic energies of all 
(partially) charged protein groups in the electrostatic 
multipole field created by the whole thrombin molecule 
are listed in Table 5. About three-quarters of all charged 
groups exhibit negative electrostatic interaction energies, 

1. e., confer overall stability to the thrombin molecule, 
whereas one-quarter of them have a destabihzing influ- 
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ence on the molecular structure. At pH 7, His 57 should 
carry an almost full positive charge (due to the close 
proximity of Asp 102); His 119 should be weakly charged 
at this pH value, whereas the residual two histidine resi- 
dues. His 71 and His 91, should virtually be noncharged. 

The almost balanced number of acidic and basic resi- 
dues of human thrombin suggests that its overall net 
charge (which we calculate to be +2.3 at pH 7, if the two 
neuraminic acid groups are included) is close to zero un- 
der physiological buffer conditions. Considering intrin- 
sic pK values for the ionizable groups of a-thrombin, an 
overall isoelectric point of 7.8 can be calculated; allow- 
ing for the spatial arrangement of the charged groups and 
their environment (Karshikov et al., 1989) and assuming 
two neuraminic acid residues located 20 A apart from the 
thrombin surface, a higher value of 8.4 is obtained. These 
calculated isoelectric points are slightly higher than the 
experimental values of 7.3 and 7.6 obtained by isoelectric 
focusing (Fenton et aL, 1977); however, higher values 
around 9 have also been reported for human thrombin 
(Berg et al., 1979; Heuck et al., 1985). 

Seventy-nine of the 89 fully charged groups (at pH 7) 
are at least partially exposed to bulk water (Table 5); they 
represent 42% of all surface- located residues of the 
thrombin molecule. This large number of surface charges 
presumably accounts for the marked tendency of human 
thrombin to associate at low ionic strength and foi; its 
strongly salt-dependent solubility (Landis et al., 1981). 
The significantly better solubility of the almost equally 
charged bovine species indicates that some individual sur- 
face charges might have a particular impact on the asso- 
ciation behavior, however. 

Ten charged protein groups of human a-thrombin are 
completely buried and inaccessible to bulk water (Table 5), 
All of these buried charges and many of the surface- 
located charges are arranged in pairs and clusters of 
oppositely charged groups. Thirty-two salt bridges (of 
stabilizing energies greater than or equal to IRT, i.e.» 




Fig. 7. Main-chain structure of human thrombin (yellow), with ail positively (blue) and negatively (red) charged residues added 
with full structure. Standard view as in Figure 1 . 
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Table 5. Calculated parameters of ait charged groups of human a- thrombin at pH 7, ionic strength 0. 125 
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0.6 
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1.0 


-0.8 


0.35 
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-0.7 


0.47 
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0.7 
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-0.7 


0,53 
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0.7 


0,17 
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-0.6 
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Arg 15 


1.0 


-0.6 


0,66 


Arg 93 


1.0 


1.4 
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Amino-terminus. 
'* Carboxy-terminus. 



—0.6 kcal/mol) formed between oppositely charged pro- 
tein groups are given in Table 6. They contribute to the 
stabilization of both /3-barrels and of some of the open 
turns. Several of the salt bridges are further intercon- 
nected giving rise to ionic clusters of up to four protein 
groups of either charge. The electrostatic energies of the 
resulting clusters are, of course, slightly less than the 



arithmetic sum of the contributing ion pairs, due to par- 
tial repulsion of identically charged groups. 

These electrostatic interactions play particularly impor- 
tant roles in the intrachain stabilization of the A-chain 
and in the coherence between the A- and the B-chain (see 
Figs. 9, 10; Table 6). Twenty salt bridges (some of which 
are further organized in five larger ion pair clusters) are 
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Table 6, Ion pairs and clusters stabilizing the tertiary 
structure of human a-thrombin 



Ion clusters and 
salt bridges 



Electrostatic 
energy of 
bridges and 

clusters 

(kcal/mol) 



Distance* 

(A) 



Solvent 
accessibility 
of charged 

groups 
involved 
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0.08 
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■ Distances between charged group centers (sec Materials and meth- 
ods)/nearest charged atoms. 
^ Amino-terminus. 

"^Fixation of amino-terminus (N) or carboxy-ierminus (C); stabili- 
zation of turns (T), /3-sheets (S); cross-connection of domains (D) or 
active-site residues (A). : 



made by B-chain groups alone. In contrast, the compar- 
atively short A-chain is cross-linked by six salt bridges 
grouped in three separate clusters; another six salt bridges 
cro$s*connect the A- with the B-chain (Fig. 10). Alto- 
gether, seven of these salt bridges (two within the A- 
chain, three between A and B, and two within the 
B-chain) are completely buried in the protein matrix; in 
each of these buried ion bridges/clusters, at least one of 
the charged groups is in direct contact with an adjacent 
internal water molecule (see Table 9). 

Particularly extended ion clusters stretch around Asp 
14 (where they confer stabilization to the A-chain and 
tighten the A-B interaction, see below) and Lys 70 (where 
they stabilize the structure of loop 70-80 and thus main- 
tain the integrity of the fibrin[ogen] secondary binding 
exosite; see below). Similar to the other activatable tryp- 
sinlike serine proteinases, thrombin possesses a buried 
salt bridge of large electrostatic interaction energy be- 
tween the ammonium group of He 16 (the amino-termi- 
nal residue of the B-chain) and the side-chain carboxyl 
group of Asp 194 (see below and Kinemage f ); the integ- 
rity of this salt bridge has been shown to be important in 
maintaining the characteristic active conformation (see 
below and Bode, 1979). 

The overall distribution of positively and negatively 
charged residues in the thrombin molecule and along its 
molecular surface is not uniform (Fig. 7); rather the pos- 
itive charges are somewhat more concentrated at two 
poles, and the negatively charged residues arranged in a 
ring. leading to a sandwichlike electric-field distribution 
(Fig. 8). In other words, several charges are not properly 
balanced by oppositely charged protein neighbors, but 
cluster in positively and negatively charged surface 
patches (Table 7). These clusters give rise to quite high 
electrostatic field strengths (both positive and negative) 
outside the thrombin surface (Fig. 8). The charged groups 
of residues located in the center of these patches can ex- 
hibit considerable positive electrostatic interaction ener- 
gies with the electric multipole of the molecule (such as, 
e.g., Arg 93 or Asp 189 (see Tables 5, 7]) and thus have 
a destabilizing effect on the molecule; several of the 
charged residues located more toward the periphery of 
these patches are, on the other hand, simultaneously en- 
gaged in salt bridges and can therefore contribute to the 
stability of the whole molecule (as, e.g., Glu 39 of the 
negative patch nla [Table 7]). This charge clustering is 
presumably the main reason for the strong electrostatic 
interaction of thrombin with anionic structures observed 
even at physiological pH values, where its overall charge 
almost cancels out. . 

Three such surface patches (patches pi, p2, nl, and 
subgroups in Table 7) deserve particular attention. The 
strongest positive electrostatic field (patch p2) is observed 
for a groove (on top of the thrombin molecule in Fig. 8) 
surrounded by the exposed side chains of Arg 126, Lys 
236, Lys 240, and Arg 93, and of the electrostatically 
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Fig. 8. Main chain structure of luiinan thrombin overlaid with a posiiive (hk3 kcal/mol) and a negative (—5 kcal/mol) electrostatic 
potential surface. Standard view as in Fij^ure K 



more compensated residues Lys 235, Arg 101, and Arg 
233 in the more distant neighborhood (see Fig. 18). This 
site presumably represents the heparin-binding site of 
thrombin and will be discussed further below. 

A second surface region of high positive charge density 
(patch pi) extends along the convex thrombin surface be- 
tween Lys 149E and Lys 1 10 (on the right-hand side of 
the active-rsite cleft {Fig. 8]). This positively charged patch 
carries nine basic amino acid side chains (Lys 149E, Arg 
67, Arg 73, Arg 75, Arg 77A, Lys 81, Lys 36. Lys 109, 
and Lys 110) whose charges are only partially compen- 
sated by neighboring car boxy late groups. Also this fi- 
brin(ogen) secondary binding exosite will be shown in 
more detail below. 

One larger surface patch of negatively charged residues 
(Figs. 7, 8) extends around the active site of thrombin 
(patch nl). It consists of seven aspartic and glutamic acid 
residues arranged within (Asp 189) and around the spec- 
ificity pocket: Glu 192 (without protein counter-charge); 
Asp 102 (active-site residue, hydrogen bonded with His 
57); Asp 194 (involved in a relatively strong internal salt 
bridge with the ammonium group of He 16); and Glu 217, 
Glu 146, and Glu 97A, each involved in salt bridges with 
positively charged residues in the neighborhood. The 
many negative charges certainly assist in orienting ap- 
proaching substrates and affect binding; the electrostatic 
nature of the catalytic residues located at the edge of this 
patch is governed more by their adjacent neighbors, how- 
ever (see below). 

Solvent molecules and selected residues 

Thrombin possesses four disulfide bridges (see Kinemage 
2). The first (Cys 1-Cys 122) links the A- and B-chains, 
equivalent topologically to that in chymotrypstn(ogen) 
(Table 8). Disulfide bridge Cys 42-Cys 58 has a similar 
conformation to that observed in all other vertebrate ser- 



ine proteinases of known tertiary structure. Cys 168-Cys 
182 exhibits the same hand in thrombin as in chymotryp- 
sin but opposite to that in trypsin and kallikrein (Bode 
et al., 1983); however, this bridge differs in detail from 
all other related proteinases in spite of a quite similar en- 
vironment. The fourth disulfide bridge, Cys 191 -Cys 220, 
is again of a similar conformation in thrombin as in other 
related proteinases. 

The 409 (423; see Table 15) localized solvent molecules 
in the PPACK-thrombin structure account for almost 
30% of all solvent (water) molecules in the crystals. Their 
B- values range from 3 to 108 A^ (3 to 88 A^), the 
average B-value is 50 (45) A^. Thirty-four of these sol- 
vent molecules (Table 9) are located within the protein 
domain (and are in direct contact with at most one bulk 
solvent molecule); they are considered to be an integral 
part of the thrombin structure. Seventeen of them are 
found to be topologically equivalent (i.e., with similar 
protein environment and deviations up to 1 .2 A after op- 
timal superposition of both protein structures) to waters 
found in benzamidine-free bovine trypsin (DEBA [Bode 
et al., 1976; see also Table 3 in Bode & Sch wager, 
1975a]). 

Several of these internal solvents are grouped together 
forming linear water clusters, A particularly large cluster 
is made by eight adjacent internal solvent molecules (sol- 
vent molecules 315, 316, 320, 323, 342, 343, 345, and 414 
[Table 9|) encircled by the thrombin insertion loop Tyr 
184A-Gly 188 and segment Asp 221 -Tyr 225 behind the 
side chain of Asp 189; Sol 323 mediates the connection 
to the bulk water (see Fig. 19). Another large internal sol- 
vent cluster comprises five water molecules, extending 
around Sol 360 close to the 70-80 loop. Sol 305 in the 
back of the specificity pocket (see Fig. 22) is fully sepa- 
rated from bulk solvent molecules in PPACK-thrombin 
by the Arg 31 side chain; as with the equivalent Sol 416 
in trypsin (Bode & Sch wager, 1975a), it plays an impor- 



Crystal structure of human thrombin 

Table 7. Charged groups of human a-thrombin 
that form charge patches 

Interaction Nearest charged neighbor 



Charge 




energy 


(distance and interaction en 


patch 


Residue 


(kcal/mol) 


A/residue/(kcal/mol) 


Positive patches 








Lys 36 




<6 7/Lvs 109/0 41 




Arg 73 


0.6 


(8.2/Lys I49E/0.3) 




Arg75 


0.0 


(6.6/Glu 77/-0.5) 




Arc 77 A 


0.1 


(>10./) 




Lvs 81 


0.3 


(9.9/Lys 110/0.1) 




Lvs 109 


0.4 


(6.7/Lys 36/0.4) 




Lv5 1 10 


0.2 


(V.4/Lys 109/0.1) 






—0.2 


(>10./ /) 




Lvs 149E 


0.2 


(6.6/Asp 21/— 0.3) 


p2a 


Arg 93 


L4 


(3,6/Arg 101/1.3) 




Arg 101 


0.4 


(3.6/Arg 93/1.3) 




Arg 126 


0.6 


(4.3/Lys 236/0.4) 




Arg ZjD 


0.3 


(4.9/ Asp 178/-0.7) 




Lvs 235 


-0.5 


(6.8/Asp 125/-0.5) 




Lvs 236 


0.7 


(4.3/Arg 126/0.4) 




Lvs 240 


0.3 


(9,7/Lys 236/0.1) 


p2b 


Arg 165 


0.2 


(4.5/Lys 169/0.8) 




Lys 169 


0.3 


(4,5/ Arg 165/0.8) 


p2c 


Arg 97 


0.0 


(>10./ /) 




Arg 173 


1.0 


{8.6/GIu 217/— 0.2) 




Arg 195 


LO 


(4.1/Glu 97 A/ -0.9) 


Negative 


patches 






nla 


Glu 39 


— 1 . / 


l3.o//\rg jj/ — j.jj 




Asp60E 








Glu 61 




i/Liys o// — 1-^/ 




Asp 63 




V 1 .Hf Lys JO/ — u.*»/ 




Clu 97A 


— 1 1 


(4 l/Are 175/— 0 9) 




Glu 146 


-0.5 


(3.9/Arg 221 A/- 1.4) 




Asp 189 


1.1 


(6.6/A$p 221/0.7) 




Glu 192 


0.4 


(7.7/Glu 146/0.4) 




Glu 217 


-1.0 


(3.7/Lys 224/- 1.4) 




Asp 221 


0.7 


(4.5/Arg 187/-1.2) 


nib 


Glu 18 


0.4 


(6.0/ Asp 221/0.6) 




Glu 164 


-0.9 


(3.5/Lys 185/- 1.5) 




Asp 170 


-0.1 


(8.4/Lys 169/-0.2) 




Asp 186 A 


0.3 


(6.4/Glu 186B/0.4) 




Glu 186B 


0.0 


(5.2/Lys 185/-0.8) 




Asp 222 


-0.7 


(3.0/ Arg 187/- 1.7) 


n2 


Asp lA 


-2.1 


(3.0/Lys 9/- 1.8) 




Glu IC 


-0.3 


(7.6/ Arg 206/ -0.5) 




Asp 49 


-1.0 


(4.0/ Arg 50/-1.5) 




Glu 247 


-0.8 


(4.0/Ar8 50/-1.S) 




CTR" 247 


-0.1 


(5.8/Glu 247/0.5) 


o3 


Glu 14H 


0.1 


(7.4/Glu 14E/0.4) 




Asp 14L 


0.3 


(6.8/CTR 15/0.2) 




CTR 15 


0.2 


(6.8/Asp 14L/0.2) 




Glu 127 


-0.1 


(7.3/Asp 125/0.2) 



* Carboxy-terminus. 



tant role as an anchor for the distal positively charged 
groups of substrate PI residues. 

The aromatic residues are relatively evenly distributed 
over the thrombin molecule. Edge-on-plane as well as 



441 



Table 8. Dihedral angles of the disulfide bridges in thrombin 



Disulfide bridge 


1(1 


a 


«3 


kA 


x5 


Hand 


Cys 1-122 


68 


11 


97 


-63 


179 


Right 


Cys 42-58 


-82 


-163 


-95 


-78 


-77 


Left 


Cys 168-182 


-62 


-61 


-90 


-77 


90 


Left 


Cys 191-220 


-157 


46 


88 


-170 


-55 


Right 



planar stacking of adjacent aromatic rings can be ob- 
served (e.g., Tyr 60A-Trp 60D and Phe 225-Trp 215). 
The concentration of four relatively exposed indole side 
chains (Trp 60D, Trp 148, Trp 96, and Trp 215) around 
the substrate-binding site is particularly remarkable (see 
below). 

The thrombin A-chain and its interaction 
with the B-chain 

The thrombin A-chain with its multiplc-turn/helical con- 
formation (see Fig, 3) is fixed to the B-chain surface op- 
posite to the substrate-binding cleft (Fig. 2; Kinemage 3), 
It has an overall boomeranglike shape and nestles in such 
a way toward the B-chain that it merges to the common 
molecular surface without significant breaks (with the ex- 
ception of the extending carboxy-terminal segment and 
the exposed elbow around Lys 10-Ser 1 1 and Glu 13- Arg 
14D [Fig. 9]). The buried surface between the chains 
amounts to 1,200 A^; this number can be compared to 
the intermolecular surface of 1,725 A^, buried between 
a-thrombin and hirudin in their complex (Rydel et al., 
199J). 

Besides the covalent disulfide bridge connection be- 
tween Cys 1 and Cys 122 the interaction of the A- with 
the B-chain is mediated through charged side chains. Only 
three inter-main-chain hydrogen bonds exist between A 
and B (see Fig. 3), whereas another 20 hydrogen bonds 
are formed utilizing at least one side chain (Fig. 10). Six 
of the latter represent simultaneously (largely) buried salt 
bridges (see Table 6), and a charged side-chain group par- 
ticipates in a further 10 (such hydrogen bonds involving 
charged groups are particularly strong). 

Stabilization within the A-chain occurs preferentially 
through polar and salt-bridge interactions (in particular 
in its central part [see Figs. 9, 10]). Fourteen of the intra- 
A-chain hydrogen bonds are made between main-chain 
groups (Fig. 3). Nine oppositely charged side-chain 
groups are involved in four of the six residual hydrogen 
bonds and form part of the extended salt-bridge clusters 
(Table 6); three of them contribute simultaneously to the 
interchain salt bridges mentioned above (Fig. 10). 

The amino-terminal segment of the A-chain up to 
Glu IC is only defined by relatively weak density (see 
Fig. 28) in the PPACK-a-thrombin crystals but is placed 
unequivocally. This segment runs along a ridge of the B- 
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Table 9. Internal water molecules and their hydrogen-bond partners as found in thrombin PPACK^ 



PPACK-humaii a-thrombin 



Bovine trypsin 



PPACK-human a-thrombin 



Hydrogen-bond partners 



Deviation of 
equivalent 



Hydrogen-bond partners 



Bovine trypsin 

Deviation of 
equivalent 



Solvent 






Distance 


B 


Solvent 


molecules 


Solvent 






Distance 


B 


Solvent 


molecules 


number 


Atom 




(A) 


(A2> 


number 


(A) 


number 


Atom 




(A) 


(A) 


number 

— - 


(A) 

— — . 


304 


^ 

Asn 143 


N52 


(2.9) 


12 


**- 




— 

325 


Met 26 


- • 

0 


— • ■ 

(2.7) 


■. 

15 










f\ 
\j 












Arg 137 


NH 


(3.0) 










TTir 147 


N 


(3 2) 










Sol 454 


OH 


(2 7) 








305 


Trp2l5 


O 


(3.4) 


4 


416 


0.9 


327 


Pro 5 


N 


(3.5) 


31 


— 


— 




Phe 227 


O 


(3.3) 










Leu 6 


N 


(3.1) 






































Tvr 228 


OH 


t3 7) 










A«n 1 16 


0 










306 


Mel 210 


o 


(3.0) 


14 


722 


0.7 


332 


Arg 101 


N 


(2.9) 


28 


703 


0.3 




His 230 


N61 


(3.2) 










Asp 102 


N 


(3.5) 










Phe 232 


N 


(2.9) 










Asn 179 


0 


(2-9) 








308 


Ser 45 


07 


(3.1) 


13 


406 


0.7 




oOl jOS 




I*. '/ 










Leu 53 


0 


(2.8) 








334 


Ser 171 


O7 


(3.0) 


15 


— 


— 




Gly 196 


0 


(2.6) 










Tyr 225 


N 


(2.9) 










OIn 209 




(2.9) 










Sol 41 1 


OH 


(2.9) 








310 


Glu 217 


0 


(2.7) 


8 


415 


0.3 


335 


Trp 148 


Nel 


(2.9) 


23 


— 


— 




Gly 219 


0 


(3.5) 










Glu 192 


Oel 


(3.1) 










Cys 220 


kt 
N 


(3.4) 










Cys 220 


N 


(3.4) 










Arg 22 1 


N 


(3,0) 










Cys 220 




(3.2) 








311 


Trp 141 


N 


(3.1) 


19 


410 


0.4 


337 


Glu 217 


N 


(2.9) 


22 


— 


— 




Gly 193 


0 


(2.8) 










Tyr 225 


0 


(2.6) 










Sol 314 


OH 


(3.2) 










Sol 310 


OH 


(3.4) 








314 


Glu 30 


Nc2 


(3.2) 


10 


701 


0.1 




Sol 411 


OH 


(3.3) 










Thr 139 


0 


(2.7) 








342 


Arg 221 


0 


(2.7) 


10 


— 


— 




Asp 194 


0 


(3.0) 










Lys 224 


0 


(2.8) 










OOI ill 


vlrl 












Sol 316 


OH 


(2.8) 








315 


Ala 183 


0 


(3.0) 


16 


704 


0.3 




Sol 343 


OH 


(2.4) 










Asp 189 


062 


(2.8) 










Sol 414 


OH 


(3.3) 










Gly 226 


N 


(3.5) 








343 


Tyr 184 


0 


(2.9) 


10 


— 


— 




Tyr 228 


OH 


(2.8) 










Lys 224 


0 


(2.6) 








316 


Asp 221 


051 


(2.7) 


11 


— — 






Sol 320 


OH 


(3.1) 










Sol 342 


OH 


(2.5) 










Sol 342 


OH 


(2.4) 










Sol 320 


OH 


(2.8) 










Sol 345 


OH 


(2.6) 








318 


Glu 30 


0 


(3.0) 


17 


708 


0.2 


345 


Gly 188 


0 


(2.8) 


27 


705 


0.5 




Arg 67 


0 


(2.7) 










Sol 315 


OH 


(3.3) 










Gly 69 


N 


(3.3) 










Sol 320 


OH 


(3.1) 










Lys 70 


N 


(3.2) 










Sol 343 


OH 


(2,6) 








319 


[le 16 


N 


(2.7) 


3 


430 


0.3 


359 


Tyr 117 


0 


(3.0) 


39 








Gly 140 


0 


(3.1) 










Sol 358 


OH 


(3.3) 










Gly 142 


N 


(2.9) 










Sol 360 


OH 


(2.7) 










Gly 142 


0 


(2.7) 










Sol 457 


OH 


(3.3) 








320 


Tyr 184 


N 


(2.9) 


13 






360 


Ser 27 


0 


(3.4) 


15 


709 


1.2 




Gly 188 


0 


(3.0) 










Lys 70 


0 


(2.8) 










Sol 316 


OH 


(2.8) 










Sol 322 


OH 


(3.0) 










Sol 343 


OH 


(3.1) 










Sol 359 


OH 


(2.7) 










Sol 345 


OH 


(3,1) 










Sol 457 


OH 


(3.0) 








321 


Arg 137 


0 


(3.2) 


21 


721 


0.4 


368 


Leu 99 


0 


(2.8) 


16 


408 


0.4 




Thr 139 


O7 


(2.8) 










Ser 214 


Oy 


(2.9) 










Pro 198 


0 


(3.0) 










Sol 322 


OH 


(2.7) 










Sol 360 


OH 


(3.0) 








386 


Val 241 


0 


(2.6) 


21 






323 


Lys 185 


0 


(2.8) 


28 








lie 242 


0 


(2.8) 










Lys 186D 


0 


(3.0) 










Gly 244 


N 


(3.2) 










Sol 324 


OH 


(2.9) 










Phe 245 


N 


(3.2) 
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Crystal structure of human thrombin 
Table 9. Continued 



PPACK-human a -thrombin Bovine trypsin 



Hydrogen-bond partners Deviation of 
equivalent 



Solvent 
number 


Atom 




Distance 

(A) 


B 
(A^) 


Solvent 
number 


molecules 
(A) 


410 


Arg 165 
Arg 165 
Thr 177 
Met 180 


Ne 
NH2 
O 
O 


(2.7) 
(3.2) 
(3.0) 
(3.1) 


25 






411 


Glu 217 
Sol 334 
Sol 337 


Oc2 
OH 
OH 


(3.0) 
(2.9) 
(3.3) 


25 






414 


Tyr 184 
G!y 223 
Sol 342 


O 
N 

OH 


(2.7) 
(2.7) 
(3.3) 


21 






434 


Ser 27 
Trp 29 
Sol 321 
Sol 325 


O7 
Nel 
OH 
OH 


(3.3) 
(3.0) 
(2.9) 
(2.7) 


25 






457 


Ser 27 
Ser 27 
Sol 359 
Sol 360 


N 
0 

OH 
OH 


(2,9) 
(3.3) 
(3.3) 
(3.0) 


35 


717 


0,7 


626 


Trp 51 
Lys 107 


Ncl 

Nr 


(3.0) 
(3.4) 


44 


713 


0,8 



* Topologically equivalent solvent molecules of bovine trypsin 
(benzamidine-free, DEB A IBode et al,. 1976]) are added together with 
the r.m.s. deviation obtained after optimal superposition of both en- 
zyme models (for deviations up to 1 .2 A). 



chain with only a few van der Waals contacts between 
both chains. The position of the amino-terminal Thr IH 
is almost in the center of the carboxy-terminal (nonheli- 
cal) half-turn of the B-chain (see Fig. 9). It is close to the 
negatively charged side chains of Asp 243 and Glu 247 
but not within direct hydrogen bond distance (see Ta- 
ble 5). 
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The equivalent A~chain segment seems to be similarly 
arranged in bovine thrombin; in the crystals of human 
thrombin bound to hirudin (Rydel et al., 1991) it is defi- 
nitely in a quite different position and conformation » pre- 
sumably due to intermolecular contacts. The A-chain 
amino-terminus therefore possesses a high degree of flex- 
ibility and can adapt to different crystal environments but 
seems inherently to prefer the location found in PPACK- 
thrombin. The preceding segment, cleaved off autocat- 
alytically upon activation in human prothrombin, might 
extend away from the molecular surface thus facilitating 
enzymatic attack; in bovine thrombin the equivalent 13- 
residue segment, which is not cleaved off in the course of 
activation, is completely undefined by electron density, 
indicating its complete disorder. 

The central, more rigid part of the A-chain (between 
Asp 1 A and Tyr 14 J) runs in a shallow curved groove of 
the B-chain lined by B-chain segments Ser 20-Met 26 and 
Lys 202-Trp 207 (Fig. 9). The amino-terminal half of this 
central segment, organized mainly in tightly packed turns, 
is stabilized considerably by several intrachain and inter- 
chain hydrogen bonds (Fig. 3) and by salt-bridge clusters 
(Table 6). Asp 14 and Lys 202, with their side chains 
pointing toward the molecular center and toward the pe- 
ripheral A-chain, respectively, are at the center of the 
charge network (Fig. 10). The carboxylate group of Asp 
14 opposes the guanidyl group of (B-chain residue) Arg 
137 almost symmetrically; furthermore, it is involved in 
favorable hydrogen bond/salt bridge interactions with 
the guanidyl group of (A-chain residue) Arg 4. Glu 8 
makes hydrogen-bond/electrostatic interactions with this 
Arg 4, but also with Lys 202, which further interacts with 
the carboxylate group of Glu 14C. 

Most of these salt bridges are more or less completely 
buried (Table 6), Electrostatic interactions therefore con- 
tribute significantly to the intrachain stability as well as 
to the A-B interaction; almost 90^o of the total electro- 
static energy of the interaction between A and B is cal- 
culated to be due to these salt clusters. Six out of 7 




Fig. 9. Back view of the human throm- 
bin B-chain (thin connections) and its 
Connolly surface, displayed together 
with the thrombin A-chain (thick con- 
nections). This figure is obtained from 
Figure 1 after an approximately 1 80" ro- 
tation about a vertical axis. The A-chain 
runs from left (close to the B-chain 
carboxy-terminus Glu 247) to right along 
a boomeranglike shallow groove of the 
B-chain. 
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Fig. 10. Schematic drawing of the ionic interactions (as far as involved 
in hydrogen bonding) within the thrombin A-chain and between the 
thrombin A-chain (bold connections) and the B*chain (double connec- 
tions). The arrangement is similar to that in Figure 9, i.e., corresponds 
to a back view of the thrombin molecule. 



A-chain residues involved in intra- and interchain salt 
bridges are shared by human and bovine thrombin; only 
Glu 13, which makes a weak surface-located salt bridge 
in human thrombin, is replaced by a glutamine in the bo- 
vine species. In spite of these impressive electrostatic 
interactions, however, the A-chain is apparently neither 
of great importance for B-chain folding nor for the (ther- 
modynamic) stability of the B-chain globule (Hageman 
et al., 1975). 

The carboxy-terminal A-chain segment between Thr 
14B and Tyr 14J is organized in an amphiphilic a-helix 
of 1 1 turns (Figs. 3, 9). The side chains of Leu 14G and 
Tyr 14J are involved in hydrophobic contacts between the 
A- and the B-chains. The phenolic side chain of Tyr 14J 
exhibits enhanced flexibility. The segment from lie 14K 



to the carboxy-terminal Arg 15, which projects away 
from the main molecular body, is only weakly defined by 
electron density. The carboxy-terminus at Arg 15 is po- 
sitioned close to the guanidyl group of Arg 165 of a 
symmetry-related molecule. Segment He 14K-Arg 15 is 
arranged quite differently in bovine thrombin as well as 
in the human thrombin-hirudin complex; it therefore ap- 
pears to be inherently flexible. 

A-chain segment Cys 1-Leu 6 is topologically equiva- 
lent to the first six amino acid residues of the activation 
peptide of chymotrypsin(ogen) and connected in a simi- 
lar manner via disulfide bridge Cys 1-Cys 122 to the 
thrombin B-chain. The more carboxy-terminal part of the 
thrombin A-chain is of a different conformation and 
considerably longer than the corresponding pro-part of 
chymotrypsinogen. The interaction surface of the throm- 
bin B-chain has a relatively similar contour to that of 
chymotrypsinogen; in thrombin, however, most of the 
interacting side chains are longer and more |K>lar (see 
Fig. 11). 

In human cv-thrombin, Arg 15 is about 20 A away from 
the equivalent position in chymotrypsinogen (Fig. 11). 
Assuming an A-chain conformation identical to a-throm- 
bin up to Tyr 14J, Arg 15 of a prothrombin molecule 
would not be able to attain an equivalent position, even 
after a complete extension of the connecting polypeptide 
segment. Presumably, therefore, thrombin segment lie 
16-Ser 20 occupies a different position in prothrombin. 

As mentioned above, the B-chain alone appears to be 
thermodynamically stable and to retain clotting activity 
(Hageman et al., 1975). Thus, the A-chain would not 
seem to be a structural element required for the activity 
of thrombin; its integration into the B-chain might rather 
be necessary for correct activation cleavage and for other 
functions of prothrombin. As it is positioned opposite to 
the active site, interaction of the A-chain with larger 
bound substrates and inhibitors is not possible, it is, how- 
ever, conceivable that amino-terminally linked pro-parts. 




s 
s 




Fig. 11. Parts of the thrombin A-chain 
(dashed connections) and B-chain (thick 
connections) optimally superimposed with 
equivalent parts of bovine chymotrypsino- 
gen (thin connections). Polypeptide segment 
10-20 of chymotrypsinogen deviates consid- 
erably from the equivalent parts of both 
thrombin chains; the carboxy-terminal res- 
idue Arg IS of the thrombin A-chain is quite 
distant from the equivalent Arg IS' of the 
(nonactivated!) chymotrypsinogen. This 
back view is similar, but not identical, to 
that used in Figure 9. 



Crystal structure of human thrombin 

such as fragment 1.2 of meizothrombin, could fold back 
and project into the substrate-binding region, thus im- 
pairing activity toward macromolecular substrates, as re- 
cently shown for fibrinogen or Factor V (I>oyIe & Mann, 
1990). 



The thrombin B-chain 

The tertiary structure of the thrombin B-chain is ho- 
mologous to the reactive domains of the other trypsin- 
like serine proteinases of known spatial structure (see 
Figs. 2, 4, 5 and Bode et al., 1989b). A remarkable fea- 
ture of the thrombin B-chain globule is, however, a se- 
ries of more elongated and exposed loops, in particular 
around the active-site cleft. The unique specificity of 
thrombin for macromolecular ligands involves interac- 
tions at the surface of the molecule. The surface segments 
of the thrombin B-chain exhibit new features compared 
with chymotrypsinogen (as can be inferred from Table 3, 
Figs. 4, 5 and as has been presented in more detail by 
Bode et al., 1989b). Among them are (1) segment Arg 
173-Ile 174; (2) loop segment 95-100; (3) the large inser- 
tion loop Leu 59-Asn 62 (the 60-insertion loop or Trp 
60D loop), and (4) loop segment 34-42, which together 
form the "north rim" of the active-site canyon (Bode et a!,, 
1989b); (5) the large insertion loop 145-150. which to- 
gether with the trypsinlike segment Gly 216-Cys 220 
forms the corresponding "south rim"; (6) loop segment 
Tyr 184A-GIy 188, and (7) the trypsinlike loop segment 
Cys 220-Tyr 225, which together make up the thrombin 
structure behind the specificity pocket; and (8) loop seg- 
ment Ser 203-Arg 206, which is particularly involved in 
the interaction with the A-chain (see above). 

Most of these surface loops project out of the molec- 
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ular surface. However, with the exception of loop seg- 
ment 145-150 (see above and below), all of these loops 
are remarkably rigid. Proline and tryptophan residues 
seem to contribute to their rigidity. 

The positions of some polypeptide segments of the 
a-thrombin B-chain particularly susceptible to proteolytic 
cleavage are emphasized in Figure 12 and Kinemage 2. 
Some of these cleavage sites (in particular those generated 
autocatalytically or through trypsin action) may only 
exist under laboratory conditions and might play no im- 
portant role in vivo; they are, nevertheless, extremely 
valuable for investigating the importance of distinct 
thrombin sites for various thrombin functions. Most of 
these sites are located at exposed surface loops. One of 
them (Lys 149-Gly 150, see below) is not readily acces- 
sible to an attacking proteolytic enzyme; another one 
(Arg 67-lle 68) is even completely buried in the interior 
of .the thrombin molecule and only accessible after prior 
cleavage in the 70-80 loop (see below). None of the pre- 
ferred proteolytic cleavage sites shown in Figure 12 ex- 
hibit the canonical conformation found in substratelike 
binding small protein inhibitors (Huber & Bode, 1978; 
Laskowski & Kato, 1980) (see Table 10). Most of these 
sites are, however, flexible enough to allow conforma- 
tional adaptation to the binding sites of the proteolytic 
enzymes. The rigid, noncomplementary activation cleav- 
age site of chymotrypsinogen (Wang et al., 1985), how- 
ever, illustrates nicely that the susceptibility of peptide 
bonds is governed by other (hitherto unknown) properties. 

Thrombin sites of particular importance 

lie I6-Asp 194 and its role in activation 

Similar to the other activatable trypsinlike serine pro- 
teinases, the amino-terminal residue He 16 (and part of 




Fig. 12. a-carbon chain of the A-chain (thick connections) and the B-chain (thin connections) of human a-thrombin displayed 
together with the full amino acid residues placed at well-characterized proteolytic cleavage or replacement sites in naturally 
occurring thrombin variants. The bound PPACK is overlaid. Cleavages at Arg 77A-Asn 78 and Arg 67-ne 68 give rise to 
^-thrombin; cleavages at Trp 148-Thr 149, Ala 149A-Thr 149B, and Lys 149E-Gly 150 yield f-. and 7-ihrombin. respec- 
tively. In the genetic thrombin variants Quick I. Quick II, and Tokushima, Arg 67, Gly 226, and Arg 101 arc replaced by cys- 
teine, valine, and tryptophan, respectively. Standard view as in Figure I . 
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Table 10. Main-chain conformational angles of some <x-thrombin cleavage sites compared with 
the proteinase-binding loops of some protein inhibitors around the scissile peptide bond 



Scissik peptide bond 


P2 


PI 


pr 


P2' 


Arg77A-Asn 78 


-98/95 


-48/-55 


-115/29 


-II4/-52 


/?': Arg 75-Tyr 76 


-I24/-30 


-85/151 


-67/118 


-98/95 


/9: Arg 67/IIe 68 


-130/124 


-111/118 


-115/143 


79/8 


7,: Lys 149E.GIy 150 


110/164 


-75/149 


-92/-22 


-127/137 


r: Trp 148-Thr 149 


-I01/-6 


-73/-8 


-64/128 


-163/154 


c: Ala 149A>Asn I49B 


-137/150 


-64/128 


-163/154 


-100/153 


BPTI: Arg 151-AU 161 


-82/160 


-120/34 


-88/179 


-132/84 


Eglin c/subtilisin: Leu 4S-Asp 46 


-61/143 


-115/43 


-96/169 


-117/110 


OMTKY3: Uu 181-Glu 191 


-76/155 


-99/29 


-82/156 


-104/106 


PPACK 


-74/139 


-87/ 







Val 17) of the thrombin B-chain is buried within the pro- 
tein moiety (see Kinemage 1). The internal salt bridge 
made between the a-ammonium group of He 16 and the 
side-chain carboxylate group of Asp 194 is of consider- 
able strength (3.0 kcal/mol [see Table 6]). According to 
our electrostatic calculations, the pK of this ammonium 
group is shifted to about 10.5, due mainly to this strong 
interaction. The deprotonation of this ammonium group 
and the concomitant disruption of the salt bridge is gen- 
erally believed to result in a transformation of the active 
enzyme structure toward a zymogen-like structure (for a 
more detailed discussion of the equivalent case in trypsin, 
see Bode [1979]). This transformation is probably accom- 
panied by a decrease in activity with increasing pH val- 
ues in the alkaline range. Indeed, the thrombin hydrolytic 
activity toward typical synthetic />-nitroanilide substrate 
seems to be governed by an ionizable group of a pK value 
of almost 10 (see Lottenberg et al., 1983; De Cristofaro 
&DiCera, 1990). 

In prothrombin the intact Arg 15-lle 16 segment must 
be situated on the molecular surface (see Fig. 11). It 
would be reasonable to assume that segment 191-194 and 
the adjacent residues in prothrombin might have a simi- 
lar conformation as the equivsilent residues in chymotryp- 
sinogen (Freer et al., 1970; Wang et al., 1985); i.e., that 
Asp 194 might project into the active-site cleft and loop 
segment 191-194 folded inward with residue 192 occupy- 
ing the specificity pocket. However, the thrombin residue 
lopologically equivalent to His 40 of trypsinogen and 
chymotrypsinogen is Leu 40; thus. Asp 194 of prothrom- 
bin could not, as in the chymotrypsinogen structure, ac- 
cept a (probably stabilizing) hydrogen bond from the side 
chain of residue 40. In addition, according to model- 
building experiments the polar side chain of Glu 192 of 
prothrombin would probably not find a favorable envi- 
ronment in the protein interior of prothrombin (in contrast 
to the equivalent methionine side chain of chymotrypsin- 
ogen). Therefore, both substitutions (at positions 40 and 
192) seem to favor destabilization of the thrombin zymo- 
gen state relative to the active state. As mentioned earlier 



(Bode, 1979), such an equilibrium shift toward the active- 
state-like conformation could facilitate formation of pro- 
teolytically active prothrombin-protein complexes such 
as the prothrombin-staphylocoagulase complex (see 
Kawabata et al., 1985). 

The insertion loop Leu 59-Asn 62 
(60 insertion loop) 

The polypeptide segment Cys 58-Leu 64 of thrombin 
is nine residues longer than the corresponding segment of 
chymotrypsin (see Table 3; Fig. 4) and folded in a char- 
acteristic hairpin loop, which projects (in contrast to the 
digestive enzymes) considerably out of the molecular sur- 
face (Figs. 4, 5, 13A). This large insertion loop plays an 
important role in the selection of susceptible (macromo- 
lecular) substrates and inhibitors, i.e., for recognition and 
specificity of thrombin (Bode et al., 1989b). Segment Leu 
59-Phe 60H is folded in an approximate 3io-heIical turn, 
which is terminated by Glu 60E in a left-handed helical 
conformation (Table 2). The phenolic side chain of Tyr 
60A points toward the S2 subsite, covering it, and pack- 
ing tightly against Pro 21 of the bound PPACK molecule 
(see below). The indole moiety of Trp 60D is almost fully 
exposed to bulk solvent molecules. This part of the ex- 
posed loop seems to be particularly stabilized by the two 
intervening proline residues, which, in addition, pack 
against Trp 96 of an adjacent loop (see Fig. 13A and 
Kinemage 4). In the PPACK-thrombin crystals the whole 
loop (including all side chains) is well defined by electron 
density (Fig. 13B) and also makes intermolecular contacts 
with residues of the B-chain carboxy-terminus of a 
symmetry-related thrombin molecule. In spite of very dif- 
ferent crystal contacts in the thrombin-hirudin (Rydel 
et al., 1991) and in the bovine thrombin structure, this 
loop is of similar structure, i.e., its internal conformation 
seems to be stable. 

The well-defined (carbohydrate carrying) asparagine 
side chain of the subsequent Asn 60G extends from the 
loop surface away from the active-site cleft (Fig. 13 A). 
In the PPACK-thrombin crystals there is free intermo- 
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Fig. 13. A: The 60 insertion loop and the glycosylation site of human a-thrombin (thin connections) displayed together with 
part of PPACK (thick connections) and the thrombin Connolly surface. Loop segment Tyr 60A-Asp 60E projects particularly 
out of the molecule; it probably forms part of the chemotactic domain of thrombin. Asn 60G is pointing away from the active-site 
cleft. Standard view as in Figure I . B: Section of the Hnal electron density around the 60 insertion loop superimposed with the 
equivalent parts of human a>thrombin and some localized solvent molecules (crosses). The electron density representing the 
Asn 60G side chain (upper right corner) extends toward the top; it is. however, not appropriate to accommodate a distinct 
Asn 60G W-linked sugar chain; instead, this density has been interpreted as two fixed solvent molecules. Contour surface at O.80 
calculated for the whole map. Standard view as in Figure 1 . 



lecular space but no continuous electron density that 
would indicate the carbohydrate linked to it (Fig. 13B); 
the isolated density blobs visible beyond the carboxamide 
group have therefore been interpreted as solvent mole- 
cules. The bound, presumably disordered, oligosaccha- 
ride (consisting of 12 sugar units in a branched chain 
[Nilsson et al., 1983)) must extend away from the active- 
site cleft (Fig. 13 A). Thus, it should not interfere with 
macromolecular substrates that bind mainly along the 
cleft; this is in agreement with the lack of any effect on 
enzymatic (clotting and esterase) and cellular activities of 
thrombin on its removal or destruction (Horne & Gral- 
nick, 1983). Concanavalin A, in binding to the a-D-man- 
nose of this sugar chain, might on the other hand come 
into conflict with an approaching fibrinogen molecule, in 
agreement with experimental results (Skaug &. Christen- 



son, 1971; Karpatkin & Karpatkin, 1974; Hageman et al., 
1975). 

The compact and relatively rigid segment Tyr 60A-Trp 
60D is part of a tetradecapeptide with the sequence Tyr 
60A-Leu 65 of known extracellular matrix-binding and 
chemotactic/growth factor activities (Bar-Shavit et al., 
1984, 1986, 1989). The intact catalytic residues are not a 
prerequisite for chemotactic activity toward monocytes 
and neutrophils (Bizios et al., 1986). U is conceivable that 
such activities could be attributed to this exposed Tyr 
60A-Trp 60D loop of a-thrombin for its rather unique 
ridgelike architecture. In thrombin-hirudin this project- 
ing thrombin loop is smoothed out by the hirudin Pro 
46HI-Glu 49H1 segment packing alongside of it (Rydel 
et al., 1991); thus, partial masking upon hirudin binding 
could account for the observed abolition of chemotactic 
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activity (Bizios et ai., 1986; Prescott et al.. 1990). Extra- 
cellular matrix-bound thrombin failed to form a complex 
with antithrombin Ml, but still exhibits an open proteo- 
lytic site (Bar-Shavit et ah, 1989). This is in agreement 
with recent model-building studies, which show that the 
core of antithrombin III should pack against this project- 
ing loop thus burying it and removing it from contacts 
with receptors (unpubl.). 

According to a search in 7,068 sequences, this YPPW 
sequence seems (with the exception of a presumed poly- 
merase subunit from a vesicular stomatitis virus) to be 
confined to thrombin. Preliminary molecular dynamics 
simulations with an isolated and TV-methylated Tyr-Pro- 
Pro-Trp-Asn-amide peptide indicate that the distinct con- 
formation observed in thrombin might also be a low 
free-energy conformation in the isolated peptide; recent 
NMR experiments show that the equivalent hexapeptide 
does, however, not possess a single preferred conforma- 



tion but interchanges between at least two conforma- 
tional states (unpubl.). 

Surface structure around segment 67-80 
(the fibrin(ogen) secondary binding exosite) 

The loop Lys 70-Glu 80 (Fig. 14A; Kinemage 5) of 
thrombin is topologically similar to the so-called calcium- 
binding loop of bovine trypsin (Bode & Schwager, 
1975a,b) (Fig. 15A) and of the other digestive serine pro- 
teinases (see Meyer et al., 1988). The central residue of 
this loop is Lys 70. Its distal ammonium group occupies 
a site equivalent to the calcium ion in the pancreatic en- 
zymes (Fig, 15 A). In the thrombin structure, however, of 
the six oxygen ligands of calcium found in bovine tryp- 
sin only three are connected to Lys 70Nf through short 
hydrogen bonds, in an almost tetrahedral geometry (Ta- 
ble 11). It is noteworthy, that the distal ammonium group 
of Lys 70, which is well below the molecular surface in 
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Fig. 14. The nbrinogen secondary binding exosite of human a-thrombin. The view is similar to the standard view of Figure I. 
A: Stick drawing of loop 67-80 and its environment. Lys 70 is located at the center of an extended buried salt bridge network 
made with GIu 77, Glu 80, and Arg 67. Cleavage of peptide bond Arg 77A-Asn 78 leads to formation of noncoagulant 
thrombin, in which this loop is presumably unfolded, B: Surface residues of the exosite displayed together with the Connolly 
surface. 



Crystal structure of human thrombin 
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Fig. 15. A: Loop Lys 70-Glu 80 of human a-thrombin (thick connections) superimposed by the calcium-binding loop, the bound 
calcium ion, and two adjacent solvent molecules (crosses) of bovine trypsin (thin connections [Bode & Schwagcr, 1975a,b)). 
Nf of Lys 70 of thrombin occupies an equivalent position as the calcium ion in trypsin. The view is similar to the standard view 
of Figure 1. B: Loop Lys 70-Glu 80 of human a-thrombin (thick connections) superimposed by the equivalent loop of human 
leukocyte elastase (thin connections [Bode et al., 1986}). The guanidinium group of Arg 80 of the elastase occupies a similar 
position as Lys 70Nf of thrombin. 



the static a-thrombin structure (Fig. 14B), is (in the ab- 
sence of shielding proteins such as hirudin) highly suscep- 
tible to chemical modification (Chang » 1989; Church etal., 
1989). Clearly, ease and amount of modification are not 
necessarily reliable indicators for surface accessibility but 
might correlate with other molecular properties. 

In turn, residue Glu 80 is involved in another salt 
bridge/hydrogen bond with the guanidyl group of Arg 
67. Thus, the charged groups of four residues (Arg 67, 
Lys 70, Glu 77, and Glu 80) are cross-connected with one 
another to form a salt-bridge cluster essentially buried in 
the molecule (Fig. 14A), Its contribution to the rigidity 
and integrity of this loop appears considerable (see Ta- 
ble 6). As shown previously for trypsin (Bode, 1979), the 
integrity of this loop structure confers (thermal) stability 
to the whole molecule. Thrombin (like porcine kallikrein 
[Bode et al., 1983) and human leukocyte elastase [Bode 
et al., 1986; Fig. 15B; Table 1 IJ) seems to carry its endog- 
enous cationic group (70Nr) to order the 70-80 loop and 
to maintain a distinct surface contour (Fig. 14A,B); thus, 
these three proteinases share similar 70-80 loops with re- 
spect to conformation and a central basic residue. 

As already discussed (Table 7) and shown further in 
Figure 14B, the slightly notched surface arching over this 
loop has only positively charged amino acid side chains; 
in particular Arg 73, Arg 75, and Arg 77 A are sur- 



rounded exclusively by other positively charged residues 
such as Arg 35, Lys 149E, Lys 81 , Lys 1 10, Lys 109, and 
Lys 36, which are themselves quite distant to more pe- 
ripheral negatively charged residues on the surface. Their 
positive charge is partially compensated by the negative 
charges of Glu 77 and Glu 80 involved in the ionic clus- 
ter beneath the surface (see above and Fig. 14); neverthe- 
less, they give rise to a strong positive field around this 
surface (Fig. 8; Table 7). 

The crystal structure analysis of the thrombin-hirudin 
complex (Rydel et al., 1990) revealed that this positively 
charged thrombin surface patch interacts with the ex- 
tended hirudin tail segment ^ig. 16). Beside these distinct 
electrostatic interactions of hirudin, the complementary 
fit of hydrophobic groups seems to be of particular im- 
portance (see also Stone et al., 1987; Mao et al., 1988; 
Maraganore et al., 1990). There is much evidence that 
this thrombin exosite is also involved in thrombin bind- 
ing to fibrinogen (Hageman &, Scheraga, 1974; Van 
Nispenetal., 1977; Hogg&Blomback, 1978; White etal., 
1981; Hofsteenge & Stone, 1987; Fenton et al., 1988; 
Lundblad et al., 1988; Church et al., 1989; Hogg & Jack- 
son, 1989), to fibrin and its E-domain (Liu et al., 1979; 
Berliner et al., 1985; Kaminski & McDonagh, 1987; Kacz- 
marek & McDonagh, 1988; Vali & Scheraga, 1988), to 
thrombomodulin (Hofsteenge et al., 1986; Hofsteenge & 
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Table 11. Distances between the central cationic group and 
the ligands in the crystal structures of human a-thrombin 
(HTHRh bovine trypsin (BTRY; entry DEBA in the Protein 
Data Bank [Bode <£ Sch wager, 1 975a, b; Bernstein et al., 
J9771 human leukocyte elasiase (HLE) (Bode et aL, I986J, 
and porcine pancreatic kallikrein (KKJ [Bode et aL, I9S3J 



Central cationic group 



Liganding 
group 



HTHR 

Lys 70Nf Arg 75 O 

Glu 77 OE2 

Glu 80 OEI 

BTRY 

Ca^+ Glu 70 OEI 

Asn 72 O 

Val 75 O 

Glu 80 OE(l) 

Sol.7n OH 

Sol 714 OH 

(with connection 
to Glu 77) 

HLE 

Arg 80NE Glu 77 OE 

NH2 Asn 72 O 

Afg75 O 

KK 

Arg 70 guanidyl group 72 O 

(weakly defined) 75 O 

Glu 77 OE 



Distance 

(A) 



2.6 
2,6 
2.7 

2.3 
2.3 
2.3 
2.3 
2.3 
2.3 



3.1 
3.1 
2.9 



Stone; 1987; Preissner et al., 1987; Suzuki et al., 1990), 
and to anionic matrices (see, e.g.. Pent on et al., 1988, 
1989), Very recently, it has been shown that this exosite 
is also involved in interaction with a negatively charged 



peptide segment of the platelet-derived thrombin recep- 
tor (Liu et al.. 1991). 

It has been shown that the above proteins bind com- 
petitively (Hofsteenge et al., 1986), i.e., that they prob- 
ably occupy similar patches of this positively charged 
exosite of thrombin. Segment 30-44 of the fibrinogen 
Aa-chain, which comprises several negatively charged 
residues, seems to be particularly important for binding 
and acceleration of the activation cleavage (BlombMck 
et al., 1977; Van Nispen et al., 1977; Hofsteenge et al., 
1986). This fibrinogen segment exhibits only a very weak 
sequence homology with the hirudin tail segment, how- 
ever. The thrombin-hirudin complex (Fig. 16) can, nev- 
ertheless, serve as a model for the interaction of thrombin 
with fibrinogen. 

The positive field centered at this exosite (Fig. 8) 
spreads far into the extramolecular space; consequently, 
approaching macromolecules (substrates as well as inhib- 
itors) could be preoriented by the combined influence of 
this basic exosite and the negatively charged patch around 
the specificity pocket (see below). Anionic groups of as- 
sociated proteins need not necessarily interact through di- 
rect salt bridges with the cationic thrombin groups; they 
nevertheless experience the effect of this positive field at 
some distance from the thrombin surface. Due to its in- 
teraction with various anionic compounds and protein 
segments, this exosite is usually referred to as the anion- 
binding exosite (Fenton, 1986). This could, however, lead 
to confusion with the other principal basic site described 
below. More appropriate designations would therefore be 
the terms anion-binding exosite 1 or fibrin(ogen) second- 
ary binding exosite (Fenton, 1981), reflecting the impor- 
tance of this exosite in fibrinogen clotting and fibrin 
binding. 

The exposed peptide bond Arg 77A-Asn 78, in the 70- 
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Fig. 16. The fibrinogen secondary binding exosite of thrombin (thin connections) and the carboxy -terminal hirudin segment 
(thick connections) of thrombin-hirudin (Rydcl et al., 1991). displayed together with the Connolly surface of thrombin. The 
view is similar to the standard view of Figure 1 . 




Crystal structure of human thrombin 

80 loop (Fig. 14 A), is particularly susceptible to tryptic 
and to autocatalytic attack (Fenton et al.» 1977; Boissel 
et al., 1984; Chang, 1986; Bezeaud & Guillin, 1988) giv- 
ing rise to /3T-thrombin (Braun et al., 1988). This poly- 
peptide segment is relatively mobile (in particular from 
Thr 74 to He 79; see Fig. 28). Its main-chain conforma- 
tion is quite different from the canonical conformation 
of typical serine proteinase protein inhibitor binding loops 
(Huber & Bode, 1978; Laskowski & Kato, 1980; Read & 
James, 1986; Bode & Huber, 1991) (see Table 10). Hu- 
man /5-thrombin is formed upon its cleavage and follow- 
ing excision of peptide lie 68-Arg 77 A. 

Experimental evidence (No^ et al., 1988) suggests that 
the whole loop structure unfolds upon cleavage, driven 
presumably by favorable interactions of charged residues 
(previously involved in the buried salt cluster) with bulk 
water. Such a disruption of the exosite surface would ex- 
plain the tremendous reduction in clotting activity and 
loss of affinity for fibrin, thrombomodulin, and hirudin, 
observed for /3-r-thrombin (Braun et al., 1988; Hofsteenge 
et al.. 1988) and /3-thrombin (Fenton et al., 1977; Hof- 
steenge & Stone, 1987; Lewis et ah, 1987; Stone et al., 
1987; Bezeaud & Guillin, 1988). Indirect evidence for such 
an unfolding is that peptide bond Arg 67-lle 68 (Fig, 14A) 
is rather susceptible to autocatalytic cleavage (Boissel et al . , 
1984). Being deeply buried, a cleavage of the surface Arg 
77A-Asn 78 bond (see Fig. 14A,B) would be necessary 
prior to a proteolytic attack at this site to give /3-throm- 
bin. As witnessed by the almost unchanged kinetic pa- 
rameters of ^-thrombin toward chromogenic substrates 
this loop and surface disintegration does not seem to af- 
fect the geometry at the active site. 

In the naturally occurring human genetic variant Quick 
I, Arg 67 (see Fig. 12) is replaced by a cysteine residue 
(Henriksen & Mann, 1988). The surface location of the 
Arg 67 guanidyl group (Fig. 14B) and its integration in 
an extended internal salt-bridge cluster in normal human 
a-thrombin suggests that the exosite of this mutant might 
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be disrupted in a similar manner as in /3-thrombin. This 
agrees with the experimental observations of tremendous 
loss in clotting activity but with retention of hydrolytic 
activity toward small synthetic substrates; the consider- 
able reduction of platelet stimulation capability found for 
this mutant is a hint that this exosite is involved in plate- 
let aggregation. 

Insertion loop Leu 144'Gly 150 
(149 insertion loop) 

The polypeptide segment between residues 144 and 150 
is five amino acid residues longer in thrombin than in 
chymotrypsin (see Table 3). The exposed loop (Fig. 17) 
lacks any inter-main-chain hydrogen bonds, but seems to 
be stabilized through hydrogen bonds made with its own 
side chains (Lys 145 and Asn I49B); some hydrophobic 
contacts with a symmetry-related molecule seem to fur- 
ther stabilize this loop in the PPACK-thrombin crystals. 
This loop segment attains a very different conformation 
and overall shape in thrombin-hirudin (Fig. 17) and is of 
variable conformation in the various bovine thrombin 
structures. In the PPACK-thrombin conformation, the 
Trp 148 side chain would collide with parts of the bound 
hirudin molecule (Rydel et al., 1991). The loop confor- 
mation observed in PPACK-thrombin may be enforced 
by crystal or inhibitor contacts. There is considerable ev- 
idence (e.g., from the bovine thrombin crystals) that this 
loop exhibits a significant degree of overall flexibility. 

Three proteolyzed human thrombin variants have been 
found and characterized cleaved in this exposed, flexible 
loop segment (displayed in Fig. 12), namely (1) (-throm- 
bin, resulting from cathepsin G cleavage at Trp 148-Thr 

149 (Brezniak et al., 1990); (2) e-thrombin, generated by 
elastase action on Ala 1 49 A- Asn 149B (Kawabata et al., 
1985; Brower et al., 1987); and (3) 7-thrombin, resulting 
from tryptic or autocatalytic cleavage at Lys 149£-Gly 

150 (Bing et al., 1977; Boissel et al., 1984). 

As this loop is inherently flexible, the first two cleav- 





Fig. 17. Segment Asn 143-Gln 151 and 
Glu 192 of PPACK-lhrombin (thick con- 
nections) superimposed with the same res- 
idues of the thrombin component (thin 
connections) and adjacent segments of 
hirudin (dashed connections) of the 
thrombin-hirudin complex (Rydel et al., 
1991). Thrombin residue Trp 148 is very 
differently located in both thrombin mol> 
ecules. and loop Glu 146-Gly 150 is 
differently arranged. 
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ages should not greatly affect the active-site geometry and 
thus the catalytic activity of thrombin, as demonstrated 
experimentally (Stone et al., 1987; Hofsteenge et al-, 
1988). Nevertheless, an intact loop might confer some ex- 
tra stability to the whole molecule. 

The last of these scissile peptide bonds, at Lys 149E'Gly 
150 (Fig. 17), is somewhat obstructed in PPACK-a- 
thrombin and not readily accessible to attacking protease 
molecules. A primary autolytic cleavage seems possible at 
this site, however (Chang, 1986)» probably due to the in- 
herent mobility of this loop. This scissile peptide bond is 
adjacent to segment Gin 151-Pro 152, which forms (to- 
gether with Glu 39 of the opposing loop) the entrance to 
the nbrin(ogen) secondary binding exosite (see above and 
Fig. 14B). 6-Thrombin, cleaved four bonds before Lys 
I49E-Gly 150 (see Fig. 12), shows only a minor decrease 
in clotting activity compared with a-thrombtn (Hof- 
steenge et al., 1988). The nearly complete lack of clotting 
activity of 7t- and 7-thrombin (Bing et al., 1977; Fenton 
et al., 1977) might therefore be caused mainly by the 
disruption of this exosite due to excision of peptide He 
68-Arg 77A (Hofsteenge et al., 1988), i.e., similar to 0- 
thrombin. 

Peptide segment Thr 147-Ser 153 might be involved in 
thrombomoduhn binding (Suzuki et al., 1990). Other ev- 
idence (Hofsteenge et al., 1986; Preissner et al., 1987; Noe 
et al., 1988; Jakubowski & Owen, 1989; Tsiang et al., 
1990) indicates, however, that the accessibility and integ- 
rity of the 70-80 surface loop and the ribrin(ogen) sec- 
ondary binding exosite are at least also required for 
efficient thrombin interaction with thrombomodulin. It 
has been shown (LeBonniec & Esmon, 1991) that a Glu 
1 92-* Gin substitution in thrombin mimics the catalytic 
switch induced by thrombomodulin. Thus, the thrombin 
interaction site for this endothelial cell-surface protein 
seems to comprise a larger surface area including the 
junction between loop segments 70-80 and 144-151 (Wu 
et al., 1991). 

The putative heparin-binding site 

A thrombin surface patch of positively charged side 
chains (Arg 126, Lys 236, Lys 240. and Arg 93) sur- 
rounded by other basic residues (Arg 101, Arg 233, Arg 
165, Lys 169, Lys 235, Arg 175, Arg 173. Arg 97, patch 
p2 in Table 7), which in turn contact negatively charged 
side chains (see also Tables 5, 6), is shown in Figure 18A. 
These positively charged residues are quite densely packed 
on the surface and are not compensated by negatively 
charged groups below the surface as for the 70-80 loop. 
Each of them therefore gives rise to a very strong posi- 
tive electrostatic field (Fig. 8). In spite of this strong field, 
we see no evidence for fixed solvent anions (Fig. 18B); the 
isolated blobs of electron density within the surface de- 
pression and in the intermolecular (crystal) gap are suf- 
ficiently interpreted as water molecules. This site seems 
rather to be more suited to binding polyanions such as 
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heparin; this glycosaminoglycan of 1^ acidic (sulfato 
and uronic) groups per monosaccharide unit binds to 
thrombin (Griffith et al., 1979) and accelerates its inac- 
tivation by antithrombin III (Bjork & Lindahl, 1982), 
presumably through interactions with alkaline groups of 
thrombin and antithrombin. 

Chemical modification experiments on thrombin re- 
vealed that Lys 169 and Lys 240 (at the edges of this pos- 
itively charged patch [Fig. 18A1) are blocked by bound 
heparin and that their modification renders thrombin in- 
accessible to heparin (Church et al., 1989). Cooperative 
electrostatic interactions of linear heparin molecules (with 
high linear negative charge density) with this positively 
charged region of contiguous basic residues could explain 
the high affinity for binding heparin to thrombin (Heuck 
et al., 1985). Furthermore, this putative heparin-binding 
site exhibits a typical heparin-binding motif (a helical 
strand comprising several basic residues [see Table 3; 
Cardin & Weintraub, 1989]). Some slippage of this 
thrombin helix caused by heparin binding cannot be ex- 
cluded bearing in mind the relatively variable arrange- 
ment of the carboxy-terminal helix in related serine 
proteinase-reactive domains (Bode & Huber, 1986). 

Acceleration of complex formation between anti- 
thrombin III and thrombin by heparin may be due in part 
to a template mechanism (Pomeranz & Owen. 1978; 
Griffith, 1982; Nesheim, 1983) in addition to its effects 
on the antithrombin III structure. If a docking geometry 
to be similar to that known for the "smaller** protein in- 
hibhors (Bode & Huber, 1991) is assumed for antithrom- 
bin III bound to thrombin, the thrombin heparin-binding 
site would be located near the presumed heparin-binding 
site of antithrombin III in antithrombin Ill-thrombin 
complexes (for references, see Huber & Carrell, 1989). It 
is therefore conceivable that one heparin strand of about 
14 monosaccharide units (required for efficient acceler- 
ation of the thrombin-antithrombin III reaction [see 
Huber & Carrell, 1989]) could connect the two compo- 
nents. The Arg 97 and Arg 175 residues, which form a 
positively charged satellite patch (p2c in Table 7; Figs, 7, 
8. 18) to the "north rim" (Bode et al., 1989b) of the ac- 
tive-site canyon could further stabilize such a mediatory 
binding through heparin. 

Similar reaction rates of antithrombin III with or- and 
with 7-thrombin in the presence of heparin support the 
idea that the main heparin recognition sites of antithrom- 
bin Ill-bound thrombin reside outside the fibrin(ogen) 
secondary binding exosite (Stone & Hofsteenge, 1987). 
Other evidence (Olson et aL, 1986; Hogg & Jackson, 
1989; Naski et al., 1990) indicates, however, that heparin 
might bind to both exosites in the absence of antithrom- 
bin III. This highly charged putative hepiarin-binding site 
might also (in cooperation with the ribrin(ogen) second- 
ary binding exosite) mediate thrombin binding to cation 
exchange resins and fibrin (Fenton et al., 1988) and might 
therefore be alternatively called anion-binding exosite 11. 
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Fig. 18. Putative heparin-binding site of human a -thrombin. The view is obtained from the standard view upon an approxi- 
mately 90** rotation around a horizontal axis. A: Surface structure between Lys 169 Qower left comer) and Lys 240 (upper right 
comer), displayed together with the Connolly surface. The site of highest positive electrostatic potential is in the surface depression 
close to His 91. surrounded by Arg 126, Lys 236, Lys 240, Arg 93, Arg 101 . and Arg 233. Crosses represent localized solvent 
molecules. B: Section of the final electron density superimposed with equivalent surface parts of thrombin and a few localized 
solvent molecules (crosses). The depression outside the thrombin surface does not contain aiTpropriate high density, which could 
account for a bound anion. Contour surface at 0.8o. 



Furthermore, the heparin-binding site is close to the 
amino-terminal Thr IH of the A-chain (see Fig. 7); its 
partial shielding by adjacent pro-parts in prothrombin 
(Fenton et al., 1988) or in other partially activated pro- 
thrombin forms still possessing pro-parts (such as meizo- 
thrombin [see Doyle & Mann, 1990]) also seems possible 
therefore without conformational changes in the B-chain 
globule. Chemical modification experiments further sug- 
gest that part of this heparin-binding site (in particular 
Lys 169 and/or Lys 240) might act simultaneously as a 
high-affinity binding site for platelets (White et al., 1981). 

In the natural human variant thrombin Tokushima, Arg 
101 is replaced by a tryptophan residue (Miyata et al., 
1987). The side chain of Arg 101 points away from the 
active-site cleft (Fig. 12) but is involved with the adjacent 



Asp 100 in a surface-located salt bridge (Table 6). Ac- 
cording to modeling experiments, substitution with a 
tryptophan side chain might disturb the '^rim" region and 
would thus primarily affect binding of more extended 
substrates and interactions with cells. An effect on the 
subsequent active-site residue. Asp 102, whose side chain 
points in the opposite direction (toward the active-site 
cleft [see Fig. 1]), cannot be excluded; it would be small, 
however. 

The thrombin RGD site 

Both human and bovine thrombin contain an RGD 
segment (Arg 187-Gly 188- Asp 189) beneath the specific- 
ity pocket (Fig. 19). This segment has recently been sug- 
gested (Fenton, 1988) to play a distinct role as a docking 
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site in platelet binding (e.g., through the platelet-specific 
GPIIb/IIIa integrin receptor [see Charo et ah, 1987]). 
Glenn et al. (1988) showed that a thrombin-derived 23- 
amino acid peptide comprising this Arg-GIy-Asp region 
competes with a-thrombin for binding to the high-affinity 
receptor on fibroblasts, and that it favors the thrombin- 
mediated thymidine incorporation; in contrast, the RGDA 
peptide, which likewise competes with a-thrombin, inhib- 
its the mitogenic effect of a-thrombin. Bar-Shavit et al. 
(1991) demonstrated recently that the endothelial cell ad- 
hesion properties of modified human thrombin could be 
inhibited specifically by a GRGDSP peptide; antithrom- 
bin III inhibited this thrombin adhesion, whereas hirudin 
did not interfere with it. 

In the PPACK-a-lhrombin structure, the side chain of 
Arg 187 runs along the molecular surface; it is exposed 
to solvent with the distal guanidyl group engaged in quite 
strong salt/hydrogen-bond bridges (see Fig. 19; Table 6). 
Neither Gly 188 nor Asp 189 (except for its carboxylate 
group, which points toward the interior of the specific- 
ity pocket [Fig. 19]) are, however, accessible to receptor 
structures of approaching cells or proteins without com- 
plete unfolding of this thrombin site. Such an accessi- 
bility would seem to be required, however, for proper 
recognition of the underlying sequence motif. In partially 
degraded or thermally unfolded thrombin derivatives or 
in zymogenlike (pro)thrombin species (with the Asp 189- 
containing peptide fragment turned inside-out as in chy- 
motrypsinogen, see above) a greater exposure of this 
RGD site is conceivable; this would be in agreement with 
the observation that intact a-thrombin attains endothe- 
lial cell adhesive properties only after exposure to ele- 
vated temperatures (Bar-Shavit et al., 1991). 
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The activC'Site cleft 

The most remarkable feature of the thrombin surface is 
a deep narrow "canyon" (see Fig. 1) containing the cat- 
alytic residues and the adjacent substrate^binding subsites 
(Bode et al., 1989b). The catalytic residues (in particular < 
Ser 195, His 57,' and Asp 102) divide this cleft into two 
halves, corresponding to a fibrinopeptide side ("west") 
and a fibrin side ("east"). This cleft is bordered and 
shaped by the four extending polypeptide segments, Arg 
173-Ile 174, Asp 95-Leu 99, Tyr 60A-Phe 60H, and Phe 
34-Cys 42, on one side (forming the "north rim," Fig. 1) 
and the two loop segments, Gly 216-Cys 220 and Asn 
143-Gln 151, on the opposite side (the "south rim"). The 
side chains of He 174, Tyr 60A. Trp 60D, Lys 60F, Phe 
60H, and of Thr 147, Trp 148, respectively, protrude into 
the active-site cleft, i.e., each rim is lined primarily by hy- 
drophobic groups. The base, in contrast, is covered 
mainly with polar /charged side chains (Glu 217, Glu 192, 
Asn 143, Gin 151, Arg 73) and polar main-chain groups. 

Around the entrance to the specificity pocket (Fig. 20), 
10 negatively charged residues (among them Asp 189, Glu 
192, Asp 221, and Glu 146) are clustered (patch nla in 
Table 7), with Glu 192 (relatively flexible at the base) 
and Asp 189 (at the bottom of the specificity pocket Isee 
Fig. 20]) being the only acidic residues not involved in di- 
rect salt bridges. The latter two, together with Asp 221, 
Glu 18, Asp 186A, and Glu 186B, exert repulsive forces 
on one another and contribute substantially to the rather 
negative field generated at this site (see Fig. 8; Tables 5, 
7). According to our calculations, the catalytic residues 
are only marginally affected by this negative electric po- 
tential (due to their position at the edge of the negative 




Fig. 19. Segment Arg 187-Gly 188-Asp J 89 and adjacent residues of human a-thrombin (thin connections), displayed together 
with the Connolly surface. Most of the internal solvents (crosses), which form an extended solvent cluster, are labeled. Arg 187 
is located at the thrombin surface; Gly 188 and Asp 189 are. however, buried in the interior of the thrombin molecule. The car- 
boxylate group of Asp 189 is directed toward the specificity pocket and in ion pair contact with the guanidyl group of Arg 31 
of PPACK (thick connections). The main-chain angtes of this segment are -138*, 174" (Arg 187); 132*. \S5** (Gly 188); -171*. 
16* (Asp 189). The view is similar to the standard view of Figure 1. 
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Fig. 20. The active-site region of PPACK-thrombin, displayed together with the Connolly surface of thrombin. The throm* 
bin structure is given with thin connections, the PPACK molecule with thick connections, and localized solvent molecules are 
represented by crosses. The 60 insertion loop, which partially occludes the active-site residues and the entrance to the specific* 
ity pocket, is cut off; thus, the active-site residues and Pro 21 and Arg 31 are freely visible. The negatively charged residues ar> 
ranged around the specificity pocket to contribute to the negative patch nia (Table 7) are labeled. Standard view as in Figure 1 . 



charge-patch); besides their mutual interactions, they ex- 
perience mainly the ammonium group of Lys 60F. The 
latter interaction should shift the pK values of Asp 102 
and His 57 to slightly lower values than normal. As de- 
scribed above, the fibrin{ogen) secondary binding exosite 
(at the right-hand side of the active-site cleft [Figs. 1 , 14]) 
in contrast gives rise to a strong positive electrostatic field 
(see Fig. 8). This dipolar charge distribution along the ac- 
tive-site cleft would clearly influence the orientation of 
approaching substrates and inhibitors of large electric 
moments (such as hirudin, see above). 



The thrombin active site and interaction 
with PPACK 

The PPACK molecule, as it is bound to the active- 
site cleft of thrombin, is shown in Figure 21 and Kine- 
mage 6. Its peptidyl moiety juxtaposes the extended 
thrombin segment Ser 214-Glu 217 in a twisted antipar- 
allel manner. The car boxy-terminal Arg 31 carbonyl 
group is part of a tetrahedral hemiketal structure; i.e., it 
is linked covalently to Ser 195 O7 and, via the methylene 
group, to the imidazole group (Ne2) of His 57 of throm- 




Flg. 21. Interaction of the PPACK molecule (thick connections) with the thrombin active site (thin connections), displayed to- 
gether with the Connolly surface of thrombin and some localized solvent molecules (crosses). Most of the hydrogen bonds formed 
between PPACK and thrombin arc shown in addition (dashed lines). The Arg 31 carbonyl group forms a tetrahedral hemike- 
tal structure covalently linked with Ser 19507 and (via the methylene group) with His 57N€2. Standard view as in Figure 1. 
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bin. The side chain of Ser 195 is probably considerably 
rotated to form this covalent bond compared to its loca- 
tion in unliganded thrombin. 

The thrombin specificity pocket is geometrically 
quite similar to that of bovine trypsin (Fig. 22) (Bode &. 
Schwager, 1975a). The r.m.s. deviation of 22 a-carbon 
atoms of the residues and segments lining the specificity 
pocket in PPACK-thrombin is 0.39 A compared with bo- 
vine trypsin; an almost identical deviation is obtained 
upon comparison of PPACK-thrombin with hirudin- 
thrombin. The only thrombin residue whose side chain 
points into the pocket differently is Ala 190, replacing a 
serine residue in trypsin. This exchange renders the 
thrombin pocket slightly less polar but, in particular, 
causes the loss of a hydrogen-bond acceptor/donor (per- 
pendicular to the guanidyl group of Arg 31 [Fig. 22]). Sol- 
vent molecule 305, which becomes buried upon complex 
formation, is conserved in the thrombin pocket (compare 
with trypsin [Bode & Schwager, 1975a]). This solvent 
molecule finds a much more hydrophobic surrounding in 
thrombin than in trypsin, but is nevertheless adequately 
positioned to receive a favorable hydrogen bond from the 
NHl atom of the Arg 31 guanidinium group. In conse- 
quence, the thrombin SI subsite would better accommo- 
date an arginine PI side chain, whereas an inserted lysine 
side chain would fit less tightly in the thrombin pocket, 
due to the greater space and lack of a hydrogen-bond 
partner (residue 190). This is in agreement with experi- 
mental resuhs concerning the relative affinities for argi- 
nine and lysine PI side chains (Liem & Scheraga, 1974; 
Kettner & Shaw, 1979, 1981; Lottenberg et ah, 1983). 

The amidino nitrogen of the P3 residue Arg 31 is 
2.9 A away from Ser 214 O, and in a relatively favorable 
orientation to form a hydrogen bond (Fig. 21). Forma- 
tion of this hydrogen bond is assumed to be an imp>ortant 
step to achieve the tetrahedral transition state in produc- 
tive enzyme-substrate complexes; thus, the stable state 
seen here mimics somewhat a real transition state. 

Quick II (Henriksen & Mann, 1989), another naturally 
occurring dysfunctional thrombin mutant, has the re- 
placement Gly 226 Val 226 (see Fig. 12). Modeling ex- 



periments show that the bulky valine side chain would 
probably disturb the Asp 189 conformation at the bottom 
of the pocket, rather than directly blocking binding of an 
inserted arginine side chain. The pocket of Quick II will 
probably not be as narrowed as in porcine pancreatic 
elastase (Meyer et al., 1988) or human leukocyte elastase 
(Bode et al., 1986). Thus, neither typical trypsin nor elas- 
tase substrates should be cleaved efficiently by this mu- 
tant, in agreement with the experimentally observed lack 
of catalytic and clotting activity (Henriksen & Mann, 
1989). 

Behind Asp 189, the thrombin insertion loop Tyr 
184A-Gly 188 and segment Asp 221 A-Tyr 225 enclose a 
large internal cavity filled with (buried) solvent molecules 
(see Fig. 19; Table 9). This thrombin water cluster is even 
larger than in trypsin, due to the smaller Thr 172 of 
thrombin compared with a tyrosine in trypsin. This clus- 
ter and its linkage to the bulk water have previously been 
suggested as providing a route to expel water molecules 
through the back of the specificity pocket upon insertion 
of a PI side chain of a binding peptide (Meyer et al., 
1988). 

Another more extended, quite hydrophobic pocket lies 
near the entrance to the sjjecificity pocket (Fig. 21), lined 
by He 174, Trp 215. segment 97-99, His 57, Tyr 60A, and 
Trp 60D. This hydrophobic depression is most certainly 
the suggested apolar binding site (Berliner & Shen, 1977). 
Its back part is made up of the same structural elements 
as in trypsin (Trp 215, a shorter segment 97-99, His 57 
[see Fig. 22}); it is in thrombin, however, further screened 
off from bulk water by the mainly hydrophobic Tyr 60A- 
Trp 60D loop on one side and He 174 on the other side 
(to the right and left in Fig, 21, respeaively). The PPACK 
molecule nestles tightly into this depression. The solvent- 
accessible surface of the PPACK-thrombin complex is 
240 A^ smaller than that of thrombin itself; only one- 
third of the PPACK surface (one edge of the benzene ring 
of D-Phe II, and Pro 21 [see Fig. 21)) does not become 
buried upon the complex formation. In the PPACK- 
thrombin complex the pyrrolidine ring of Pro 21 is almost 
fully shielded from bulk water; it is in a completely hy- 
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Fig. 22. Human a-thrombin structure 
around the specificity pocket (medium 
connections) superimposed by the equiv- 
alent residues of unliganded bovine tryp- 
sin (thin connections [Bodeet al., I976J). 
Arg 31 of PPACK is shown with thick 
connections. The view is into the specific- 
ity pocket, i.e., similar to the standard 
view of Figure 1 . Ser 190 of trypsin is re- 
placed in thrombin by an alanine residue- 
Sol 305 in thrombin has an equivalent wa- 
ter molecule (Sol 416) in trypsin. 



Crystal structure of human thrombin 

drophobic environment and seems to be perfectly accom- 
modated through its polyproline II conformation (see 
Fig. 21). This excellent packing in the thrombin S2 cav- 
ity explains the frequent occurrence of proline (and other 
medium-sized hydrophobic) residues at the P2 position of 
naturally occurring thrombin macromolecular substrates. 
In contrast to trypsin (Liem & Scheraga, 1974; Bajusz 
et al., 1978), it further accounts in thrombin for the par- 
ticular reactivity of chloromethylketone inhibitors with 
P2-proline residues (Kettner & Shaw, 1981). Much larger 
residues would not pack favorably into the S2-thrombin 
cavity, and smaller ones (such as glycine residues) would 
not be as well fixed. 

The (substituted) piperidine rings of the benzamidine 
(NAPAP [Sturzebecher et al., 1983]) and of the arginine- 
based (MQPA [Okamoto et al., 1981]) synthetic inhibi- 
tors (Fig. 23) occupy this pocket where they can be tightly 
fixed and sandwiched between the flat side of the His 57 
imidazole ring and the amino-terminal aryl moiety (Bode 
et al., 1990; Turk et al., 1991; Brandstetter et ah, 1992). 
As in the case of PPACK-thrombin, the hydrogen bonds 
made with Gly 216 N and O determine mainly the dock- 
ing position of these inhibitors in the thrombin active site. 

The side chain of the preceding residue of PPACK, 
D-Phe II, fits into the notched, mainly hydrophobic cleft 
made by He 174, Trp 215, segment 97-99, and Tyr 60H 
(Fig. 21). Such an interaction is not only preferred due to 
favorable hydrophobic contacts; presumably, both the 
perpendicular aryl-aryl (edge-on) arrangement (Hurley & 
Petsko, 1985) and the aryl-carbonyl contact with the car- 
bonyl group of Glu 97 A (see Thomas et al., 1982) con- 
tribute significantly to binding strength (Bode et al., 
1990). In natural protein substrates, the variety of amino 
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acids in P3 is great; in general, L-amino acids in P3 po- 
sition would interact with a different thrombin site. The 
P4 side chain of a bound all-L-amino acid polypeptide 
substrate would, however, point toward this hydropho- 
bic cleft; in fact, the beneficial effect of large bulky aro- 
matic residues at P4 connected to a P3 glycine has been 
used to construct suitable thrombin substrates (Svendsen 
et al., 1972). Very recently we have shown that the naph- 
thyl and tosyl moieties of certain benzamidine-derived 
synthetic inhibitors (see Fig, 23; Bode et al., 1990; Turk 
et al., 1991) as well as other aromatic groups in arginine- 
derived synthetic inhibitors (Matsuzaki et al . , 1 989) interact 
in a similar favorable manner with this cleft (Brandstet- 
ter et al., 1992). The phenolic side chain of Tyr 3HI of 
hirudin is also placed in such a manner (Rydel et al., 
1991), and there is now crystallographic evidence (Martin 
et al., 1992; Stubbs et al., 1992) for an equivalent ar- 
rangement of the phenyl moiety of the fibrinogen Aa- 
chain Phe 8 residue upon binding to thrombin similar to 
previous suggestions (Meinwald et al., 1980; Ni et al., 
1989a,b). We have therefore proposed the designation 
aryl-binding site (Bode et al., 1990), distinguishing it from 
the other part of the apolar binding site (the 82 cavity). 

The extraordinary specificity and reactivity of PPACK 
for thrombin (Kettner & Shaw, 1979) can thus easily be 
attributed to the optimal fit of all three residues to their 
respective thrombin subsites (see Fig. 21). In addition, 
each of these PPACK residues seems to exist in a stereo- 
chemicaliy and energetically optimal conformation in the 
thrombin-bound PPACK molecule (see Table 10): both 
the main- and the side-chain moieties of Arg 31 are ar- 
ranged in an extended conformation of favorable energy; 
Pro 21 exhibits a characteristic polyproline II conforma- 




Fig. 23. Tosyl-m-amidinophenylalanyl-piperidine (thick connections), NAPAP (medium connections), and MQPA (thin con- 
nections) bound to ihe active site of human a-thrombin displayed together with the Connolly surface of thrombin (Turk et al., 
1991). The naphthyl/toluene/chinolyl groups of the inhibitors interact with the aryl-binding site of thrombin; the side chains 
of the m- and thep-amidinophenylalanyl residues and of the arginyl residue enter the specificity pocket from slightly differing 
sites; the S2 subsite of thrombin is occupied to different extents by the (partially substituted) piperidine moieties. The view is 
similar to the standard view of Figure I . 
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tion; and D-Phe II (with a Ca-C angle of —131°) repre- 
sents just one of the two optimal conformers preferred 
for steric reasons by a D'ammo acid residue linked to a 
proline. 

The ammonium and the carbonyl group of D-Phe II 
are in favorable hydrogen-bond contacts with Gly 216 
(2.7 A [see Fig. 21 J). In contrast to trypsin, good peptidyl 
aldehyde inhibitors of thrombin require termination at P3 
(Bajusz et al., 1978), probably to allow the undisturbed 
formation of this hydrogen bond; it is clear, however, that 
iV-alkylation of PPACK (such as in A/-methyl-PPACK 
[see Bajusz et al., 1990]) should not impair hydrogen- 
bond formation in solution. In the PPACK-thrombin 
crystal structure, the ammonium group forms two further 
hydrogen bonds with a fixed solvent molecule and with 
Gin 244 of a symmetry-related thrombin molecule. This 
contact might confer stability to this PPACK-thrombin 
crystal form. 

An L-configured Phe II residue at P3 would not be 
able to interact with thrombin in as favorable a manner 
as the D-diastereomer (in particular when linked to a pro- 
line [see Fig. 21 J), Simultaneous hydrogen-bond formation 
through its amino group and favorable aryl interaction 
are mutually exclusive; the best fit for an L-Phe II resi- 
due would seem to be at a Ca-C dihedral angle of about 
—60**, with the phenyl group fitting into the aryl-binding 
site in a slightly different relative orientation. 

Due to lack of direct structural evidence, the primed 
subsites of thrombin cannot be described with a confi- 
dence equivalent to the nonprimed sites. The thrombin 
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complexes of various protein inhibitors (see, e.g.. Fig. 24), 
modeled according to experimental serine proteinase 
complexes, might give a rough idea of the probable inter- 
actions of substrate PI', P2', and P3' residues with throm- 
bin. The Sr subsite of thrombin is (due to the impending 
Lys 60F) limited in size and therefore particularly suited 
for small PI' residues such as glycine or alanine (or even 
serine or aspartate); this would be in agreement with the 
frequent occurrence of small PT residues in natural pro- 
tein substrates (see, e.g., Lottenberg et al., 1983) and with 
thrombin's preference for alanine and glycine over leu- 
cine at this substrate position (Liem & Scheraga, 1974), 
P2' side chains of protein inhibitors would normally in- 
teract with a surface provided by the side chain of resi- 
due 151 of the cognate serine proteinase (see Fig. 24). A 
proline residue is present at the P2' position of the fibrin- 
ogen Aa-chain, and synthetic peptides with a proline at 
P2' have been found to be readily cleaved by thrombin 
(Liem & Scheraga, 1974); in light of the presumed impor- 
tance of the hydrogen bond formed between the P2' am- 
ide nitrogen and the carbonyl group of residue 41 to 
achieve the transition state in productive enzyme-sub- 
strate complexes (Bode & Huber, 1986; Read & James, 
1986), this preference for proline residues (which cannot 
mediate such a hydrogen bond) is remarkable. Inspection 
of the structure shows that this P2' proline could be ac- 
commodated in a hydrophobic pocket (left to Leu 41 
[Fig. 24]) with the following residues of a thrombin- 
bound peptide probably taking a different route than 
the binding loop of canonical protein inhibitors (Stubbs 




Fig. 24. interface of the hypothetical complex of human thrombin with a Kazal-type inhibitor. The primed subsites of human 
thrombin (thin connections) are displayed together with the Connolly surface of thrombin; superimposed are the PPACK molecule 
(dashed connections) and part of the turkey ovomucoid third domain (thick connections) as determined in its complex with human 
leukocyte eiastase (Bode et at., 1986) after fit of the elastase component to thrombin. The inhibitor scissile peptide bond is be- 
tween Leu 181 (PI) and Glu 191 (PI'). The side chain of GIu 191 would collide with the indole ring of Trp 60D of the (partially 
cut-of060 insertion loop; only a shorter side* chain should become accommodated at ST. There is space for a Tyr 201 (P2') 
side chain; the interactions with thrombin would, however, not be optimal. The side chain of Arg 211 (P3') would collide with 
Glu 39; however, other more favorable arrangements can be assumed. The view is similar to the standard view of Figure 1 . 
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et ah, 1992). The relatively beneficial effect of an arginine 
residue at P3' (Liem & Scheraga, 1974) could then be at- 
tributed to ionic interactions with glutamic acid residues 
192 and 39 of thrombin (Fig, 24), in agreement with re- 
cent results obtained with a Glu 39 Arg mutant (Le- 
Bonniec et al., 1991), and that of a valine residue at P4' 
by hydrophobic interactions with Phe 60H. 

Probable interaction with macromolecular 
substrates and inhibitors 

The narrow, canyonlike structure of the thrombin ac- 
tive-site cleft is clearly the main source of the remarkable 
thrombin specificity toward macromolecular substrates 
and inhibitors (Bode et al., 1989b). It restricts access to 
the thrombin catalytic center of potential scissile peptide 
bonds of most macromolecules simply through steric hin- 
drance, thereby discriminating in this way susceptible 
substrates by means of shape and conformation. An at- 
tempt to dock the archetypal inhibitor bovine pancreatic 
trypsin inhibitor (BPTI) to the thrombin active site with 
identical position and orientation as observed for the 
trypsin(ogen) complexes (Fig. 25) (Huber et al., 1974; 
Bode etaL, 1978, 1984; Huber & Bode, 1978) results in 
a collision of residues of the inhibitor-binding region (in 
particular Tyr 351 and segment 371-401) with part of the 
60 insertion loop of thrombin (in particular with the side 
chain of Trp 60D [see Fig. 25]). 

It appears as if this steric hindrance cannot be relieved, 
even upon considerable relative tilting of the inhibitor. 
The inherently flexible Trp 148 loop (left-hand side in 
Fig. 25) could give way; the 60 insertion loop (right-hand 
side, Fig. 25), however, appears so rigid that strong forces 
would be required to allow access of the inhibitor. These 
docking experiments agree with experimental results, 
which reveal extremely low association constants of about 



10' M"* for a-thrombin, but almost 10 times larger val- 
ues for the more open and flexible 7-thrombin (Berliner 
et al,, 1986; Ascenzi et al., 1988). The structure of the 
complexes formed ai very high BPTI concentrations is 
unclear; however, an enforced expulsion of the 60 inser- 
tion loop by BPTI (in a similar manner to when the non- 
complementary binding site of trypsinogen is forced into 
a fitting trypsinlike structure upon BPTI complexaiion 
[see Bode et al., 1978; Huber & Bode, 1978; Bode. 1979]) 
and consequent binding at the expense of free energy can- 
not be ruled out. 

Similar docking experiments with other "small" serine 
proteinase protein inhibitors of known spatial structure 
such as eglin c (Bode et al.. 1987), the squash seed inhib- 
itor CMTM (Bode et al., 1989a), and the turkey ovomu- 
coid third domain (Bode et al., 1986) show similar 
restraints (see Fig. 24). In each case the binding loop res- 
idues on both sides of the scissile peptide bond would 
essentially fit to the corresponding subsites around the 
thrombin catalytic site; however, collision with the throm- 
bin "rims" would occur through structural elements ex- 
tending from the inhibitor core. Of the protein inhibitors 
tested for regard to possible thrombin inhibition so far, 
the Kazal-type inhibitors seem to cause least steric hin- 
drance (in spite of the noninhibiting behavior reported 
for ovomucoid IBerliner et al., 1986]); they therefore 
seem to be the most promising candidates for future pro- 
tein design experiments. 

In the absence of an experimental structure of an intact 
functional serpin inhibitor or of a serpin-serine protein- 
ase complex, one can only speculate on the interaction 
mode between thrombin and antiihrombin III. There are, 
however, several lines of evidence (see Bode & Huber, 
1991) that serpins bind in an essentially similar manner 
as the "small" protein inhibitors to their cognate enzymes. 




Pig. 25. Pan or the liypoiheiical docking complex formed between human a-thrombin (yellow) and BPTI (blue). The BPTI 
docking is as found in the trypsin-Arg IS-BPTI complex (Bode et al., 1984); part of (he trypsin component superimposed on 
thrombin is given in violet color. Only part of the cleft regions of thrombin and trypsin are shown; the full structure of BPTI 
is given; the view is along the active-site cleft of thrombin, which points downward. Clearly visible is the collision of Trp 60D 
with Tyr 351 of BPTI and the overcrowding, which would be expected in the surrounding region. 
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namely through an exposed binding loop of canonical 
conformation (Huber & Bode, 1978; Laskowski & Kato, 
1980; Huber & Carrell, 1989); furthermore, the interac- 
tion of this loop with thrombin seems to be restricted es- 
sentially to the immediate environment of the active-site 
residues (Chang et al.. 1979; Hofsteenge et al., 1988). 
Thrombin obviously imposes particular restrictions on 
the shape of the serpin-binding region, which must be 
flat. The core of these serpin inhibitors (Lobermann et al., 
1984; Baumann et a]., 1991) is much larger than those of 
the "smair Inhibitors; therefore, more intimate molecu- 
lar interactions with rim elements of the thrombin active- 
site canyon are to be expected. 

Similar geometric requirements must be met by coag- 
ulation factors susceptible to thrombin action. The 
thrombin interaction with fibrinogen presumably does 
not follow this general interaction scheme, however. Re- 
cent crystallographic studies (Martin et al., 1992; Stubbs 
et al., 1992) reveal that segment Phe 8-Arg 16 of the fi- 
brinopeptide A binds in a multiturnlike conformation to 
the hydrophobic cleft area of the thrombin active-site re- 
gion (as suggested earlier by Scheraga and coworkers). 
With regard to the Aa-chain region carboxy-terminal of 
the scissile peptide bond Arg 16-GIy 17, one can only 
speculate that segment 30-44 (with several acidic residues) 
might bind to the fibrin(ogen) secondary binding exosite 
(Fig. 14) in a similar manner as seen for the bound hiru- 
din tail (Hofsteenge et al., 1988). 

Hirudin, however, interacts with thrombin in a quite 
different manner (Griitter et al., 1990; Rydel et al., 1990) 
to the canonically binding protein inhibitors (Fig. 26; see 
also Bode & Huber, 1991). It binds in a complementary 
manner with its large amino-terminal domain to throm- 
bin-specific sites outside the enzyme's active site; only the 
(originally flexible {Folkers et al., 1989; Haruyama & 
Wuthrich, 19891) amino-terminal segment comprising the 
first three hirudin residues becomes fixed under forma- 
tion of a parallel 0-pleated sheet (in contrast to PPACK 
[Fig. 21] and the canonical inhibitors). In addition, it as- 
sociates with part of the active-site cleft and the positively 
charged fibrin(ogen) secondary binding exosite surface 
(Fig. 14) under formation of several surface-located salt 



bridges and hydrophobic contacts. Thus, the thrombin- 
hirudin interaction gains its tremendous specificity pri- 
marily through interactions with thrombin sites not 
common to other serine proteinases; due to the partial oc- 
clusion of substrate-binding sites and shielding of the ac- 
tive-site residues (not themselves directly blocked [Rydel 
et al., 1991]), both macromolecular and small substrates 

are denied access to the thrombin active site. 
• 

Materials and methods 

Data collection and processing 

Highly purified human a-thrombin was a kind gift of 
Drs. Jan Hofsteenge and S.R. Stone, Friedrich-Miescher- 
Institut, Basel. PPACK was purchased from Calbiochem 
or Bachem. Formation of the PPACK-human a-throm- 
bin complex and crystallization in polyethylene glycol 
6000/phosphate buffer, pH 7, were described previously 
(Bode et aL, 1989b). The crystals belong to the ortho- 
rhombic space group P2|2|2i and contain one molecule 
per asymmetric unit with cell constants a == 87.74 A, b = 
67.81 A, and c = 61.07 A. Larger crystals initially show 
reflections to a minimal Bragg spacing of beyond 1 .9 A 
resolution. The crystals were transferred to a mother liq- 
uor of 30<7o polyethylene glycol 6000, buffered with 0.2 M 
sodium/potassium phosphate to pH 7,0 for data col- 
lection. 

At first, a particularly large, but macroscopically 
twinned crystal obtained by microdialysis was cut into 
four parts; two of them were used initially for data col- 
leaion on a rotation camera (see below). Only one of 
these crystal fragments (of size 0.20 x 0,4 x 0.6 mmj*) 
yielded completely unsplit reflections (initially to 1.9 A) 
and was transferred to a FAST television area detector 
diffractometer (Enraf-Nonius) using Ni-filtered CuKof ra- 
diation, a crystaJ-to-detector distance of 40 mm» and a 
detector angle of 10°. An X-ray data set to 2.45 A reso- 
lution (Table 12) was collected upon rotation of the crys- 
tal fragment of 92** about its b axis and of 30° about an 
axis inclined by 30** to b. Capillary and crystal were kept 
in a cold air stream at 3 The program package 
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Fig. 26. a-carbon structures of hirudin 
(bold connections) and thrombin (medi' 
ate connections) (Rydel et al.» 1991), 
superimposed with BPTI (thin connec- 
tions) as it binds to trypsin in the tryp- 
sin-Arg 15-BPTI complex (Bode ct aL, 
1984). The PI residue of Arg 15-BPTl, 
Arg 151, is shown with dashed connec- 



Crystal structure of human thrombin 
Table 12. Intensity data statistics 



A. Fast daia 



Number of evaluated reflections 






/? merge* 




u. luy 


Numoer ot rnedei mates 






Ac merge" oi rneQei mates 






Number of indepenaent reflections 






Maximal resolution 






M easu red/ ex pectca re i leciions 
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Photographic films 






Number of evaluated reflections 




26,009 


/? merge* 




0.071 


Number of independent reflections 




14.105 


Maximal resolution 




1.92 A 


Combined data 






Number of evaluated reflections 




66»592 


Number of independent reflections 




16,910 


Maximal resolution 




1.92 A 


l^merge after rejection^ 




0.101 


Completeness 






Measured/expected reflections 


316 to 2.45 A 


0.87 




2.55 to 2.45 A 


0.70 


Measured/expected reflections 


316 to 2. 2 A 


0.77 




2.25 to 2.2 A 


0.42 


Measured/expected reflections 


316 to 1.92 A 


0.65 




2.05 to 1.92 A 


0.16 



"/Emerge = i:/iLi[/(f, A) - </(A)>l/S /i </(/»)>. where /(/, h) is 
the intensity of reflection k observed for the /th source, and </(/>)> is 
the mean intensity of reflection h for all measurements of I{h). 

^ Individual measurements were rejected if their relative deviation 
from the mean value (/- </>/</» exceeded the threshold limit of 0.6. 



MADNES (Messerschmidt & Pflugrath, 1987) was used 
to determine and refine the crystal setting parameters, 
crystal cell constants, and camera parameters and to eval- 
uate the reflection data on-line. All intensities with three 
contiguous pixels greater than 3.5a above background 
were assigned as observed intensities. These data were 
corrected for absorption effects using equivalent reflec- 
tions and a nonuniformity table based on previously col- 
lected data to determine the absorption ellipsoid of the 
crystal (Messerschmidt et aL, 1990). The Friedel-mates 
(Table 12) were loaded, scaled, and merged using the pro- 
gram system PROTEIN (Steigemann, 1974). Number, 
quality, and completeness of these FAST data are shown 
in Table 12. 

Additional high-resolution X-ray data up to 1.92 A res- 
olution (Table 12) were collected on a rotation camera 
(Huber, Rimsting) in the cold room at 4 ®C using graphite 
monochromatized CuKot radiation. For data collection 
the large crystal later used for FAST data collection (see 
above), another slightly disordered "twin "-brother, and 
three additional, smaller crystals (of approximate size 
0.05 X 0.2 X 0.5 mm^) were rotated about 15** each 
around their b axes. In this way the whole reciprocal 
space was systematically surveyed. Due to radiation dam- 
age the maximal resolution of the larger crystals de- 
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creased to about 2.2 A upon X-ray exposure according to 
rotation photographs. 

The films were scanned on an Optronics rotating-dnun 
film scanner and evaluated with the program FILME as 
modified by W.S. Bennett, Jr. Reflections with inten- 
sity >1.0(T were further processed. Some of the film data 
obtained from the smaller vapor diffusion-made crystals 
fitted poorly to the FAST data; in spite of individual 
/?sym values as low as 0.06, some of them exhibited 
/?merge values (defined as Sd/ - </>l)/S|/|) above 
0.20. Film data from one of these crystals and from all 
films with /?merge values >0.20 were therefore omitted 
from the final merged data set. Table 12 gives only the in- 
tensity statistics for accepted camera data, which repre- 
sent about 70^0 of the asymmetric unit of reciprocal 
space. 

The combined and scaled data set comprises 16,910 in- 
dependent X-ray data. Completeness is almost full down 
to 2.45 A and gradually decreases with increasing reso- 
lution to 1.92 A (Table 12). 

Search for orientation and position 

Figure 27 gives a schematic arrangement of the final 
thrombin model (see also Fig. 3). Highlighted are those 
amino acid residues that were selected to form the trun- 
cated model used for the search in the PPACK-thrombin 
crystals. Due to the anticipated closest topological simi- 
larity with thrombin, the polypeptide backbone of the bo- 
vine chymotrypsin model (Brookhaven Data Bank 
[Bernstein et al., 1977] codes 2CGA [Cohen et al., 1981], 
4CHA [Tsukada & Blow, 1985], 5CHA [Blevins & 
Tulinsky, 1985]) was used as a guiding structure. Some 
structural features of bovine trypsin (Bode & Schwager, 
1975a; Bode et al., 1976) expected to be closer at a few 
sites were incorporated into the truncated model as well. 
Ninety-seven amino acid residues of human a-thrombin 
(emphasized by filled circles in Fig. 27) were assumed to 
be identical or almost identical to residues in chymotryp- 
sin and were used in their entirety; another 105 residues 
(indicated by bold circles) were considered to be topolog- 
ically equivalent to either chymotrypsin or trypsin and 
were included only with their main-chain and side-chain 
atoms up to C|3. No efforts were made to construct a 
contiguous energy-minimized search model. 

In addition, the truncated model contained also the 
two last residues of PPACK including the hemiketal 
group (modeled according to the peptidyl chloromethyl- 
ketone moiety of inhibited human leukocyte elastase [Wei 
et al., 1988]) and 10 conserved internal solvent molecules. 
The truncated model used for subsequent searches con- 
sisted of 1,352 active atoms, corresponding to slightly 
more than half of all atoms of the final peptidyl struc- 
ture. This residue selection corresponds approximately to 
the structurally conserved regions (SCR, CR) defined by 
Greer (1981) and Bing et al. (1986). 
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Fig. 27. Schematic representation of the truncated model used for search; the thrombin chain segments are arranged as found 
for the final thrombin molecule (see Fig. 3). The backbone atoms plus C/3 of 202 residues of bovine chymotrypsin (in a few cases: 
of bovine trypsin, see text) assumed to be topologically equivalent to thrombin (emphasized by thick circles) were included; of 
these 202 residues, 97 given by filled circles were taken with full size. 



The orientation of the thrombin B-chain in the throm- 
bin crystals was determined by Patterson search methods. 
The truncated model was placed in a triclinic cell with or- 
thogonal cell axes of length 120 A. A model Patterson 
map was calculated with data between 8 and 3.5 A using 
a B value of 20 A^; grid points representing the 2,463 
highest function values within a radial shell of 3 and 15 A 
were selected. A crystal Patterson synthesis was calcu- 
lated using reflections between 8 and 3.5 A resolution on 
a 1-A grid. The rotational search was carried out by cal- 
culating the product correlation function of the crystal 
and the modified model Patterson map as a function of 
the orientational angles ^i, ^3 (defined as subsequent 
rotations around z, the new x [x'J and the new z [z" axis]) 
in 5** steps in the angle range 0, = 0-90**, ©2 = O-ISO"", 
^3 = 0-180** (with the PROTEIN program system of 



Steigemann [19741). The rotation function showed a 
prominent peak of height 101,2 arbitrary units (corre- 
sponding to 5.6a above the mean value of 70.5) at 
10V125V90'* with second and third peak heights of 92.7 
and 92.6 (4. 1 and 4, 1 a), respectively. A fine scan in 0.2** 
intervals increased the value of the highest peak to 103.2 
units, yielding an optimal orientation value. 

The location of the thrombin B-chain with respect to 
the crystallographic symmetry elements was determined 
using the translation function and programs written by 
E.E. Lattmann and modified by J. Deisenhofer and R. 
Huber. An 8- to 3-A model transform was calculated for 
the truncated search model oriented according to the ro- 
tation function solution and placed with its center of 
gravity at the origin of a triclinic cell with orthogonal cell 
axes of length 120 A. Only one of the three cross-vector 
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sets yielded the highest peak on a Marker section; all three 
highest Marker section peaks yielded, however, a consis- 
tent solution (Table 13). The crystallographic -value 
(defined as S (l^obs - Fcalc|)/S I Fobs] ) of the appro- 
priately oriented and positioned truncated model was 
0.53 for 8- to 3-A data. 

The orientation and position parameters of this model 
were further refined with the Fourier transform fitting 
program TRAREF (Muber & Schneider » 1985) resulting in 
rotational and positional shifts of about 3° and 0.25 A. 
The /?-value of this model was 0.493 for 8- to 3-A data. 

Model building and crystallographic refinement 

A 2Fobs — Fcalc Fourier map was calculated with this 
model by means of PROTEIN using intensity data from 



Table 13. Position and height (in arbitrary units) 
of highest peaks in the translation function 









Peak position 




Vector 


Peak 


Standard 








Marker 


set 


height 


deviation 


f//80 


i;/70 


w/60 


section 


I -*2 


382.1 




40.3 


55.1 


30.0 






374.5 


58.4 


8.3 


2.3 


0.0 


w = 30 




373.9 




9.6 


50.3 


0.0 




1 -*3 


683.1 




0.0 


4.4 


5.1 






664.4 


102.3 


40.0 


20.1 


50.8 


t# = 40 




561.5 




0.0 


10.9 


51.1 




1 -4 


536.6 




5.7 


0.0 


3.5 






526.6 


78.8 


0.0 


35.0 


20.3 


i; = 35 




430.5 




5.3 


0.0 


59.0 
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8 to 3 A and Sim- weighted model phases. This map dis- 
played most parts of the core-forming segments and of 
the PFACK moiety; in addition it showed electron den- 
sity for several omitted side chains (such as Phe II), for 
some insertion loops (such as loop 202-208), and for a 
short A -chain segment following Cys 1-Gly 2. These de- 
fined structural parts were modeled into density using the 
PSFRODO version (Pflugrath et al., 1984) of FRODO 
(Jones, 1978) on a PS390 display system (Evans & Suther- 
land). This new, more complete model had a decreased 
/f-value of 0.44. 

This model was subjected to 12 macrocycles of mod- 
eling and cycles of crystallographic refinement (Table 14). 
Most refinement was carried out using the energy- 
restrained least-squares program EREF (Jack & Levitt, 
1978). Each EREF macrocycle started with inspection of 
Fourier and difference Fourier (and sometimes "omit") 
maps at the display with appropriate model manipula- 
tions and additions, followed by 5-9 refinement cycles, 
each consisting of calculation of structure factors and of 
a Sim-weighted difference Fourier (with PROTEIN), cal- 
culation of the normal equations from the difference 
Fourier (DERIV [Jack & Levitt, 1978]), and subsequent 
least-squares refinement of positional parameters with 
energy restraints (EREF [Jack & Levitt, 1978]). In be- 
tween, thermal parameters were also refined. Most stan- 
dard geometries were taken from Levitt (1974); a few 
improper angles were introduced to keep carboxylate and 
carboxamide groups planar; some peptide bond param- 
eters were updated, and the proline parameters were 
changed to allow ring puckering (see Schirmer et al., 
1987). 

The carboxy-terminal arginine methyl residue of 
PPACK (Arg 31) was constructed from an EREF stan- 



Table 14. Course of crystallographic refinement 
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0.493 


0 


1.352 




8 to 3 






0.444 


1 


1.531 


EREF/DERIV 


8 to 3 


-488 


0.014 


0.345 


2 


1.765 


EREF/DERIV 


8 to 2.5 


-320 


0.014 


0.329 


3 


1.889 


EREF/DERIV 


8 to 2.5 


-291 


0.015 


0.289 


4 


2,070 


EREF/DERIV 


6 to 2.5 


-420 


0.014 


0.251 


5 


2.205 


EREF/DERIV 


6 to 2.5 


-472 


0.016 


0.234 


6 


2.331 


EREF/DERIV 


6 to 2.2 


-752 


0.016 


0.234 


7 


2.365 


EREF/DERIV 


6 to 1 .92 


-1,056 


0.018 


0.200 


8 


2.391 


EREF/DERIV 


6 to 1.92 


-1,362 


0.018 


0.178 


9 


2,589 


EREF/DERIV 


6 to 1.92 


-1,298 


0.018 


0.173 


10 


2,650 


EREF/DERIV 


6 to 1.92 


- 1 ,227 


0.019 


0,171 


11 


2.833 


X-PLOR 


8 to 1.92 


-2,857 


0.013 


0,174 


i2a 


2.819 


X-PLOR 


8 to 1.92 


-3.180 


0.013 


0.156 


12b 


2.833 


EREF 


8 to 1.92 


-1,442 


0.018 


0.159 
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dard arginine residue by adding a methylene group and 
enforcing a tetrahedral geometry at the carbon yl carbon 
as done for the complex of human leukocyte elastase with 
a peptidyl chloromethylketone (Wei et al., 1988). All non- 
bonded restraint parameters for Ser 195 O7 and for the 
Arg 31 methylene group were set to zero to allow their un- 
biased approach to the Arg 31 carbonyl group and to the 
His 57 Ne2 atom, respectively. At later stages of refine- 
ment, solvent molecules interpreted as water molecules 
were inserted at stereochemically reasonable positions if 
the difference electron density exceeded 4a (using PRO- 
TEIN routines). 

The graphic program MAIN (D. Turk) was employed in 
macrocycle 1 1 in combination with the conjugate gradient 
minimization procedure of X-PLOR version 1 .5 (Brunger 
et al.. 1987). Crystallographic refinement with X-PLOR 
involved 50 cycles of positional refinement followed by 
50 steps of B-value optimization. New solvent molecules 
were searched for by means of MAIN routines that checked 
the difference Fourier map for peaks greater than 2.5a 
and inserted waters (closest to the original molecule) if lo- 
cated within hydrogen-bond distance (1.8-3.5 A) of po- 
lar protein atoms or of other solvent molecules; current 
solvent molecules were checked for height of density in 
the Fourier map and were deleted if positioned in density 
less than 0.8a above the mean value. The X-PLOR force- 
field parameters given in PARAM19X.PRO (which treats 
only polar hydrogens explicitly) were used for restrained 
crystallographic refinement; all side-chain charges were 
set to zero (as recommended in the X-PLOR manual), 
and all histidine residues were treated as singly proton- 
ated, with His 57 carrying a proton at N61 and all other 
histidines at N62. 

PPACK residue Arg 31 was modeled according to an 
X-PLOR standard arginine residue; additional patch res- 
idues were built to allow covalent linkages to the preced- 
ing residue (Pro 21), via the carbonyl carbon (CF) to 
Ser 195 O7 and via its methylene carbon (CJ) to His 57 
Nc2, respectively. The hemiketal group of Arg 31 was re- 
strained to an ideal tetrahedral state (with all angles around 
the carbonyl carbon [CFl being 109.5**); the lengths of 
both CF-O bonds (from the carbonyl carbon to its oxy- 
gen and to Ser 195 O7) were restrained to 1.42 A (as 
C-O single bonds), and the bond length between the 
methylene carbon and His 57 NeZ to 1.49 A (taken from 
X-PLOR parameters). 

The course of the crystallographic refinement is shown 
in Table 14. The density calculated with the first, already 
improved model displayed spurious density for most of 
the insertion loops, which only from macrocycle 3 on be- 
came clear enough to be modeled. From macrocycle 4 on, 
the full B-chain was present; several peptide segments, in 
particular some insertion loops and the carboxy-terminus, 
remained subject to changes until the very end of refine- 
ment. From macrocycle 6 on, additional water molecules 
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were localized and included in refinement. At macrocy- 
cle 8 the A-chain was clearly traceable from Asp lA to 
Tyr 14J; the preceding residues Ala IB and Glu IC were 
modeled and refined with relatively bad conformations; 
handicapped by several breaks in density, the first five 
residues, Thr IH to Gly ID, were tentatively placed along 
a shallow groove along the B-chain surface. At this stage 
no continuous density was visible for the last four resi- 
dues. He 14K-Arg 15, of the A-chain. 

The model resulting from macrocycle 10 was that de- 
scribed recently (Bode et al., 1989b). It comprised 290 de- 
fined residues (lacking the four carboxy-terminal residues 
of the A-chain and the carboxy-terminal Glu 247 of the 
B-chain) with some side chains remaining partially unde- 
fined, and 291 solvent molecules, and had an /?-value of 
0.171 (6-l,92-A data). 

Upon further inspection of the maps it became clear 
that some hitherto undefined side chains could be local- 
ized and furthermore that Lys 110 (with a very mobile 
side chain) had to be rotated and that the density "tongue" 
extending away from the main chain and being formerly 
interpreted as the Lys 1 10 side chain should rather ac- 
commodate its carbonyl group; as a consequence, the 
peptide bond formed with the succeeding prolyl residue. 
Pro 111, should (contrary to our previous suggestion) be 
in a trans rather than in a cis conformation. Thus, only 
one of the proline residues of human thrombin (Pro 37) 
is in a cis conformation, in full agreement with the struc- 
tures of the thrombin-hirudin complex (Rydel et al., 
1991). In addition, a slightly different pathway for the 
A-chain segment preceding Glu IC was found; as a re- 
sult, the peptide segment from Thr IH to Glu IC is still 
arranged in a multiple-turn conformation along the same 
B-chain surface as before, but is now slightly better de- 
fined by density with a more reasonable stereochemistry. 
Likewise, some faint but (at very low contouring) contin- 
uous density allowed the placement of the carboxy-termi- 
nal four A-chain residues and Glu 247 of the B-chain vnlh 
more confidence and to include all side-chain atoms as- 
signed before as dummy atoms in phase calculation. 

The /?-value of this partially revised thrombin model 
was 0.193. A subsequent restrained refinement with X- 
PLOR (macrocycle 1 1 [Table 14)) using the default bond 
and angle B-value restraints brought it down to 0.174. In 
a final unrestrained B-value refinement with X-PLOR 
(macrocycle 12a) or EREF (macrocycle 12b) this model 
converged (without rejection of any bad reflections) to fi- 
nal /^-values of 0.156 and 0.159, respectively. The final 
parameters of both models obtained from X-PLOR and 
EREF refinement are given in Table 15. The models are 
virtually identical (with an r.m.s. deviation of 0,07 A be- 
tween all 295 a-carbon atoms, i.e., well below the esti- 
mated accuracy of each model); the small differences 
result from different target and restraint parameters in 
both procedures. ■ 
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Table 15. Final model parameters of PPACK-thrombin 



EREF 



X-PLOR 



Number of active thrombin atoms 
Number of active PPACK atoms 
Number of active solvent atoms 

r.m.s. standard deviation from target values 

Bond lengths 

Bond angles 
Total energy 
Resolution range 

Number of reflections used in refinement 

R value 
Mean B value 
Whole data set 

Thrombin nonhydrogen atoms 
PPACK nonhydrogen atoms 
Solvent molecules 

Estimated mean coordinate error from agreement between 
observed and calculated structure factor amplitudes 

According to Luzzati (1952) 

According to Cruickshank (1949) 



2,380 
30 



423 

0,018 A 
3.085° 
1.442 kcal/mol 



0.159 



8 to 1.92 A 
15,019 



26.8 A^ 

22.9 A^ 
8.4 

45.3 A2 



0.23 A 
0.11 A 



409 

0.013 A 
3.137" 
-3.733 kcal/mol 



0.156 

29.9 A^ 
26.8 A^ 
10.2 A^ 
49.7 A^ 



0.22 A 
0.11 A 



The B-values for each thrombin residue, averaged over 
ail nonhydrogen atoms, are shown in Figure 28 A; the 
mean electron density per residue is given in Figure 28B; 
with the exception of peptide bond 14L-14M (<0.6a) and 
IH-IG and IF-IE (<0.8a) the main chain is everywhere 
represented by continuous electron density above la. In 
general, the oscillations reflect the alternate arrangement 
on the surface and in the interior of the molecule; partic- 
ularly high B*values are observed for both A-chain ter- 
mini, for the exposed B-chain loops 70-80 and 204A-205, 
and for the carboxy-terminus of the B-chain. 

A very few noninterpreted maxima and minima re- 



main in the solvent region of the final difference Fourier 
map with heights up to 5a and -3a, respectively; no new 
solvent molecules could be placed at stereochemically rea- 
sonable positions, however. According to the Ramachan- 
dran plot (Fig. 29) almost all main-chain angles of the 
thrombin model are within the allowed regions defined by 
Ramakrishnan and Ramachandran (1965). The angle 
pairs outside belong to the weakly defined amino- and 
carboxy-lermini of the thrombin A-chain; they are la- 
beled in Figure 29 and presumably reflect multiple con- 
formations of the respective residues. 
For such energy-restrained least-squares procedures it 
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Fig. 28. A: Temperature factors of the final thrombin model averaged over all nonhydrogen atoms of each residue. B: Mean 
density per residue (in a) of the final 2Fobs — Fcalc electron density map averaged over the main-chain atoms of each residue. 
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Fig. 29. Ramachandran plot of the refined thrombin model. O, glycyl 
residues: X, all other residues. 



is difficult to define an adequate standard deviation for 
positional parameters of the atoms. From the scattering 
angle dependence of the /?>values (Luzzati, 1952) the up- 
per limit of the mean positional error of the refined 
model is estimated to about 0.21 A (Fig. 30; Table 15). 
The formula derived by Cruickshank (1949) gives a lower 
limit of 0.11 A for the accuracy of the final structure. 




0.600 



[1/d] 



Fig. 30. Luzzati plot of the final thrombin model after X-PLOR 
reflnement. 



Hydrogen-bond definition, alignment procedures, 
and electrostatic calculations 

Accessible surface calculations and identification of pos- 
sible hydrogen bonds were carried out by means of the 
program HYBAC (Levitt, 1974). Possible hydrogen-bond 
donor-acceptor interactions were considered as hydrogen 
bonds if the interaction energy was E < —0.7 kcal/mol 
calculated with DSSP according to Kabsch and Sander 
(1983). Interface areas were calculated with a program 
of T.J. Richmond (see Richards, 1977). Most of the plots 
were made using the graphics facilities of FRODO» 
MAIN (D. Turk), and a RIBBON version (Priestle, 1988) 
modified by one of us (A.K,) 

Optimal superposition of related proteinases and de- 
termination of the number of topologically equivalent 
a-carbon atoms was achieved with the program O VRLAP 
(Rossmann & Argos, 1975) as modified by W.S. Bennett; 
the following selection parameters were used: Pcut = 0.1; 
El = £2 = 2.0; run length >6. 

The amino acid sequences were aligned using the pro- 
gram system ALIGN (Needleman & Wunsch, 1970). In 
single runs only pairs of sequences were compared; the 
Mutation Data Matrix (250 PAMS) was used as scoring 
matrix; a penalty of 20 was assigned to each break, and 
100 random runs were performed for each comparison. 
Protein sequence searches were done against the MIPSX 
merged Protein Sequence Data Base, Release 25.0, June 
30, 1990 (Martinsried Institute for Protein Sequences). 

The electrostatic calculations were performed on the 
basis of the modified Tanford-Kirkwood theory (Tan- 
ford & Kirkwood, 1957; Karshikov et al., 1989) as well 
as of the finite-difference algorithm developed by Gilson 
et al. (1985) and Klapper et ah (1986). In both methods 
the charges carried by the titratable groups and the pep- 
tide dipoles are considered. The point charges were attrib- 
uted (1) to those between NHl and 2 of the arginine 
guanidyl groups, (2) to Nf of lysine residues, (3) to the 
center of imidazole rings, and (4) to those between both 
carboxylate oxygen atoms of aspartates and glutamates; 
orientation and size of the peptide dipoles were taken as 
described by Karshikov et al. (1989). The solvent acces- 
sibilities of the charged groups in the protein structures 
defined as the accessibility ratio to that of a fully ex- 
tended Ala-Xaa-Ala peptide were calculated according to 
Lee and Richards (1981). Dielectric constants of 4 and 78 
were used for the space inside and outside of the protein. 
All calculations were performed for an ionic strength of 
0.125 and a pH value of 7.0. In the case of the Tanford- 
Kirkwood model the intrinsic pK values were taken from 
Matthew (1985). 

Both methods yielded quite consistent results regard- 
ing the shape of the electric potential; the contouring sur- 
faces shown in Figure 8 and the electrostatic interaction 
energies presented in this paper are those derived by 
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means of the Tanford-Kirkwood model. In the case of 
the finite-difference method, the calculations were started 
with coulombic boundary conditions followed by two 
steps of focusing calculations. The electrostatic equipo- 
tential surfaces were programmed (A.K.) to be displayed 
on a PS system using the FRODO graphics facilities. 
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abstract: Previous alanine scanning mutagenesis of thrombin revealed that substitution of residues W50, 
K52, E229, and R233 (W60d, K60f, E217, and R221 in chymotrypsinogen numbering) with alanine altered 
the substrate specificity of thrombin to favor the anticoagulant substrate protein C. Saturation mutagenesis, 
in which residues W50, K52, E229, and R233 were each substituted with all 19 naturally occurring amino 
acids, resulted in the identification of a single mutation, E229K, that shifted the substrate specificity of 
thrombin by 130-fold to favor the activation of the anticoagulant substi'ate protein C over the procoagulant 
substrate fibrinogen. E229K thrombin was also less effective in activating platelets (1 8-fold), was resistant 
to inhibition by antithrombin III (33-fold and 22-foId in the presence and absence of heparin), and displayed 
a prolonged half-life in plasma in vitro (26-fold). Thus E229K thrombin displayed an optimal phenotype 
to function as a potent and specific activator of endogenous protein C and as an anticoagulant in vivo. 
Upon infusion in Cynomolgus monkeys E229K thrombin caused an anticoagulant effect through the 
activation of endogenous protein C without coincidentally stimulating fibrinogen clotting and platelet 
activation as observed with wild-type thrombin. In addition, E229K thrombin displayed enhanced potency 
in vivo relative to the prototype protein C activator E229A thrombin. This enhanced potency may be 
attributable to decreased clearance by antithrombin III, the principal physiological inhibitor of thrombin. 



Activation of the multienzyme blood coagulation cascade 
results in the conversion of the inactive ^mogen prothrom- 
bin into active thrombin (Rosenberg, 1987; Jackson & 
Nemerson, 1980). Thrombin is a multifunctional serine 
protease that recognizes and proteolytically activates multiple 
substrates (Mann & Lundblad, 1987; Esmon 1987) that have 
both procoagulant and anticoagulant functions. Through 
precise regulation of its substrate specificity thrombin may 
function as a promoter of blood coagulation, mediating both 
fibrin clot formation and platelet aggregation, or alternatively 
tiirombin may act as a regulator of coagulation by activating 
a natural anticoagulant mechanism, the protein C (PC) 
pathway. 

At sites of vascular injury or inflammation, thrombin 
recognizes two substrates with key procoagulant functions. 
The hydrolysis of fibrinogen results in the formation of an 
insoluble fibrin clot, and the cleavage of the platelet thrombin 
receptor results in the generation of a tethered ligand that 
promotes platelet activation and aggregation (Vu et al., 1991). 
Additional procoagulant substrates include factor XIII (Lo- 
rand & Radek, 1992), which stabilizes the clot by cross- 
linking fibrin monomers, and factors V, VIII (Mann et al., 
1988), and XI (Gailani & Broze, 1991) that perpetuate the 
clotting stimulus and the generation of new thrombin through 
feedback activation of the coagulation cascade. Thrombin 
that escapes the site of an injury becomes bound to the 
cofactor thrombomodulin (TM), a transmembrane protein 
expressed on endothelial cells that line the vasculature. Upon 
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binding to TM the activity of thrombin toward the proco- 
agulant substrates is blocked, and a proposed Ca^~''-dependent 
conformational change enhances the recognition and activa- 
tion of the anticoagulant substrate PC. Activated protein C 
(aPC) cleaves and inactivates activated coagulation factors 
Va and Villa to repress the coagulation cascade and may 
also enhance fibrinolysis by inactivating PAI-I, the principal 
physiological inhibitor of tissue plasminogen activator (Comp 
& Esmon, 1981; van Hinsbergh, 1985), or indirectly by 
inhibiting the activation of TAFI (thrombin activatable 
inhibitor of thrombolysis) (Bajzar et al., 1995). Thus the 
aPC pathway may serve as a mechanism to attenuate the 
extent of blood clotting and prevent propagation of the clot 
beyond the site of an injury (Esmon, 1987). 

Several approaches have been used to exploit the PC 
pathway as a means to achieve anticoagulation in vivo 
including infusion of soluble thrombomodulin (Gomi et al., 
1990), infusion of a PC-activating protease isolated from the 
venom of snakes belonging to the genus Akistrodon (Struk- 
ova et al., 1989; Kogan et ah, 1993), direct infusion of 
exogenously activated PC (Comp & Esmon, 1981; Gruber 
etal., 1989, 1990; Okajima et al., 1990; Taylor etal., 1987), 
or infusion of a variant of PC that is more susceptible to 
activation by thrombin (Richardson et al., 1992; Kurz et al., 
1994). 

An alternative approach has been to take advantage of 
enzyme tumover and activate endogenous PC by infusion 
of thrombin itself. Infusion of thrombin in dogs resulted in 
systemic anticoagulation (Comp et al., 1982), and infusion 
of thrombin in baboons was shown to be effective in reducing 
thrombus formation in arteriovenous shunts (Hanson et al., 
1993). In both cases, coinfusion of antibodies that blocked 
PC also blocked the anticoagulant effect of the thrombin 
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infusion, strongly implicating the aPC anticoagulant pathway. 
However, the effects of the procoagulant activities of 
thrombin were also evident particularly at higher doses. 
Elevation of markers of fibrin formation and platelet activa- 
tion and consumption of fibrinogen and clotting factors were 
indicative of intravascular coagulation. 

One approach to minimizing the procoagulant functions 
of thrombin while retaining the anticoagulant effect has been 
to infuse a reversibly acylated form of thrombin [(giianidi- 
nobenzoyl)thrombin] that was maintained in inactive form 
prior to thrombomodulin binding (McBane et al., 1995). 
However, preliminary studies demonstrating that the PC 
activation and fibrinogen clotting activities of thrombin could 
be dissected by point mutations (Wu et al., 1991) suggested 
to us that a genetically engineered variant of human thrombin 
with impaired specificity for the procoagulant substrates that 
retained anticoagulant activity might be a more attractive 
anticoagulant agent. 

A library of 62 thrombin mutants, in which the charged 
and polar amino residues that were highly exposed on the 
surface of the molecule were substituted with the small 
neutral amino acid alanine, was screened for mutants that 
were defective in fibrinogen clotting yet retained their ability 
to activate PC in the presence of TM. Four mutants (W50A, 
K52A, E229A, and R233A) displayed significant shifts In 
their substrate specificity that favored the anticoagulant 
substrate, PC (Tsiang et al., 1995). The E229A mutant, 
which displayed a 22-fold switch in substrate specificity, was 
selected as a prototype and was demonstrated to function as 
a selective endogenous protein C activator (PC A) in Cyno- 
molgus monkeys, causing reversible anticoagulation without 
activating platelets or fibrin clot formation (Gibbs et al., 
1995). 

In this study, we attempted to optimize the specificity of 
thrombin toward PC by individually substituting thrombin 
residues W50, K52, E229, and R233 with all 19 naturally 
occurring amino acids. Screening of this collection of all 
76 possible substitutions identified a mutant, E229K, with 
enhanced specificity for PC and enhanced anticoagulant 
potency in vivo. 

EXPERIMENTAL PROCEDURES 

Materials. Reactl-BIND maleic anhydride activated poly- 
styrene 96-well plates were from Pierce (catalog no. 151 10). 
PPACK (D-Phe-Pro-Arg chloromethyl ketone) was from 
Sigma. Purified human a-thrombln and protein C (fi-ee of 
detectable prothrombin) were from Haematologic Technolo- 
gies. Purified human antithrombin III was from Enzyme 
Research Laboratories. The anti-human prothrombin poly- 
clonal antibody was from DAKO (catalog no. 325), and the 
goat anti-rabbit antibody conjugated with horseradish per- 
oxidase was from Vector Laboratories. Echis carinatus 
snake venom was from Sigma. DEAE-Sepharose Fast Flow 
was fix)m Pharmacia. Amberllte CG50 cation-exchange resin 
was from ICN Biomedicals. 

Construction, Transient Expression, and Screening of 
Thrombin Mutants. The detailed methods describing the 
expression, processing, and assay of transiently expressed 
thrombin mutants have been described previously (Tsiang 
et al., 1995). 

Thrombin mutants were constructed by oligonucleoti de- 
directed mutagenesis as described previously (Gibbs & 
Zoller, 1991) using four synthetic oligonucleotide primers 
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where the codons encoding thrombin residues W50, K52, 
E229, and R233 were replaced with randomized codons NNS 
(where N = A or C or G or T and S = C or G). On average, 
clones encoding 13/19 of all possible amino acid substitutions 
at each position were identified following the sequencing of 
48 clones. The remaining mutations were constructed using 
specific mutagenesis primers. The collection of 76 thrombin 
mutants was transiently expressed in COS-7 cells. Condi- 
tioned medium was concentrated by ultrafiltration and treated 
with E. carinatis venom to process prothrombin to thrombin. 
Processing was determined to be complete following analysis 
by Western blotting of reducing SDS-PAGE gels. Throm- 
bin concentration in concentrated conditioned medium was 
determined by ELISA (see below). Specific amidolytic 
activity [S-2238 (H-D-Phe-Pip-Arg-pNA) hydrolysis], fi- 
brinogen clotting activity, and PC activation in the presence 
of TM were determined as described previously (Tsiang et 
al., 1995). Specific fibrinogen clotting activity (FC) (ex- 
pressed as a percent of wild-type thrombin activity) was 
determined from the time to clot formation following the 
addition of purified fibrinogen (to 0.4 mg/mL) to a prede- 
termined amount of each thrombin mutant. Clotting activity 
was related to the clotting activity of wild-type thrombin 
using a standard curve of clotting time relative to thrombin 
concentration. Specific activity of protein C activation (PA) 
(expressed as a percent of wild-type thrombin activity) was 
determined by Incubating each thrombin mutant with 5 nM 
TM and 887 nM PC. The concentration of aPC generated 
was determined from the hydrolysis of the chromogenic 
substrate S2366 following termination of the reaction by the 
addition of heparin/antithrombin III. Specific activity of PC 
activation was based on the concentration of thrombin/TM 
complexes in the reaction. The relative specificity for protein 
C activation over fibrinogen cloning was determined from 
the PA/FC ratio. 

Thrombin ELISA, The expression level of each thrombin 
mutant was estimated using an immunoassay employing 
PPACK to capture and immobilize thrombin on a microtiter 
plate for subsequent detection using an anti-human pro- 
thrombin/thrombin polyclonal antibody. Purified human 
a-thrombin was used to construct a standard curve for each 
assay. Microtiter plates were prepared fi^hly for each assay 
as follows: PPACK (15 fiM) in PBS was coated onto the 
bottom of a maleic anhydride activated 96-well plate and 
incubated overnight at room temperature. The PPACK was 
removed, and the excess sites on the plate were blocked for 
30 min with 3% BSA in PBS and 0.05% Tween-20 (Tw). 
The blocking step was repeated, and the plate was then 
washed with wash buffer (PBS, 0.05 % Tw). Triplicate 
aliquots of purified thrombin standards (0.5—12 ng/mL) and 
duplicate aliquots of thrombin mutants were diluted in 
dilution buffer (0.1% BSA in PBS 0.05% Tw) and then added 
to the plate and incubated for 2 h at room temperature. 
Thrombin was detected by incubating the plate for 1 h at 
room temperature with the rabbit an ti -human prothrombin/ 
thrombin polyclonal antibody (1:1000) diluted in dilution 
buffer followed by 1 h incubation at room temperature with 
a goat anti-rabbit antibody conjugated with horseradish 
peroxidase (1 :2000) diluted in PBS and 0. 1% Tw. The plate 
was washed between each incubation step and developed 
using the peroxidase substrate ABTS (Vector Laboratories). 
The absorbance at 405 nm was determined using a Molecular 
Devices plate reader. The concentration of each thrombin 
mutant was determined from the standard curve using data 
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from at least two dilutions each measured in duplicate. The 
assay was sensitive to 0.5 ng/mL thrombin, and the linear 
range extended beyond 12 ng/mL. 

Expression of Selected Thrombin Mutants Using an 
Expression Vector Based on the Sindbis Virus. An Xbal 
fragment encoding the entire prothrombin coding region was 
isolated fix>m BS(KS)-hFII (obtained from Ross MacGillivray, 
University of British Columbia) (Friezner Degen et al., 1983) 
and was cloned into the Xba\ site dovmstream of the 
subgenomic RNA promoter in the Sindbus virus-based 
expression vector SINrepS (Bredenbeek et al., 1993) to 
generate SINrep5-hI*T. A 1301 bp Kas\~Apa\ fragment that 
spans the region encoding the thrombin B-chain was isolated 
from pRc/CMV-hPT clones expressing selected mutants 
(E229A, E229K, E229S, E229W, E229Y, R233F, W50E, 
W50I) and used to replace the corresponding Kas\—Apa\ 
fragment in SINrep5-hPT, allowing expression of thrombin 
mutants from the Sindbis subgenomic promoter. 

Cell Culture and Cell Lines, COS-7 and BHK-21 cells 
were grown and passaged in media I (DMEM containing 
4.5 mg/mL glucose, 3.7 mg/mL NaHC03, 10% CS, 2 mM 
glutamine, 100 units/mL penicillin, and 100 /ig/mL strep- 
tomycin). During the prothrombin secretion stage, BHK- 
21 cells were incubated with media II (serum-free media I 
supplemented with 10 /^g/mL vitamin K, 5 ^g/mL insulin, 
5 //g/mL transferrin, and 5 //g/mL fetuin). The replicon 
packaging helper cell line BHK-BBneo was a generous gift 
from Dr. Sondra Schlesinger (Washington University, St. 
Louis, MO). This cell line was derived by stable transfection 
of BHK-21 cells (Bredenbeek et al., 1993) with a construct 
comprised of the Sindbis helper RNA (DH-BB) encoding 
the Sindbis structural proteins (Liljestrom & Garoff, 1991; 
Liljestrom et al., 1991) followed by a Neo*^ gene. In the 
absence of Sindbis replication machinery (supplied in trans 
through transfection of replicon RNAs), these cells do not 
express Sindbis-specific polypeptides. BHK-BBneo cells 
were passaged in media III [a-MEM containing 10% PCS, 
100 units/mL penicillin, 100 /^g/mL streptomycin, and 800 
^g/mL G418 (Gibco)]. 

Generation of Sindbis Repl icons and Expression of Pro- 
thrombin Mutants. SINrep5-hPT mutants were linearized 
with Notl, The linearized plasmids (1—3 fig/50 ^L of 
reaction) were used as template for the synthesis of replicon 
RNA by SP6 RNA polymerase in the presence of 1 mM 5' 
cap analog m'G(5')ppp(50G (New England Biolabs) as 
previously described (Rice et al.. 1987). BHK-BBneo cells 
were transfected with 2 pmol of replicon RNA by electropo- 
ration, diluted in 13.5 mL of media III, and plated in two 
T25 flasks (Liljestrom et al., 1991). The media containing 
packaged repl icons were harvested 48 h postelectroporation 
when the cytopathic effect was obvious. At this time, titers 
of packaged replicons were 3 x 10*— 2 x 10* infectious 
units/mL (Schlesinger, personal communication). We as- 
sumed that all our PO harvests had a titer of '^1 x 10* 
infectious units/mL. To amplify the replicon stock, new 
monolayers of BHK-BBneo were infected with the PO 
harvest at an moi of '^l.O. The amplified replicon stock 
harvested 48 h postinfection was used to infect BHK-21 cells 
in roller bottles at an moi of '^6.0. Seventy two hours 
postinfection, conditioned medium containing secreted pro- 
thrombin was harvested. The expression level was '^4.6 ^g 
of prothrombin/ 10^ cells or ~417 ^g of prothrombin/roiler 
bottle. 
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Purification of Thrombin Mutants. The purification 
procedures for prothrombin and thrombin were modified 
fix)m previously described protocols (Wu et al., 1991; Fenton 
et al. 1977). For each mutant, the conditioned medium from 
four roller bottles was clarified by centrifiigation at 2500 
rpm and vacuum filtration through Whatman No. 3 and 
Whatman No. 1 filter paper. Sindbis replicon particles in 
the media were inactivated by adding Triton X-100 to a final 
concentration of 0.2%. Benzamidine hydrochloride (5 mM) 
and barium sulfate (40 mg/mL) were added, and the medium 
was incubated on ice for 30 min. The medium was then 
centrifuged at 3000 rpm for 15 min, and the pellet was 
washed three times with 5 mM sodium acetate, pH 7.0. The 
prothrombin was eluted with 200 mM sodium citrate, pH 
7.0, and 5 mM benzamidine hydrochloride and dialyzed 
against 0.1 M potassium phosphate, pH 7.5, containing 5 
mM benzamidine hydrochloride with the final dialysis step 
performed in the absence of benzamidine hydrochloride. The 
dialysate was then loaded onto a DEAE-Sepharose Fast Flow 
column (10 mL) equilibrated in 0.1 M potassium phosphate, 
pH 7.5. The column was washed with the equilibration 
buffer, and the prothrombin was eluted with a 0.1—0.7 M 
potassium phosphate, pH 7,5, gradient. A sample from each 
fraction was activated by E, carinatus venom and assayed 
for amidolytic activity. The peak fractions were pooled and 
dialyzed against 0.02 M HEPES and 0.1 M NaCl, pH 8.0. 
The dialysate was then concentrated using a stirred cell with 
a PM30 membrane (Amicon). The prothrombin was pro- 
cessed to thrombin by E. carinatus venom that had been 
cleared of components that adsorb to Amberlite CG50 cation- 
exchange resin. Following activation, the resulting thrombin 
was immediately loaded onto an Amberlite CG50 column 
(8 mL) equilibrated in 0.02 M HEPES and 0.1 M NaCl, pH 
8.0. The column was washed with equilibration buffer and 
eluted with a 0.10-1.0 M NaCl gradient containing 20 mM 
HEPES, pH 8,0. The fractions were assayed for amidolytic 
activity, pooled, and concentrated. The purity of each mutant 
was assessed by SDS— PAGE and silver staining, and the 
thrombin concentration was determined by thrombin ELISA. » 

Kinetic Analysis of Fibrinopeptide A Release. Fibrin- 
opeptide A (FPA) release was carried out as previously 
described (Higgins et al., 1983). Briefly, 500 fA. reactions 
of fibrinogen and mutant thrombin in reaction buffer 
containing 137 mM NaCl, 2.5 mM KCl, 0.9 mM CaClj, 0.5 
mM MgCk, 9.47 mM sodium phosphate, pH 7.4, and 0,001% 
, PEG-6000 were incubated at 37 °C and stopped with 43 ^L 
of 3 M HCIO4. The precipitated protein was spun down in 
a microcentrifuge at 15 000 rpm for 5 min. The supernatant 
was transferred to an automatic sample injector for injection 
of 250 fiL into the HPLC system. FPA and FPB were 
separated on a Cig column using a 15 min gradient of 
22.5%-49.5% CH3CN in 0.1% TFA. FPA and FPB eluted 
at 30.6% and 32.4% CH3CN, respectively. For determination 
of EC50 (the concentration of thrombin causing the release 
of 50% of FPA after 60 min at 37 °C), mutant thrombin 
concentration was varied from 0.5 to 30 nM, and the 
reactions were incubated for 60 min at 37 °C and stopped. 
The data were fitted to the empirical function P = Pmax (1 
_ g-(in 2)[iu]/EC5o)^ which best fit the data, using nonlinear 
regression to calculate the ECso and associated standard error. 
P is the concentration of FPA released with a thrombin 
concentration of [I la], and f*max is the maximum concentration 
of FPA that can be released. 
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For determination of the catalytic efficiency (kaJKm) of 
FPA release, a thrombin concentration that allowed the 
complete release of both FPA and FPB in 60 min was used. 
The kinetic parameters were determined by the analysis of 
differential progress curves (Graycar & Estell, 1987) gener- 
ated at two fibrinogen concentrations, 1.4 and 21 ^M, from 
0 to 60 min and from 0 to 70 min, respectively. 

Briefly, for the simple one-substrate Michael is— Menten 
mechanism with product inhibition, the difTerential rate 
equation is 



dP 



(So - sy 
1 \ \-^s 



(1) 



where S is the substrate concentration at time /, So the 
substrate concentration at time / = 0, (5o — 5) the product 
concentration, and Ki the product inhibition binding constant 
Equation 1 can be rearranged to give a linear plot of \/v 
versus l/S: 
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In the absence of product inhibition, K\ approaches infmity 
and eq 2 simplifies to the Lineweaver— Burk equation. 
Performing a second progress curve using the same total 
enzyme but a new initial substrate level such that So = aSo 
allows for the K\ to be determined. The ratio of the slopes 
from eq 2 for each reaction gives 
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Rearranging eq 3 we have 
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Using the derived K\ and the slope m and>^-intercept b from 
one of the two progress curves, the and Km are calculated 
as follows: 
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TTie progress curve of FPA release can be expressed as the 
integrated form of the Michaelis— Menten equation: 



/ = ^1 Inl 
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where [FPA] is the concentratio of FPA at time /, [Aa]o is 
the initial concentration of Aa-chains of fibrinogen, k\ is a 
constant, function of [Aa]o, V^^ K^, and K\ (the product 
inhibition constant), and is another constant, function of 
Kn, ATm, and K\, [Aa]o, ku and were determined by fitting 



eq 7 to the data. The inverse rate \lv of FPA release at 
each time point was then calculated using the equation: 
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\lv was plotted versus l/[Aa] = l/([Aa]o - [FPA]), and 
the slope m and ^-intercept b were then determined by curve 
fitting to a line. ATj, K„i, and f^max were subsequently 
calculated according to eqs 4—6. 

Kinetic Analysis of Protein C Activation, Cell lysates 
containing recombinant human TM were prepared as previ- 
ously described (Tsiang et al., 1995). The concentration of 
thrombomodulin was calculated by direct equilibrium binding 
of '25i-DIP-thrombin to cells (Tsiang et al., 1990). Activation 
of PC was carried out in 50 mM Tris-HCl, pH 8.0, 2 mM 
CaCb, 100 mM NaCl, and 0.1% BSA. The aPC generated 
was assayed by hydrolysis of chromogenic substrate S-2366. 
To determine the affinity of mutant thrombin for TM, TM 
and PC were used at 0.5 nM and 1 .0 jwM, respectively, with 
increasing concentrations of mutant thrombin (0.1—20 nM). 
The ATd for TM binding and the rate of PC activation at 
saturating thrombin concentration ( Vs) were determined by 
fitting the data to the binding equation [EL] = [Eo][L]/(/rd 
+ [L]), where [EL] is the concentration of the thrombin/ 
TM complex measured as the rate of PC activation and [Eo] 
is the concentration of TM measured as Ks: the rate of PC 
activation when TM is saturated with thrombin and the 
concentration of PC is 1 fjM. The background rate of PC 
activation in the absence of TM was subtracted. 

To determine the catalytic efficiency {k^JK^ of PC 
activation, steady-state kinetic conditions were used, with 
TM at 0.5 nM, mutant thrombin at the saturating concentra- 
tion of 30 nM, and seven concentrations of PC (0.50, 0.66, 
1.00, 1.30, 2.00, 4.00, and 10.00 //M). The initial velocity 
of PC activation for each concentration of PC was determined 
from a time course with time points taken at 7-min intervals. 
The Ktn and for PC activation were again determined by 
fitting the Michaelis— Menten equation to the data assuming 
that the enzyme concentration was 0.5 nM, equal to the 
concentration of TM. 

Platelet Aggregation. Platelet aggregation by recombinant 
wild-type and mutant thrombins (WT, E229A, E229K, and 
E229W) was performed with fi-esh citrated human platelet- 
rich plasma (PRP) using a Chrono-Log dual channel aggre- 
gometer (model 560-VS, Chrono-Log Corp., Havertown, 
PA). Recombinant human thrombin (WT or mutants) was 
added to 400 piL of prewarmed PRP (37 °C for 2 min) to a 
final concentration of 0.19-1.29 nM (WT), 2.7-22.0 nM 
(E229A), 5.9-29.7 nM (E229K), or 12.9-49.7 nM (E229W). 
The platelet aggregation stimulated by addition of wild-type 
or mutant thrombins was monitored by the aggregometer for 
5 min as an increase in light transmission. The extent of 
platelet aggregation was quantitated by measuring the area 
under the tracing of the aggregation curve from 0 to 1 min 
after the addition of thrombin. The extent of aggregation 
was expressed in the arbitrary unit of cm^ and plotted against 
the concentration of the agonist. The data were fitted to the 
empirical function which best fit the data: A = -4max/(l + 
e^»-lita])j^ where A is the extent of platelet aggregation at 
a thrombin concentration of [Ila], Amax is the maximum extent 
of platelet aggregation, and w is a parameter that controls 
the steepness of the logistic curve. The EC50 value (con- 
centration of thrombin required to induce half-maximal 
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aggregation after 1 min) and associated standard error were 
determined by nonlinear regression. 

Inhibition of Selected Thrombin Mutants by Antithrombin 
III in the Absence of Heparin. Selected purified thrombin 
mutants were screened for resistance to ATIII by determining 
the concentration required for half-maximal inhibition (IC50) 
over a 30 min time period in the absence of heparin. Wild- 
type or mutant thrombins (5—80 nM) were incubated in 
selection buffer (20 mM Tris— acetate, pH 7.5, 140 mM 
NaCl, 5 mM KCl, 1 mM MgCla, 1 mM CaCb) plus 0.1% 
BSA with increasing concentrations (0.008—8.0 fiM) of 
purified ATIII at 25 °C. After 30 min the inhibition reaction 
was terminated by the addition of 100 ^wM chromogenic 
peptide substrate S-2238. The residual activity due to 
uninhibited thrombin was determined by monitoring the 
hydrolysis of S-2238. IC50 values (ATIII concentration 
required for 50% inhibition of thrombin after 30 min) and 
the associated standard errors were determined by fitting the 
data to the empirical ftinction: A = 100(IC5o)"/((IC5o)" + 
[ATIII]"), which best fit the data, using nonlinear regression, 
where A is the percent thrombin activity remaining at an 
antithrombin III concentration of [ATIII] and n is a parameter 
that controls the steepness of the curve. 

Second-order rate constants for inhibition of purified 
thrombin mutants were determined as described previously 
(Ye et al., 1994). Thrombin (20-60 nM) was incubated in 
selection buffer plus 0.1% BSA with increasing concentra- 
tions (0.2-6.0 /iM) of purified ATIII at 25 At various 
time intervals (1—95 min) aliquots of the reaction mixture 
were diluted 10-fold into 100 ^M S-2238 to terminate the 
inhibition reaction. The residual activity due to uninhibited 
thrombin was determined by monitoring the rate of hydroly- 
sis of S-2238. Under pseudo-first-order conditions the time 
course of inhibition followed eq 9: 
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^(1 + [S]/KJ 
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(10) 
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where is the thrombin activity at time r = 0 in Et 
the thrombin activity at time and K the observed pseudo- 

first-order rate constant. The pseudo-first-order rate constant 
of the decrease in activity due to inhibition by ATIII {k) 
was determined by nonlinear regression, and the second- 
order rate constant of inhibition (^2) was determined from 
the slope of plots of k versus ATIII concentration. 

Heparin-Dependent Inhibition by Antithrombin III. The 
reaction buffer contained 20 mM Tris— acetate, pH 7.5, 140 
mM NaCl, 5 mM KCl, 1 mM MgCb, 1 mM CaCU, and 
0.1% PEG-8000. Inhibition by ATIII in the presence of 
heparin was performed in the presence of competing substrate 
(S-2238) as previously described (Sheehan et al., 1993). 
Briefly, 150 nM ATIII, 1.0 unit/mL heparin, and 150 //M 
S-2238 were premixed in 590 /^L of reaction buffer. This 
mixture was added to a spectrophotometric cuvette containing 
10 /<L of 300 nM mutant thrombin (final concentration of 5 
nM), and hydrolysis of S-2238 was immediately measured 
for 2 min. The pseudo-first-order rate constant of inhibition 
{l^) was determined by fitting eq 9 to the first derivative of 
the time course of S-2238 hydrolysis, which was directly 
calculated from the data. To determine the second-order rate 
constant ^2, ^ was corrected for substrate competition and 
divided by the inhibitor concentration [I] as shown in eq 
10: 



where [S] is the substrate concentration during the reaction 
and ATm the Michaeiis constant of the substrate for each 
mutant thrombin. 

Clearance of Thrombin Activity from Human Plasma in 
Vitro, Thrombin and thrombin mutants (100 nM) were 
mixed with 10 volumes of fibrinogen-deficient human plasma 
(George King Biomedical) , and incubated at 25 °C. At 
various time intervals aliquots of the reaction mixture were 
diluted 50-fold into 100 //M S-2238 to terminate the 
inhibition reaction. The residual activity due to uninhibited 
thrombin was determined by monitoring the hydrolysis of 
S-2238. The kinetics of the clearance of thrombin activity 
versus time approximately corresponded to a first-order 
exponential decay (eq 9). The half-life of thrombin and 
thrombin variants in human plasma was estimated using 
nonlinear regression analysis. 

Anticoagulation Studies in Cynomolgus Monkeys, A cell- 
line stably expressing E229K prothrombin was constructed 
by transfecting CHO cell-line AA8 with pRc/CMV-hPT 
containing the E229K mutation. E229K prothrombin was 
secreted into CHO serum-free medium (Gibco), harvested, 
processed to thrombin, and purified as described previously 
(Gibbs et al., 1995). 

All animals received humane care consistent with pub- 
lished NIH guidelines (NIH, 1985). Four adult Cynomolgus 
monkeys received 10 min intravenous infiisions of purified 
E229K at a dose of 0.5 fxg kg"' min"', formulated in a total 
volume of 10 mL steric isotonic saline. Blood samples were 
drawn at intervals before, during, and after (up to 180 min) 
infusion to allow the coagulation state of the animal to be 
monitored. Plasma samples for determination of clotting 
times, [fibrinogen], [factor V], and [factor VIII], were 
anticoagulated with 0.38% citrate. Samples for measurement 
of aPC levels were anticoagulated with 0.38% citrate plus 
25 mM /7-aminobenzamidine. Samples before (/ = 0) and 
after infiiision (/ = 10 min) for determination of fibrinopeptide 
A and D-dimer levels were anticoagulated with an inhibitor 
cocktail (containing citrate, heparin, and specific protease 
inhibitors) provided with the assay kit (Asserachrom FPA). 
Before (/ = 0) and after (/ = 180 min) samples for 
determination of platelet factor 4 and /3-thromboglobulin 
were anticoagulated with 0.38% citrate plus adenosine, 
theophylline, and dipyridamole. Plasma was obtained from 
whole blood immediately. Whole blood samples for deter- 
mination of platelet counts (before and after) were collected 
in 0.15% EDTA. 

All clotting assays and quantitative determinations per- 
formed on plasma samples have been described previously 
(Gibbs et al., 1995). Template bleeding times were per- 
formed before infusion (/ = 0) and at the peak of antico- 
agulation (/ = 20 min). A blood pressure cuff was inflated 
to 40 mmHg, and a defined incision was made on the volar 
surface of the forearm using a commercial device (Surgicutt 
Jr.). The wound was blotted at 15 s intervals, and the time 
to the cessation of bleeding was recorded. 

R£SULTS 

Initial Screening of 76 Saturation Mutants of Residues 
W50, K52, E229, and R233. Each mutant was transiently 
expressed in COS-7 cells and assayed for TM-dependent PC 
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Figure 1 : Initial screening of thrombin mutants for enhanced specificity toward protein C following saturation mutagenesis of residues 
W50. K52, E229, R233. PC activation (PA) and fibrinogen clotting (FC) specific activity was determined for each thrombin mutant in 
concentrated conditioned cell culture medium follovvdng transient transfection of COS-7 cells. Specificity was expressed as the PA/FC 
ratio where wild-type thrombin has a PA/FC ratio of 1. A ratio of greater than 1 indicates a greater specificity toward PC as compared to 
wild type (bars pointing upward), and a ratio of less than 1 indicates a lower specificity toward PC (bars pointing downward). The mean 
relative standard error for the PA/FC ratio of all mutants is 40%. 



Table 1: Secondary Screening of Thrombin Mutants with Enhanced Protein C Activation Activity Compared to Fibrinogen Clotting Activity" 


thrombin 


FPA release. 


TM binding, 


PC activation/ 


ATIII inhibition. 


mutants^ 


EC50 (nM) 


/Td (nM) 


Ks (pmol of aPC/h) 


IC50 ifM) 


WT 


ND 


3.46 ±0.41 


6.12 ±0.59 


0.069 ± 0.003 


E229A 


0.69 ± 0.05 


3.75 ± 1.13 


2.37 ± 0.66 


1.97 ±0.23 


E229K 


2.90 ±0.16 


4.38 ± 0.67 


1.38 ±0.16 


5.73 ±0.41 


E229S 


0.75 ± 0.02 


4.69 ± 0.70 


2.54 ± 0.30 


1.34 ±0.09 


E229W 


3.55 ± 0.30 


3.43 ± 0.40 


1.36 ±0.11 


0.78 ± 0.03 


E229Y 


0.34 ± 0.01 


4.12 ±0.66 


1.80 ±0.22 


1 .08 ± 0.07 


R233F 


0.49 ±0.01 


3.52 ± 0.46 


0.74 ± 0.07 


1.48 ±0.11 


W50E 


0.35 ± 0.02 


1.88 ±0.47 


0.3 1 ± 0.05 


5.28 ±0.10 


W50I 


0.125 ±0.016 


3.32 ± 0.44 


1.36 ±0.13 


1.53 ±0.06 



" Mutants that displayed a PA/FC ratio > 18 following initial screening (Figure 1) were purified and characterized in more detail. * Errors represent 
standard errors. ^ Vs = rate of PC activation for 0.5 nM TM saturated with thrombin and 1 PC. 



activation and fibrinogen clotting activity. Specificity for 
protein C was expressed as the ratio of the specific activity 
of PC activation (PA) over the specific activity of fibrinogen 
clotting (FC) (Figure 1). The ratio for wild-type thrombin 
was assigned a value of 1 . Nine mutants with a PA/FC ratio 
of greater than 18, E229K, E229S, E229W, E229Y, R233F, 
W50C, W50E, W50I, and K52C, were identified in addition 
to E229A thrombin, the prototype protein C activator. 
Because the PA/FC ratios for mutants with very low specific 
fibrinogen clotting activity (FC) values may be subject to 
error, all these mutants except W50C and K52C were 
selected for secondary screening. W50C and K52C were 
not selected for further analysis despite their favorable PA/ 
FC ratio because of their free cysteine, which has the 
potential to cause dimer formation or disruption of normal 
disulfide bridges within the thrombin molecule. 

Secondary Screening of 7 Mutants with an Enhanced PA/ 
FC Ratio, The seven thrombin mutants with a PA/FC ratio 
>18 identified in the initial screening were subcloned into 
the Sindbis virus expression vector for medium scale 
expression. About 100—120 fig of each mutant thrombin 
was purified for kinetic characterization. For secondary 
screening, the ECso for FPA release, the for TM binding. 



the Vs for PC activation, and the IC50 of ATIII inhibition 
were determined (Table 1). The affinity for TM of all the 
mutants tested did not change significantly from that of the 
wild type. E229K and E229W thrombins were distinguished 
by their decreased activity toward fibrinogen, displaying a 
4— 5-fold increase in the EC50 for FPA release compared to 
E229A thrombin. However, their for TM-dependent PC 
activation remained comparable to that of E229A thrombin. 
In addition, E229K but not E229W thrombin displayed 
increased resistance to ATIII. These two mutants were 
subjected to more detailed kinetic characterization to deter- 
mine which had superior specificity for PC. 

Final Discrimination between E229K and E229W Throm- 
bins, The kinetic parameters of FPA release and PC 
activation were determined for wild-type, E229A, E229K, 
and E229W thrombins (Table 2). The W/^m of FPA release 
for E229K and E229W were almost identical and ~5-fold 
lower than that of E229A and '^300-fold lower than that of 
wild-type thrombin. The kcJKm of PC activation in the 
presence of TM for E229K did not change from that of 
E229A while that of E229W decreased by 50.5% with 
respect to E229A. The net result is that E229K had a 130.9- 
fold higher specificity for PC over fibrinogen than wild type. 
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Table 2: 


Kinetic Analysis of Purified Thrombin Mutants with Enhanced Specificity for Protein C over Fibrinogen 




thrombin 

mutants" 


fibrinopeptide A release* protein C activation*" 


specificity ratio'' 


^(s-') Kr^UiM) *c«i//r„C"M-'-s-») Ab«(s-0 /rmC"M) WATm CuM-'-s-' 


) CEpa/CEfr 


WT 
E229A 
E229K 
E229W 


59 ± 13 3.16 ± 0.38 18.7 ± 6.7 0,650 ± 0.056 3.19 ± 0.51 0.20 ± 0.05 
7.88 ± 0.03 23.6 ± 0, 1 0.334 ± 0.003 0. 1 89 i 0.005 2.00 ± 0. 1 3 0.095 ± 0.009 
2.53 ± 0.23 38 ± 3 0.07 ± 0.01 0,21 ± 0.01 2.15 ± 0.26 0.098 ± 0.016 
3.12 ±0.02 52.8 ±0.3 0.059 ±0.001 0.148 ±0.004 3.12 ±0.19 0.047 ± 0.004 


0.0107(1) 
0.2844 (26.59) 
1.4(130.90) 

0.7966 (74.48) 


° Errors are standard errors. * The thrombin concentrations used for WT, E229A, E229K, and E229W were 0.3, 7.0, 30. and 25 nM, respectively. 
^ Thrombin concentration was 30 nM, TM was 0.5 nM, and PC was varied from 0.5 to 10 //M. For calculation of the enzyme concentration 
was assumed to be equal to that of TM (0.5 nM) saturated with thrombin. ''The' specificity ratio is the ratio of the catalytic efficiency for protein 
C activation (CEra) divided by the catalytic efficiency for fibrinopeptide A release (CEfr). The relative specificity compared to that of wild-type 
thrombin (specificity ratiOmaimit^specificity ratiowiid type) is in parentheses. 




Table 3: 


Further Characterization of Thrombin Mutant E229K 




thrombin 

mutants** 


platelet aggregation, S-2238 hydrolysis, ATIII/heparin inhibition, ATIII inhibition, 
ECso (nM) Kr^ (jM) kz x lO"* (M-^-min"') x 10"* (M->-min-0 


plasma inactivation, 

tm (s) 


WT 

E229A 

E229K 


1.3 ±0.9 2.95 ±0.46 5.98 ±0.14 3.25 ±0.18 
15 ±7 50.2 ±2.7 0.63 ±0.01 0.47 ± 0.04 
23 ± 9 63.2 ± 4.9 0.181 ± 0.005 0.150 ± 0.004 


36 ±9 
169 ± 48 
934 ± 224 


" Errors represent standard errors. 
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Figure 2: Activation of endogenous protein C and anticoagulation 
followdng iniusion of E229K thrombin in Cynomolgus monkeys: 
E229K thrombin was infused at a dose of 0.5 fi% kg"* min~* for 
10 min (« = 4). Anticoagulation was monitored by prolongation 
of the aJPTT (baseline = 19.8 s); endogenous aPC levels were 
recorded as a percent of total monkey plasma PC. Error bars 
represent standard errors. 

whereas E229W had only a 74.48-fold higher specificity for 
PC. E229K thrombin was also '^18-foid less efficient than 
wild type in its ability to activate platelets (Table 3) and 
also displayed increased resistance to antithrombin III 
compared to wild type, in both the presence and absence of 
heparin, by a factor of 33 and 22, respectively. The increased 
resistance to antithrombin III may be correlated to the 
observed 26-fold increase in plasma half-life in vitro (Table 
3). 

Anticoagulation in Vivo following Infusion of E229K 
Thrombin. Infusion of E229K thrombin in four Cynomolgus 
monkeys at a dose of 0.5 /zg kg~^ min"' for 10 min caused 
activation of endogenous protein C and an anticoagulant 
effect as measured by prolongation of the aPTT (Figure 2). 
The peak anticoagulant effect was observed between / = 10 
min and / = 30 min over which period the aPTT was 
prolonged by approximately 2.3-fold and approximately 6.6% 
of total plasma PC was observed to be in the aPC form. The 
effect of the infusion was reversible with aPC levels returning 
to baseline with a half-life of approximately 60 min. 
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Figure 3: Maintanence of hemostatic parameters following infusion 
of E229K thrombin in Cynomolgus monkeys: Preinfusion (Pre, / 
~ 0) and postinfusion levels (Post) of hemostatic parameters were 
determined following 10 min infbsions of E229K thrombin at a 
dose of 0.5 fi% kg~' min"^ (w = 4). Postinfusion values were 
determined at / = 180 min except for D-dimer and FPA (r = 10 
min) and bleeding time (/ = 20 min). Error bars represent standard 
errors. 

Notably, template bleeding times measured at the peak of 
anticoagulation were not prolonged (Figure 3). 

Fibrinogen was not consumed and FPA and D-dimer levels 
were not elevated following infusion of E229K thrombin 
(Figure 3), indicating that E229K thrombin did not cause 
detectable fibrinogen clotting in vivo. Similarly, the levels 
of coagulation factors V and VIII were not perturbed (Figure 
3), indicating that E229K thrombin did not activate the 
coagulation cascade. In addition, platelets were not con- 
sumed and plasma levels of platelet secretion granule 
components, y5-thromboglobulin and platelet factor 4, were 
not elevated (Figure 3), indicating that E229K thrombin did 
not stimulate platelet activation in vivo. 
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Figure 4: Specific contacts of the side chain of thrombin residue 
E229 in the complex of thrombin with FPA. The image was 
constructed using MidasPlus (UCSF) on the basis of the crystal 
structure of hiunan thrombin complexed with FPA (Stubbs et al., 
1992). Fibrinopeptide A is shown with the PI and P5 residues 
highlighted. The carbonyl oxygen of the amide bond in FPA 
cleaved by thrombin is indicated with an arrow. The only thrombin 
residues shown are E229 and K236. E229 is in van der Waals 
contact with the glycine residue at P5 in FPA (E229 Cy— P5 Gly 
O distance = 3.43 A) and forms a salt bridge with K236 (E229 
0€l-K236 N£ = 4.1 1 A; E229 062-K236 = 3.33 A). Atom 
coloring is as follows: carbon = brown, oxygen = red, and nitrogen 
= blue. Interatomic contacts with the side chain of E229 are 
indicated by white dotted lines. 

DISCUSSION 

In previous in vivo studies where thrombin infusion has 
been utilized to achieve an anticoagulant effect through the 
activation of endogenous PC, wild-type thrombin displayed 
a weak anticoagulant activity in vivo due to rapid clearance 
by ATIII and the dose-limiting effects of the familiar 
procoagulant functions of wild-type thrombin (fibrinogen 
clotting and platelet activation) (Hanson et al., 1993; Gibbs 
et al., 1995). In this study, screening of a library of 76 
thrombin mutants, where all possible substitutions at posi- 
tions W50, K52, E229, and R233 were sampled, identified 
a single amino acid substitution, E229K, that conferred 
optimal anticoagulant activity. Substitution of thrombin 
residue E229 with lysine resulted in a 267-fold decrease in 
the catalytic efficiency of cleavage of the fibrinogen Aa 
chain, an 18-fold decrease in the efficiency of cleavage of 
the platelet thrombin receptor, and a 33- and 22 -fold decrease 
in the rate of inhibition by ATIII in the presence and absence 
of heparin. In contrast, the affinity for TM was unchanged 
and the catalytic efficiency of TM-dependent PC activation 
was decreased by only 51%. 

Thrombin residue E229 is located at surface of thrombin 
on the edge of the active site cleft where it is well positioned 
to interact with substrates and active site inhibitors such as 
ATIII on the N-terminal side of the scissile bond. In the 
crystal structure of human thrombin complexed with FPA, 
E229 is observed in van der Waals contact with a glycine 
residue at P5 in FPA (Figure 4). As discussed previously 
(Gibbs et al., 1995), the most attractive structural rationaliza- 
tion for the differential effects of the substitution of E229 
on substrate recognition is the hypothesis that PC binds 
thrombin in a mode that is distinct from the other substrates 
and less affected by the replacement of E229. This 



hypothesis gains some support from the crystal structure of 
human thrombin complexed with FPA, where FPA is bound 
in a hooked conformation (Figure 4) due to a type II /?-tum 
that requires the residue at P5 to be glycine to avoid strain 
(Stubbs et al., 1992). In PC, the absence of glycine at P5 or 
elsewhere proximal to the cleavage site suggests PC is likely 
to bind thrombin in an extended confomiation unlike that 
observed with FPA. 

On average, the catalytic deficiencies toward the proco- 
agulant substrates and inhibition by ATIII are approximately 
5-fold more severe for the E229K variant than for E229A. 
This may be due to the introduction of a residue of opposite 
charge or altered conformation at position 229, leading to 
less favorable interactions with these substrates. Alterna- 
tively, the effects of the lysine substitution may be more 
indirect. In thrombin, the side chain of E229 can form an 
ion pair with the side chain of K236 (distance between 
charged centers = 3.7 A) (Figure 4) (Bode et al., 1992). The 
introduction of another lysine residue at position 229 may 
cause charge repulsion and more generalized structural 
changes that prevent optimal interactions with the proco- 
agulant substrates. 

The effects of the improved specificity and resistance to 
clearance of E229K thrombin observed in vitro were also 
manifested in vivo upon infusion of E229K thrombin in 
Cynomolgus monkeys. E229K thrombin was a more potent 
PC activator and anticoagulant than E229A thrombin in terms 
of both magnitude and duration of the effects. Previously, 
following infusion of E229A thrombin at a dose of 2.5 fig 
kg~* min~^ the maximal prolongation of the aPTT was 2.3- 
fold; aPC levels peaked at 5.5% of total plasma PC and 
returned to baseline with a half-life of approximately 15 min 
(Gibbs et al., 1995). Infusion of E229K thrombin caused 
effects of a similar magnitude (2.6-fold maximal prolongation 
of aPTT and peak conversion of 7.1% of plasma PC to aPC) 
but at a 5-fold lower dose of 0.5 //g kg~^ min~*. In addition, 
aPC levels returned to baseline more slowly with a half-life 
of approximately 60 min, which results in a cumulative 
increase in potency of greater than 10-fold when the area 
under the decay curve is taken into account. Although it is 
difficult to extrapolate in vitro clearance studies to in vivo 
situations, the increased potency of E229K thrombin ob- 
served in vivo may be correlated to its prolonged half-life 
in plasma in vitro (Table 3) and its resistance to purified 
ATIII (Table 3), which is the most important physiological 
inhibitor of thrombin (Rosenberg, 1987; Olsen & Bjork, 
1992). 

The enhanced specificity of E229K thrombin toward PC 
was also manifested in vivo. Previous studies in baboons 
(Hanson et al., 1993) and Cynomolgus monkeys (Gibbs et 
al., 1995) demonstrated that, following infusion of wild-type 
thrombin, the effects of both the procoagulant and the 
anticoagulant functions of thrombin were evident. In addi- 
tion to activating PC and prolonging clotting times, wild- 
type thrombin also cleaved fibrinogen (fibrinogen consump- 
tion, elevation of FPA and D-dimer), activated the coagulation 
cascade (consumption of factor VIII and factor V), and 
activated platelets (elevation of platelet factor 4 and /3-throm- 
boglobulin). Following infusion of the prototype PC activa- 
tor, E229A tiirombin, in monkeys (Gibbs et al., 1995), the 
anticoagulant effect was observed in the absence of almost 
all of the aforementioned procoagulant side effects with the 
only indication of any procoagulant activity being a mild 
elevation of FPA levels (42 nM compared with 254 nM for 
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wild type). In the present study there was no indication of 
any residual procoagulant function following infusion of 
E229K thrombin. FPA levels remained constant at prein- 
fiision levels (15 nM), which may reflect the enhanced 
specificity of E229K thrombin for PC over fibrinogen. 

A notable feature of the anticoagulation studies in Cyno- 
molgus monkeys is that anticoagulation by E229K thrombin 
as well as E229A thrombin (Gibbs et al., 1995) was not 
associated with prolonged template bleeding times even at 
elevated doses where the aPTT was in excess of 200 s (> 10- 
fold prolonged, « = 1) (data not shown). The mechanism 
of anticoagulation mediated by thrombin variants is through 
endogeneous aPC generation which by inactivating factors 
Va and Villa inhibits generation of factor Xa and thrombin. 
This mechanism is similar to that shared by inhibitors acting 
directly on coagulation factors upstream of thrombin. In 
contrast to direct thrombin inhibitors [PPACK (Hanson & 
Marker, 1988) and hirudin (Kelly et al., 1991)] and direct 
antiplatelet agents [mAbs versus vWF and GPIIbllla (Cadroy 
et al., 1994) and RGDV peptide (Cadroy et al., 1989)], 
anticoagulants that act by inhibiting thrombin generation [tick 
anticoagulant peptide (Sitko et al., 1992) and aPC (Gruber 
et al., 1990, 1991)] have shown antithrombotic efficacy 
without perturbing primary hemostasis as measured by 
bleeding time. 

Therefore, E229K thrombin which can function as a potent 
anticoagulant in vivo without any procoagulant side effects 
and without prolonging bleeding time may be a superior 
antithrombotic agent with reduced potential for bleeding 

complications. 
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We describe a selection procedure for construction of very high affinity variants of human 
growth hormone (hGH) for binding to the extra cellular domain of its receptor (called the 
hGHbp). Five different libraries of mutated hGH genes (each containing ^^2 x 10' protein 
variants) were created by randomly mutating four different codons at residues that were 
shown to be important for receptor binding by structural or functional criteria. Mutated 
proteins were displayed as single copies from their respective filamentous phagemid particles 
and sorted in vitro for binding to the immobilized hGHbp. Phagemid particles that bound 
the immobilized hGHbp were eluted and propagated. After three to seven rounds of binding 
enrichments, hGH variants were isolated that contained 2 to 4 mutations and exhibited 
three- to sixfold improvements in binding affinity. 

Because of the limits of DNA transfection efficiency in creating the library we could not 
sample thoroughly mutations at more than four codons at once. Nonetheless, the free energy 
effects for these mutations acted cumulatively. Thus, by combining aflSnity enhanced 
mutants from libraries independently sorted we created an hGH variant with 15 
substitutions that bound ^400-fold more tightly to the hGHbp than wild-type hGH. The 
affinity enhancements occurred predominantly by slowing the off-rate of the hormone 
(>GO-fold), and partly through increasing the on-rate (up to 4-fold). Residues that were 
shown to be important for binding by alanine-scanning were most highly conserved after 
binding selection, and interestingly many of them could be further improved. Thus, we 
found it most effective to randomly mutate the residues that were shown to modulate 
afRnity by alanine-scanning, and to combine the selectants from separate libraries that 
exhibit the highest affinities. The selection procedure and mutagenesis strategy provides a 
framework for affinity maturation of protein-protein complexes. 

Keytvorda: mutagenesis; phage display; human growth hormone; structure/function; 

hormones/receptors 
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} AbbreviationB used: hGH, human growth hormone; 
hOHbp, extracellular domain of the hGH receptor also 
called thed HGH binding protein; hPL, human placental 
lactogen; ETXJ. iV-ethyl-iV'(dimethylaminopropyl)- 
carbodiimide; NHS, N-hydroxysuccinimide; PBS, 
phosphate buffered saline (pH 7 4); PDEA, 2-(2- 
pyridinyldithio)ethaneamine hydrochloride; MAb, 
monodonal antibody; RIA, radio-immunoprecipitation 
assay; ssDNA and dsDNA, single-stranded and double- 
stranded DNA; SPR, surface plasmon resonance. hGH 
mutants arc designated by the wild-type residue (in 
single-letter code) followed by its position in the 
sequence and the mutant residue. For example. R64K 
indicates a mutation in which Arg64 is converted to 
Lys. Multiple mutants are indicated by a series of single 
mutants separated by slashes. 
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1. Introduction 

Determining how proteins associate is funda* 
mental to molecular recognition processes in 
biology. One of the most well-characterized protein- 
protein complexes is that between hGHJ and the 
extracellular domain of its receptor, called the 
hGHbp (for review, see Wells & de Vos, 1»93). High 
resolution structural and mutational analyses show 
that one hGH binds two hGHbps sequentially using 
two distinct sites on the hormone, called sites 1 and 
2 (Cunningham et aX., 1991a; de Vos ei al,, 1992). 
A striking feature of this complex is that less than 
half of the sido'chains from hGH that are buried in 
site I affect binding affinity when converted to 
alanine (^g. 1(a), (b); Cunningham & Wells, 
1993). In fact, a few of the buried aide-chains even 
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enhance binding affinity when converted to alanine 
(Cunningham & Wells. 1989, 1993; Fuh et al., 1992). 
An HGH mutant enhanced in affinity for site 1 and 
blocked in its ability to bind site 2 was a better 
antagonist of the hGH receptor than the site 2 
mutant alone (Fuh et aL, 1992). Thus, by improving 
site 1 affinity further, it may be possible to obtain 
an even better antagonist that could have utility in 
treating conditions of growth hormone excess such 
as acromegaly. 

Monovalent phage display of hGH (Bass ct aL, 
1990) has been used to produce three to eightfold 
increases in the affinity of site 1 for the hGH bp 
(Lowman et al.y 1991a). Additional improvements 
might be obtained by mutating more residues in the 
phage display lib^a^3^ However, it was not feasible 
to generate enough transformants to ensure that all 
possible residue combinations were represented 
when more than five or six codons were randomized 
at once (see Lowman & Wells, 1991). Mutations at 
protein-protein interfaces usually exhibit additive 
effects upon binding (Wells, 1990). Therefore, we 
reasoned that miich larger improvements in affinity 
may he possible by combining afiRnity enhanced 
variants from libraries in which other residues at the 
interface were mutated independently. 

Here, very high affinity variants of hGH were 
produced by combining mutants of hGH that were 
independently selected for increased binding affinity 
by monovalent phage display. By this approach we 
produced an hGH variant containing 15 substitu- 
tions that bound the hGHbp '^400- fold tighter than 
wild-type hGH. These studies suggest guidelines for 
targeting residues for affinity optimization. 

2. Materials and Methods 

(a) General procedures 

Restriction enzymes, polynucleotide kinase, bacterio- 
phage T7 DNA polymerafie, and T4 DNA ligase were 
obtained from Gibco-BRL or New England Biolabs and 
used according to the manufacturer's directions. Oligo- 
nucleotide cassettes that contained random substitutions 
were phosphoryiated, annealed, and ligated into 
constructs as described (Lowman et aL, 109 la; Lowman & 
Wells, 1001), Sequenase^ was purchased from United 
States Biochemical and used according to the manufac- 
turer's directions for single-stranded sequencing (Sanger et 
al„ 1977), 

Site-specific mutants of hGH were constructed by oligo- 
nucleotide-directed mutagenesis, using a single-stranded 



template that contained deoxyuracil (Kunkel el al., 1991). 
The plasmid, phGHam-g3 which encodes wild-ty|>e hGH 
fused to the carboxy-terminal domain of Mlli gene III 
(Lowman et al.^ 1991a), was used to construct parental 
vectors for cassette mutagenesis (Wells et a/., 1985). 
Phagemid particles that display a single copy of the hGH- 
gene TTT fusion protein were prepared (Lowman & Wells, 
1991) by electro-transforming EsctieHchia coli XLl-Bluc 
cells (from Stratagene*), and adding M13K07 helper 
phage (Vieira & Messing, 1987) which provides a large 
excess of the wild-type gene III protein. 

Soluble hormones were expressed in K. coli (Chang et 
al., 1987) and ammonium -sulfate precipitat^jd from osmo- 
tically shocked cell supernat^nts (Olson et aL, 1981). The 
protein concentration was determined by laser densito- 
metry of Coomassie-stained SDS-PAGE gels as previously 
described (Cunningham et al., 1990). Some variants were 
purified further by ion-exchange chromatography on a 
Mono-Q column (Pharmacia-LKB Biotechnology, Inc.). 

(b) Preparation of hGH -phagemid libraries 

To construct the Minihelix-1 library of hGH (Ml) in 
which residues K41, Y42, L45 and Q46 were randomly 
mutat«d (Fig. 1(c)), the existing Aaill site in phGHam-g3 
was destroyed using oligonucleotide no. 718 (5'-GCCA0C- 
TGATGTCTAAGAAAC-3'). The underlined nucleotide 
indicates a mismatch on the template. Unique Sfil and 
Aatll sites were introduced into phGHam-g3 to create 
pH0779, using oligonucleotides 782 (S'-TTTGAAGAGGC- 
CTATATGGCCAAGGAACAGAAG-3'), and 821 (5'-CAG- 
AACCCCCATTGACGTCCCTCTGTTTC-3'), respectively. 
The latter oligonucleotide introduced a +2 frameshift 
and a TGA atop codon after residue 40. A randomly 
mutat«d cassette was constructed from the complemen- 
tarv oligonucleotides 822 (r/-TCCCGAAGGAGCAGN- 
XSNNSTCGTTCNNSNNSAACCCGCAGACGT-3') and 
823 (5'-CTGCGGGTTSNNSNNGAACGASNNSNNCTGC- 
TCCTTCGGGATAT-3'), where N represents any of the 4 
nucleotides and S represents either G or C. The parental 
DNA (pH0779) was dige8te<l with restriction enzymes Sfil 
and AatW, and the large fragment was purified and 
ligated with the cassette. The ligation products were 
electro-transformed into XLl-Blue'^ cells for phagemid 
preparation in 2 aliquots, yielding I x 10* independent 
transformants each as described (Lowman & Wells, 1991). 

To construct the loop- J librar>' of hGH (LI) in which 
residues F64, E66, 168 and R64 were randomized, the 
existing AcUVl site in phGHam-g3 was destroyed using 
oligonucleotide 718. Unique Aat\l and BslEXl restriction 
sites were introduced in the hGH gene to construct the 
plasmid, pH0709, using oligonucleotides 719 (5'-AA0CCC- 
CAGACGTCCCTCTGT-3') and 720 (S'-GAAACACAACA- 
GTAAAGGTAACCrrAGAGCTGCT-3'). The latter oligo- 
nucleotide introduced a+1 frameshift and a TAA 



Figure 1. Comparison of receptor binding epitopes defined by alanine-scanning (a). X-ray structural analysis (b), or 
phage display (c). (a) Residues where alanine substitutions caused a 2-fold or greater effect on receptor binding affinity 
on a cartoon model of hGH based on the crystal structure of the hGH(hGHbp)2 complex (de Vos ei, al., 1992)). Dark 
spheres (#) show Ala substitutions that improved binding AAO=s —1 to —0*5 kcal/mol). Light spheres (o, 0« O) 
denote Ala substitutions (or a Gin substitution in the case of Lys41) that reduced binding energy by +0-5 to 
10 kcal/mol, -»-I O to 1-5 kcal/mol, +1-5 to 20 kcal/mol, or -h20 to 2-5 kcal/mol, respectively, (b) The hGH site 1 
structural epitope taken from Cunningham & Wells (1993). Light spheres (o, O, O, O) represent a change in solvent 
accessible area of —20 to 0 A', 0 to 20 A*. 20 to 40 A* or 40 to 60 A* respectively, at each residue upon alanine 
substitution as calculated from the hGH(hGHbp)2 crystal structure (de Vos et at., 1992). (c) The conservation of residues 
selected for binding to the hGHbp from random phagemid libraries. The fraction of wild-type hGH residues found at 
each position after sorting for binding to the hGHbj) is indicated by the size of the dark sphere: (•) 0 to 10% conserved; 
(•) 10 to 25%; (•) 25 U) 50%; (•) >50% (taken from data reported herein, and Lowman et eil., 1991a). 
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sto]i codon after residue 69. In addition, the unique 
fJc&RT site was destroyetl using oligonucleoticie 536 
(5'-CGTCnTCAAOAGTTCAACTTCTCC-3'>» to permit 
restriction -selection against possible contaminating clones 
from previous libraries (lawman & Wells. 1901). 
A randomized cassette was constructed from the comple- 
mentary oligonucleotides 803 (S'-pOCCTCTGTNNSTCA- 
NNSTCTNNSOCGACACCCACTAATNXSGACKIAAACA- 
CAACAGAAGA-3') and 804 (5'-pGTTACTCTTCrrCJTTG- 
TGTTTCCTCSNNATTACTGGGTGTCGGSXNAGASN- 
NTGASNNACAGAGGGACGT-3'). The pH0709 DNA was 
digested with restriction enzymes AatJl and BetKll, and 
thk large fragment was purified and ligated with the 
cassette. The ligation products were electrotraiuiformed 
into XL 1 -Blue c^lls for phagemid preparation in 2 
alitjuots, yielding 1*6x10 and 10x10* independent 
transformante. 



(c) Combinatoriul hGH libraries 

J)NA from the helix- 1 (called HI) library (randomly 
mutated at FIO, M14, H18 and H21) and the helix-4B 
(ca.lcd H4b) library (randomly mutated at R107. D171, 
T175, 1179) in which the E174S/F176Y mutations were 
fix(Hl, was isolated after binding selections fur 0, 2 or 4 
rounds on hGHbp-beads (Low^man el aL, 1991a). The 
DXA from each pool was purified and digested with the 
restriction enzymes Acc\ and RstXJ. The large fragment 
from each HI pool was purified and ligated with the small 
fragment from each H4b pool to yield the 3 combinatorial 
libraries H].0/H4b.O (unselected HI and H4b pools). 
H].2/H4b.2 (twice-selected HI pool with twice- selected 
H4b pool), and H1.4/H4b.4 (4-time8 selected HI pool 
with 4-times selected H4b pool). 

The ligation products were processed and electro- 
transformed into XLl-Blue^ cells. The number of inde- 
pendent transformants obtained from each pool was 
determined by colony- forming units (c.f.u.) as follows: 
2-4x10* for the HI*0/H4b.O library, 1-8x10* for the 
H1.2/H4b.2 library, l-6x 10* for the*H1.4/H4b.4 library\ 
hGH -phagemid particles were prepared and selecteil for 
hG Hp-binding over 2 to 7 cycles as described (Low man el 
at. 199La). 

We combined the 3 highest affinity variants of hCH 
that were isolated after independent sorting from the H I 
an«l H4b libraries (Ix)wman ei a/., 1901a). From the Hi 
library these included FU)H/M14G/H18N/H21N (variant 
HI. 4 A which corresponds to clone A isolated after 4 
rounds of selection from the HI library), F10A/M14W/ 
H18D/H21N (variant HL4B), M14S/H18F/H21L 
(variant H1.6B). From the H4b library these included 
R 1 67X/D 1 7 1 S/E 1 74S/F 1 76 Y /1 1 79T (variant H4b.4 A), 
RI67K/I)I7IS/EI74S/F176Y (variant H4b.6K) and 
Rl67^7D17l^7E174S/F176Y/T179T (variant H4b.6B). 
h(»H-phagemid DNA was purified and digested with the 
restriction enzymes EcoK\ and ButHl, The large fragment 
frcm each H4b variant was then purified and ligated with 
thi* small fmgment from each HI variant to yield a 
recombinant with mutations in both helix- 1 and helix-4. 

Oligonucleotides were used to revert many of the 
phage-<ierived mutations in the H1.4B/H4b.4A variant to 
thii corresponding wild-type residue 797 (S'-GTGCGTGC- 
TC'ACCGTCTTCACCAGlTGGCXmTG-:^) for D18H/ 
N21H; 798 (5'-GTCAGCACATTCCTGCGCACC-3') for 
Y176F; 799 (5 -CTCTCGCGGCrrCTTCGACAACXJ(XJAT(2- 
CTGCGTGCT-3') for A10F/VVI4M; 800 (S'-TAOTGCniX;- 
A( J G A AGG ACATGGACA AGGTCAGC-3') for X 1 67R/ 
S171D); 801 (5'-CTGCGCATCGTGCAGTGC-3') for TI79I: 
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875 (5^CTCrraM(iG(^(^TTOGACAACCCGTGG-3') for 
AlOF). 

The hGH variant (Hl.4B/H4b.4A/MI.5B/LI .6E) was 
constructed using H1.4B/H4b.A as t^^mplate and the 
following oligonucleotides: 843 (S'-CAGACCTCCCTCTGT- 
m.rrCAGAGTGTATTCXK;-3') for adding F54H; 844 
(5'-AC:ACCCTCC A AC A AGG AGG AAAC ACAACAG-3') for 
R64K: 846 (S'-CCAAAGGAACAGATTCATTCATTCTGG- 
TGGAAaXXX?A( J ACXTCXI-S') for k41 1/ Y42H /L45W/ 
Q46VV. Variant MI.5B/LI.6E was constructed using the 
same oligonucleotides with template phGHam-g3. 

(d) HadiO'immunoprecipiiation assays 

To determine the equilibrium binding affinity for the 
h(^Hbp, hGH variants were assaved in competition with 
*"T-labeled hGH. or the labeled^H1.4B/H4b.4A variant, 
or the labeled HL4B/H4b.4A/ML5B/LI.6E variant, in 
binding buffer that contained 50 mM Tris (pH 7-5) 10 mM 
MgC^lj, 0*1% (w/v) bovine serum albumin, 0*02% (w/v) 
sodium azide (Lowman et al., VMUb). Although these 
mutants contain the F176Y mutation, control experi- 
ments showed that iodination did not affect their binding 
affinities. Immunoprecipitation of the hGH*hGHbp 
complex (Cunningham et al.^ 1991a) was carried out using 
MAb5 (Barnard et al., 1984). 

(e) Kinetics tissays 

Association and dissociation rate constants for binding 
of hGH variants to the hGHp were measured by surface 
plasmon resonance (SPR) on a Pharmacia BLAcore™ 
(LofAs & Johnsson. 1990). The S2I0C mutant of the 
hGH bp was immobilized on the biosensor chip via its free 
thiol group (Cunningham & Wells, 1993). This uniformly 
oriented the hOHbp and prevented receptor dimerization 
on the matrix so that site I binding kinetics could be 
tested alone. Binding and elution steps were carried out at 
u flow rate of 3 to 20 /il/min in PBS buffer (pH 7-4) 
containing 0-05% (w/v) Tween-20, 

The density of the hGHbp coupled to the matrix can 
affect the absolute k^^ ^ou values by up to 2 -fold for 
wild-type hGH. Thus each biochip was standardized with 
wild-type hGH, so that relative changes in k^^ and k^f 
values between different mutants could be compared more 
accurately. Calculated affinity measurements correlated 
well with the results of the radio-immunoprecipitation 
assay (Cunningham & Wells, 1993). Dissociation rate 
constants were obtained by plotting in {RJR^) versus t\ 
association rate constants were obtained by plotting 
Islo}^ of (d/2|/dO versus H^] against hormone concentration 
(Karlsson et oi.. 1991), or by plotting ln(6RJdt) against 
hormone concentration using the BTAcore™ kinetics 
evaluation software Pharmacia Biosensor Manual). 
Equilibrium dissociation constants, K^b. were calculated 
*s k^ul^an- Standard deviations, o^^ for k^^ and a^^ for k^ft, 
were obtained from measurements with 2 or more series of 
2-fold or 3-fold dilutions (k^^) or with 2 or more concen- 
trated (>5 /4M) hormone samples {k^() (for a discussion 
see Bevington, 1969). 



3. Results 

(a) Residues in tke hGH -receptor binding epitope 

Structural analysis of the hGH(hGHbp)2 complex 
(de Vos et at., 1992) identified 31 side-chains in site 
1 of hOH that undergo some degree of burial when 
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Table 1 

Receptor binding affinities for alanine 
mvXaTds of hGH 



Variant 


Number uf 
van der Waals 

• 

contacts 






hGH (wild-type) 




340 


(1) 


K41A 




NE 


NE 


K4lQt 


7 


880±84 


2-6 


Y42A 


30 


540 + 80 


1-6 


L4oA 


7 


3400 + 330 


10-0 


Q46A 


16 

• 


320± 20 


09 



Reireptor binding affinities for alanine mutants of hOH 
measured by BIA(50re^^(t) or by RIA and normatized relative to 
the RIA value for wild-type HGH (Cunningham tt uL, 1989). In 
the case of K4I , the AJa variant did not express (N£), and so the 
Gin substitution was tested (Jin o/., 1092). For comparison 
with the structural epitope, the number of van der WaaU con- 
tacts with receptor is also shown, derived from the crystal 
structure of the hGH(hGHbp)2 complex (de Vos et al„ 1992). 



the first receptor binds (Fig. 1(b)). Although most of 
these were tested as alanine mutants prior to the 
structural elucidation (Oinningham & Wells, 1989, 
1991), four residues (K41, Y42, L45 and Q46) in the 
first minihelix (minihelix-l ) were not evaluated. We 
therefore converted these residues singly to alanine 
and measured the effects on binding affinity either 
by competitive displacement with [*^^I]hGH and 
immunoprecipitation (Cunningham & Wells, 1989) 
or using the BIAcore™ biosensor from Pharmacia. 
Both methods gave comparable affinity measure- 
ments (Cunningham & Wells, 1993). 

The side-chains of Tyr42 and G1n46 make many 
van der Waals contacts with the hGHp, yet alanine 
replacements caused less than a twofold reduction 
in affinity (Table 1). The L45A mutant causes a 
tenfold reduction in affinity even though Tjeu45 
makes fewer contacts with the receptor than either 
Tyr42 or Gln46. Lys41 makes a salt-bridge with 
Glul27 of the receptor. Although the K41A mutant 
did not express well enough to obtain material for 
an affinity measurement, we were able to express a 
more conservative variant, K41Q. This had a 
2-6-fold lower affinity than wild-type hGH. With 
these data and those of other studies (Cunningham 
& Wells, 1989, 1993) the binding effects have been 
measured for at least one replacement (mostly 
alanines) for all residues whose side-chains become 
buried when the first receptor binds at site 1 
(Fig. 1(a)). 

(b) Design and analysis of random mutant libraries 

We sorted five separate libraries (Fig. 2) in which 
four residues within the structural or functional site 
1 epitope (Fig. 1(a), (b)) were randomly mutated. 
Restricting each library' to four random codons 
allowed a reasonable sampling of the ~2 x 10* pro- 
tein sequences generated from I x 10^ DNA 
sequences within an average library that contained 
^h'K 10^ independent transform ants. 



Helix-4a 

(2 milt) 5.6 X 
I 

Helix-^b 

(5 mm) 8.3 X 
I 



Helix I 

(4imit)3.3x 
I 



1 

H1.4B/H4b.4A 

(9 rnnt) 32 x 

I 



Minihelix- 1 

(4 oiut) 4 J X 
I 



1 

M1.SB/L1.6E 

(6 mut) 54 X 
I 



Loop-A 

(2inta)4.8x 

I 



I 

HL4B/H4b.4A/M1.5B/LL6E 
(15 mut) 380 X 

Figure 2. Strategy for combining mutations selected 
from various libraries by phage display that increase 
receptor binding affinity. Each iibraiy is indicated' 
followed by the number of substitutions for the highest 
aifinity variant isolated (in parentheses) and the increase 
in binding affinity over wild -type hGH. Libraries were 
produced by randomly mutating 4 residues at a time as 
follows: helix-1 [FIO, MI4, HIS, H2t]; minihelix-l [K41, 
Y42, L45, Q46]; loop-1 [F54» E56, T58, R641; helix-4a 
[K172, E174, F)76, R178]; helix-4b [B167, D171, Ti75, 
TI79]. These were sorted for binding to the hGHbp and 
the tightest binding variant was combined with those 
from other libraries to produce even higher affinity 
variants as described in the text and Table 4. 



Previously, a library (called helix-4a; H4a) was 
produced in which residues K172, E174, F176 and 
R178 were randomized and displayed on mono- 
valent phagemid particles (Lowman et al., 1991a). 
After only one cycle of binding selection, a tighter 
binding mutant (E174S/ri76Y, called F!4a.l0) was 
isolated with an affinity about fivefold higher than 
wild-type hGH. The double mutant was fixed in a 
second library (called helix-4b; H4b) in which R167, 
Dill, T175 and 1179 were mutated simultaneously. 
After four rounds of selection we isolated a penta- 
mutant (R167N/D171S/E174S/F176Y/T79T, called 
H4b.4A) that bound about eightfold tighter than 
wild- type hGH. In a separate library (called helix- 1, 
HI) residues FIO, M14, H18 and H21. were 
randomly mutated. After 4 rounds of selection we 
isolated a tetramutant (F10A/M14W/HI8D/H21 N, 
called H1.4B) that bound threefold tighter than 
wild -type hGH. 

Here, the phage selection studies were extended 
to the four contact residues in minihelix-l (K41, 
Y42, L45 and Q46). These residues were randomised 
and representative clones were sequenced after two 
to seven rounds of binding selection (Table 2). Some 
residues were strongly selected at given positions 
compared to what was expected from a random 
distribution in the starting library. For example, 
about 35% of the clones contained a Q46W 
mutation. This was 7-6 standard deviation units 
(7*6<t) above a random chance occurrence for Trp 
in the library. This way of scoring the pool of 
selectants establishes a consensus sequence 
by accounting for the expected codon bias and 
sampling statistics. Using this criterion there was a 
inild preference for K41R (3-7a), a slight preference 
for Y42R (2a) or Y42Q (2a), a strong preference for 
|yl5W (4-8a) or L45 (4-5a) and a stronger preference 
for Q46VV (7-6(f). 
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Table 2 

Ctmsenstis residues identified after sorting 
hOH-phagemid libraries 











n 
"f 




Mimnelix- 


1 










n.41 


R 




0*071 




3' 7 




F 

Mi 


0031 


0*042 


012 


2-0 


Y42 


R 




0K)71 


024 


20 




Q 


0031 


(y042 


018 


2-0 


M5 


w 


a031 


0-042 


024 


4-8 






CK)94 


0-071 


0-41 


4-5 


Q46 


w 


O031 


0-042 


0-35 


7-6 




F 


(H)31 


0 042 


012 


2-0 




Y 


0031 


0-042 


012 


2-0 


Ltop-I 












F64 


P 


0-062 


xrihlf 


U* # o 


14*1 




D 


a031 


LrO.>4 


/k. 1 o 


4*7 




W 


0-031 




rwiQ 


4'i 




Y 


0031 


t\J\OA 




Z'O 




1 


0031 










V 


0-062 


IKM7 


O-Zo 


3*5 


R4(4 


K 


0K)31 


0-034 




22*8 


Ccmbinatorial (helix-l/helix-4b) 








F)0 


A 


0K)62 


003 


0-41 


12 0 




F 


0O31 


002 


0-25 


104 




H 


0*031 


002 


016 


6-2 


MI4 


W 


0-031 


0O2 


026 


111 




S 


0094 


0O4 


026 


4-8 




Y 


0-031 


V/ v/^ 








N 


0-031 


0O2 


009 


2-7 




H 


0O31 


002 


O07 


20 


HIS 


D 


0-031 


O02 


043 


18-8 




F 


0K>3\ 


002 


0-12 


4-1 




N 


0-031 


O02 


OlO 


34 


Hil 


N 


0-031 


0K)2 


046 


202 




H 


0031 


O02 


013 


4-8 


R167 


X 


0O31 


002 


063 


25-6 




K 


0O31 


O02 


013 


41 


D171 


S 


0O94 


O04 


064 


14-1 




1> 


0H)31 


O02 


014 


4-8 




N 


OOdl 


002 


013 


4-1 


T)75 


T 


0O62 


O03 


10 


291 


1179 


T 


0O62 


O03 


066 


186 




N 


0O31 


O02 


013 


4-1 



The most frequently occurring residues from phage-displayed 
libraries are shown, based on fractional representation (Pf) 
atioQg all clonefi isolated after 2 to 7 rounds of binding selection. 
Expected frequencies (P,) were calculated from the number of 
NNS codons for each amino acid theoretically in the starting 
111 irary. Standard deviations (a^) were calculated as <;^„ = 
[F.<l-P.>/n]*'*. Only residues for which the fraction found 
exceeded the fraction expected by at least 2(r^ are shown (i.e. 
[{Pf-P^)/o„]>2). For the minihelix-l library, n=I7 sequences; 
loi>p-l library, n=26; combinatorial library (helix- l/helix-4b), 
n«68. Twelve of the sequences from tfce combinatorial library 
contained a wild-type helix-4 region because of religation of the 
heUx-1 vector (see Material and Methods). 



A second library (called loop-1) was constructed 
in which F54, E56, 158 and R64 were randomly 
mutated. Alanine replacements at any of these 
residues caused a 4 to 20-fold reduction in affinity 
(Fig. 1(a)). Despite the fact that R64 is the only one 
of this group to make direct contact with the 
receptor (Fig. (b)), all positions showed a moderate 
to very strong preference for a residue that was 
usually different from the wild-type. R64K was the 



Table 3 

Binding data for hGH variants isolated from the 
minihelix'l (Ml) or the loop^l (LI) libraries 



Cllone 



Minihelix-1 

hGH 

Ml. 3 A 

M1.3B 

M1.3C 

Ml. 3D 

MI.3R 

M1.3F 

Ml. SO 

M1.3H 

M1.5A 

M1.5H 

M1.5C 

M1.5D 

M1.5E 

M1.7A 
M1.7B 
MI 7C 
M1.7D 
Consensus: 



Clone 



Loop-1 



Residue position 



41 42 45 46 



lieaidue position 
54 56 5d 64 



Aa(hGH) 
A'4(mut) 



K 


Y 


u 


Q 


340 


1 


V 


S 


L 


w 


190 ±26 


1-8 


L 


u 


L 


w 


190 ±23 


1-8 


F 


R 


L 


Y 


160±23 


2-2 


V 


F 


L 


H 


190 ±19 


2-3 


A 


I 


Q 


W 


ND 


ND 


L 


Y 


V 


R 


ND 


ND 


Y 


W 


G 


Y 


ND 


ND 


F 


L 


V 


L 


ND 


ND 


G 


T 


VV 


T 


270 ±80 


1-3 


I 


H 


W 


VV 


76 ±29 


4-5 


R 


R 


L 


F 


ND 


ND 


M 


R 


W 


R 


ND 


ND^ 


R 


T 


A 


V 


ND 


ND!| 


K 


Q 


L 


W 


I40±20 


2-4 


R 


Q 


I. 


W 


140-1-20 


2'4t 


R 


T 


A 


V 


ND 


NDII 


R 


S 


W 


F 


ND 


ND 


R 


R 


W 


W 








Q 


L 









hGH 


F 


£ 


J 


R 


340 


1 


LI .3A 


P 


D 


T 


R 


210±110 


1-6 


L1.3B 


P 


Y 


I 


K 


170+30 


20 


LI.3C 


H 


W 


L 


K 


83±25 


4-2 


LI .3D 


M 


R 


L 


K 


ND 


NDt 


L1.4A 


G 


W 


V 


R 


660 ±140 


0-50 


L1.4B 


F 


W 


V 


R 


630±120 


0*53 


LI .40 


S 


H 


L 


K 


620±120 


o-5e§ 


L1.4D 


P 


W 


L 


R 


520 ±100 


0-67 


L1.4E 


P 


L 


D 


K 


460±100 


0-74 


L1.4F 


P 


T 


V 


K 


250 ±40 


1-4 


LI. 40 


P 


Y 


1 


K 


I70±30 


20t 


L1.4U 


P 


L 


Q 


K 


I20±30 


2-8 


LI. 41 


P 


D 


T 


K 


61 ±8 


56 


L1.4J 


P 


T 


P 


K 


ND 


ND 


L1.4K 


P 


A 


L 


K 


ND 


ND 


L1.4L 


P 


C 


1 


K 


ND 


ND 


L1.6A 


R 


D 


I 


R 


350 ±250 


10 


L1.6B 


P 


T 


V 


K 


250 ±40 


l-4t 


L1.6C 


P 


D 


1 


K 


I80±40 


1-9 


L1.6D 


P 


Y 


1 


K 


170 + 30 


201 


LI .6E 


P 


E 


1 


K 


73+16 


4-8 


L1.6F 


P 


K 


1 


K 


73±I6 


4-Bt 


L1.6G 


P 


D 


T 


K 


61±8 


5-6t 


L1.6H 


E 


W 


V 


K 


ND 


ND 


LL6J 


P 


M 


V 


K 


ND 


ND 


L1.6J 


P 


L 


Q 


K 


ND 


ND 


Consensfis: 


P 


D 




K 










W 











The nomenclature indicates the genealogy of the variant by 
designating its parental library (Ml. or Li.) followed by the 
number of enrichment cycles after which it was asutated, and a 
letter for each sequence. Affinity constants were mechsured by 
RTA (C:h2nningham & Wells, 1989). The increase in affinity over 
hGH for binding hGHbp is shown as /irj{hGH)//f^(mutant). Some 
clones were not anal3^cid (NO). Identical affinities were assumed 
for equivalent variants (t). Clones with spurious mutations are 
indicated (E65Vt; S57Y§; N47YT; P48S||). 
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most preferred (found in 81 % of the clones); it is 
known that R64K alone causes a '^threefold 
improvement in binding affinity (Cunningham et al., 
1990). After this the order of preference was 
F54P>T58>E56D or W. 

We analyzed the binding affinities for 27 of these 
mutants by expressing the free hormone in a non- 
suppressor host which terminates translation at the 
amber codon at the end of hGH and the start of the 
gene ITI domain (Lowman et aL, 1991a). All eight 
mutants tested after three to seven rounds of 
binding selection from the miniheIix-1 library had 
affinities greater than wild-type hGH (Table 3). The 
best was K41I/Y42H/1>45VV/Q46W which was 4-5- 
fold improved in affinity over hGH. This DNA 
sequence is expected to occur randomly at a 
frequency of only one in a million clones. The fact 
that it was efficiently rescued demonstrates the 
power of the affinity selection. Of the 19 loop-1 
selectants tested 13 had affinities greater than hGH. 
The best isolates were F64P/R64K and F54P/ 
K56D/I58T/R64K which were both -^fivefold 
improved over wild-ty|)e. 

(c) Improving affinity using additivity principles 

According to additivity principles, mutations in 
non-interacting parts of a protein should combine to 
give simple additive changes in the free energy of 
binding (for review, see Wells, 1990). We therefore 
sought to improve hGH binding through site-1 by 
combining the substitutions isolated from phage 
display libraries (Fig. 2). The three tightest- binding 
vanants of hGH from the HI library (FI0H/M14G/ 
H18N/H21N, called H1.4A; F10A/M14W/H 18D/ 
H21N, called H1.4B; and M14S/H18F/H21L, called 
H1.6B) were joined to each of the three tightest 
binding variants found in the H4b library (RI67N/ 
D171S/Ei74S/F176Y/1179T. called H4b.4A; R167/ 
D171S/E174S/F176Y, called H4b.6E; and R167N/ 
D171N/E174S/F176Y/I179T. called H4b.6B). All 
variant proteins were obtained in yields 
approaching that of wild-type hGH except for those 
containing the H41.1 variant. The H1.4A variant 
alone or combined with mutations from the H4b 
library' migrated as dimers {M^ ^^44,000) in non- 
reducing SDS-PAGE and as monomers (il/, 
'^22,000) when reduced. Although these proteins 
did not contain an additional Cys residue, disulfide 
exchange could occur if they formed non-covalent 
dimers. HGH is known to form a weak dimeric 
complex involving residues in helices 1 and 4 (Cunn^ 
Ingham et al,, 19916). Because these proteins formed 
disulfide dimers we did not pursue them further. 
The H1.6B variant also generated a disulfide dimcr, 
but when it was combined with the three H4b 
mutants no dimer was evident. 

All the Hl/H4b recombinants analyzed showed 
cumulative increases in affinity over the parental 
components (Table 4). The H1.4B/H4b.4A variant 
had the greatest affinity which was 34-fold tighter 
than wild-typo hGH. The tightest- binding mutant 
from the Ml library (K41T/Y42H/L45W/Q46W. 



Table 4 

Dissociation constants for recombined hGH variants 
measured by HI A as described in 
Materials and Methods 



Variant name 




A'd( variant) 


hGH 


340±50 


1 


H1.4Bt 


100±30 


3-4 


Hl.GBt 


680±190 


0*5 


H4b.4At 


40±20 


8-5 


H4b.GEt 


40 + 20 


80 


H4b.6Bt 


m±30 


5-7 


Hl.4B/H4b.4A 


1()±3 


34 


H1.6B/H4b.4A 


11±3 


31 


HK6B/H4b.6E 


14±8 


24 


Hl.4B/H4b.6B 


16±5 


21 


H).6B/H4biJB 


21 ±11 


IB 


M1.5B/L1.6R 


7 9 ±2-4 


43 


HI.4B/H4b.4A/M1.5B/LI.6K 


O9±0-3 


380 



The fold improvement in binding affinity is expressed as 
A'4(hGH)/Kd(variant). Some affinities(t) from Luwman et ol. 
(1091a). Variants are designated by a name indioating the library 
(e.g. HI or H4b), the number of enrichment uydes, and a letter 
for each mutant. Individual variants from cai;h librarv are as 
follows: H1.4B = (F10A/M14W/H 18D/H21N), H1.6B = (MI4S/ 
H18F/H21L), H4b,4A (RI67N/D171.S/E174S/F17tty/I179T), 
H4b.6E = (R167E/D171S/E174S/F176Y), H4b.«B = (R167X/ 
D171N/E174S/F170Y/1179T). M1.5B = (K41 1/y42H/M5W/ 
Q46W), and L1.6E = (Kr>4P/Uf*4K). 



called 1M1.5B) and one of the tightest from the LI 
library (F54P/64K, called L1.6E) were: combined to 
produce the hexamutant, M1.5B/L1.6E, whose affi- 
nity was 43-fold higher than wjld-ty]>e hGH. This 
was put together with the H1.4B/H4b.4A recombi- 
nant to yield HI .4B/Hb4.,4A/Ml .5B/L1.6E which 
bound 38()-fold tighter than wild-tyfie hGH. This 
variant retained the wild-type residue at only 5 of 
the 20 positions randomized (£56. 158, K172, T17d» 
R178). 

From the product of the improvements in affinity 
by the individual components we expected this 
v^ariant to bind about 15()0-fold tighter than hGH. 
The fact that the expected and observed values do 
not agree precisely suggests that on interactions 
between the mutated residues affect the affinity. 
Nonetheless, when one compares the actual versus 
the expected decrease in the free energy of binding 
( — 3-6 versus —4-3 kcal/mol, respectively) the parity 
of these values is comparable to additivity experi- 
ments reported for many other pnjt-eins (Wells, 
1990). 

(d) Combinatorial libraries of hGH 

Tn some cases we have observed very non-addi- 
tive effects when mutations were introduced at 
neighboring residues (e.g. F176Y/E174S; Lowman et 
a/., 1991a). Therefore, because some side-chains 
from helix -1 can potentially contact residues in 
helix-4 we investigated a combinatorial approach 
(Huse et al,, 1989; Clackson et al. 1991) to sorting 
mutants derived from the HI and !E14b libraries. 
Independent binding selections were carried out on 



Table 5 



hGf^ mutarUs isolaied after binding selection of combinatorial libraries 

( see text for details ) 









Helix 1 






Httlix 4b 












F 


M 


IT 
il 


H 


K 


D 


T 


I 






(Ulrme 


P 

m 


10 


14 


18 


21 


UV7 


171 


175 


179 




AT^rmut) 






















A 


(V60 


H 


G 


N 


N 


N 


S 


T 


X 


XD 


«'8tll 


H 


O40 


A 


N 


D 


A 


X 


X 


T 


X 


r>o±40 






















A 


\t ■ » 


K 

X 


c 
n 


r 


G 


H 


s 


T 


T 


Xi> 




R 




H 


r\ 
V 


1 


S 


A 


D 


T 


T 

M 


XD 




c 


0*14 


H 




V 


N 


X 


A 


T 


T 


XD 




n 


014 




s 


F 


L 


S 


1) 


T 


T 


XD 




E 


0-14 


A 


R 


T 


N 










XD 




F 


0-14 


Q 


Y 


N 


X 


H 


S 


T 


T 


74±30 


46 


a 


0-14 


W 


G 


S 


S 










XD 














(Hl.2lH4b,2),2 (repeat) 










0*13 


F 


1 


s: 

£> 


S 


K 


X 


T 


V 


XD 




/ 


0-13 


w 




1^ 


S 


H 


s 


T 


T 


160±70 


21 


J 


013 


A 


X 


A 




X 


s 


T 


T 


XD 




A* 

Mm 


0-13 


r> 

m 


s 


D 


N 










XD 




L 


0-13 


H 


G 


N 


X 


X 


X 


T 


T 


XD 




M 


0-13 


F 


S 


T 


G 








— 


XD 




i\ 


0-13 


M 


T 


S 


N 


Q 


8 


T 


T 


XD 




O 


0-13 


F 


S 


F 


L 


T 


S 


T 


T 


XD 
















(IJL2lH4b.2),4 










J 

MM 


0'I7 


A 


w 




N 










100 + 30 


3-3t 


n 


0*17 


A 

** 


w 


D 


N 


H 




T 


X 


XD 




c 


0-17 


M 


Q 


M 


X 


X 


S 


T 


T 


XB§ 




D 


0-17 


H 


Y 


D 


H 


R 


D 


T 


T 


XD 




B 


017 






S 


H 










820 ±200 


0*4t 


F 


0-17 


L 


N 


S 


H 


T 


S 


T 


T 


34±10 
















(Hl.2lH4h^).6 










A 
£\ 


iroo 


A 
J\ 


\V 


D 


N 










I0O+30 


3'3t 


n 


013 


A 


\V 


D 


N 


X 


s 


T 


S 


XD 




c 


0-13 


A 


W 


D 


N 


K 




T 


T 


XD 




D 


(M3 


A 


T 


S 


X 


N 


S 


T 


T 


XD 




E 


0-13 


M 


A 


D 


X 


X 


s 


T 


T 


68 ±46 


5i>t, 


F 


0-13 


H 


Y 


1> 


H 


X 


s 


T 


T 


XD 














(HL2iii4b.2/.6 (repeal) 








r* 
\» 




A 


1 1 


• 

A 


S 


X 


s 


T 


T 


XD 




u 

n 






s 


L 


A 


X 


8 


T 


T 

1 


XD 




/ 
• 




u 


Y 


D 


H 


Y 


s 


T 




XD 




J 


0 13 


V 


L 


D 


H 


X 


s 


T 


T 


XD 




K 


0-13 


A 


VV 


D 


N 


X 


X 


T 


1 


XD*' 
















(ni.2lH4b,2).7 










A 




A 
/% 


vv 


i> 


N 


X 


A 


T 


T 






U 


ft- 1 7 


A 


w 




N 












3-3t 


c 


0O8 


A 


w 


D 


X 


N 


s 


T 


X 


XD 




D 


008 


A 


w 


n 


X 


R 


X 


T 


T 


XD 




R 


0-08 


A 


w 


D 


X 


K 


8 


T 


S 


XD 




F 


008 


F 






0 










XD 




G 


0-08 


1 


Q 


£ 


H 


X 


S 


T 


T 


I0±]0 


21 


// 


008 


H 


Y 


D 


H 


X 


S 


T 


T 


XD 














(HI 


\2IH4h,2).7 (reipeut) 








/ 


050 


F 


s 


L 


A 


N 


s 


T 


V 


32±5 


11 


J 


0-25 


A 


H 


A 


-s 


X 


s 


T 


T 


XD 




K 


0-13 


A 


W 


D 


N 


A 


X 


T 


T 


XD 




L 


0-13 


H 


Y 


U 


H 


Y 


s 


T 


S 


XD 
















(ill.4/II4bJ),4 








23} 


A 


0-67 


F 


R 


K 


h 


K 


1> 


T 


T 


150 ±70 


B 


frl7 


F 


S 


F 


L 


X 


s 


T 


T 


11±3 


31 Ot 


C 


0-17 


M 


A 


D 


N 


X 


8 


T 


T 


68±46 


60t 



All variants contain (K1745S/F176Y), except for those with the wild-type helix 4 aequence ( — ), 
which were non'recombiiiaiita. Libraries are designated by the original helix library followed by the 
number of independent enrichment cycles. For example, {Hl-2/H4b.2).4 indicates that helix-J and 
helix'4b libraries were sorted independently for 2 rounds (Hl.2/H4b.2). DXA pools were recombined. 
and the combinatorial library was sorted 4 more times (.4). '^repeat** indicates that the library was 
reconstructed and sorted independently. The clone letter identifies the specific mutant. indicates the 
fractional occurrence amonj;; sequenced clones. Some affiAitie^ are from howman et al. {lOOlo); 
equivalent variants are assumed to have identical affinitics(t)- Several variants appeared as > U)% 
disulfide dimers(t). One clone contained an amber (TA(i( = (jlji in SupS strains) codon(§), one contained 
a spurious mutation. El74X(1!). and one(||) contained 2 mutations (Ll5Ii/Klfi8R). Some variants were 
not expressed (XE> or not analyzed (XD). 
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the H 1 and H4b libraries for 0, 2 or 4 cycles. DNA 
from the HI ftool was ligated together with DNA 
from the H4b pool that was sorted for binding to 
the hGH bp for the same number of rounds. The 
three combinatorial libraries (H1.0/H4b.O, HI. 2/ 
H4b.2 and H1.4/H4b.4) were then sorted an addi- 
tional two to seven cycles and 68 clones were 
. sequenced (Table 5). 

Overall, the sequences of the highest affinity 
variants isolated from any of the combinatorial 
libraries resembled those assembled by independent 
sorting of the HI and H4b libraries (Lowman et aL, 
1991a) and recombination described above. For 
example, the highest affinity mutants isolated 
previously from the HI library were F10A/M14VV/ 
H18D/H21N (H1.4B) and FIOH/M 14G/H18N/ 
H21N (H1.4A) which bound about 3*3-fold and 
2-4-fold tighter than wild-type hGH, respectively. 
The H1.4A sequence was recovered in 60% of the 
clones from combinatorial library (HI .0/H4b.0).4, 
and in 13% of the clones isolated in early rounds of 
sorting from combinatorial library H1.2/H4b.2. The 
H1.4B sequence predominated in later rounds of 
sorting the H1.2/H4b.2 library. Most of these were 
independent clones (not siblings or contaminants) 
because they had different DNA sequences and 
usually differed in the mutations selected in helix 4. 

Similar results were obtained with selectants in 
helix 4. When the H4h library was independently 
sorted, many sequences containing R167N, D171S 
or N, T175 and T179T were selected (Lowman ei al,, 
1991a). These were the same residues that tended to 
be selected in any of the combinatorial libraries. In 
fact, one of the best mutants previously isolated 
(R1671^/D171S/T175/T179T) was comm'^only iso- 
lated by combinatorial sorting and predominated in 
the later rounds. 

Some sequences sorted by combinatorial means 
were very different from ones selected from the two 
independent libraries, perhaps for st-atistical 
reasons. For example, the HI and H4b libraries 
contained about 10^ different DNA sequences, and 
if combined (without pre-selection) would contain 
10*^ possible combinations. Transformation effi- 
ciencies limit the sampling size to ^10^ indepen- 
dent clones. Thus, the selection of the same 
sequences is remarkable given the high diversity of 
sequences possible in these libraries and the mild 
increases in affinity being selected for. 

We measured the affinities for a number of the 
isolates from the combinatorial libraries (Table 5). 
All had improved binding affinity (2 to 31 -fold) 
compared to wild -type hGH, Most were improved 
over E174S/F176Y which was present in all the 
starting clones, and independently caused a 5-6-fold 
increase in affinity over wild -type hGH (Lowman et 
aL, 1991a). The highest affinity variants were 
generally isolated from later rounds of sorting and 
were highly abundant in those pools. For example, 
one of the highest affinity mutants we tested was 
clone (H1.2/H4b.2).7A which was isolated after 
seven rounds of sorting of H1.2/H4b.2 library. This 
isolate represented a third of the clones in that pool. 



Remarkably, this clone is identical to the 
H1.4B/H4b.4A variant (Table 4) except that 
instead of D171S it contained the conservative 
substitution, DI71A. Not surprisingly, these 
variants bound with virtually the same affinities 
(12 pM and 10 pM, respectively. In this case, the 
combinatorial and additive strategies yield compar- 
able solutions for successful optimization of affinity. 

(e) Testing the importance of individual side-chains 

in affinity Tnaluration 

To evaluate the contribution of mutated residues 
to the binding affinity we introduced them into 
wild-type hGH, or converted them back to the wild- 
tvpe residue in some of the affinity improved 
variants (Table 6). The K41I/Y42H/L46W/Q46W 
tetramutant bound 4-5-fold tighter than wild-type 
hGH. Yet, each of the single mutants in hGH 
caused 1-7 to 2-5-fold reductions in affinity. This 
indicates that the combination of mutations at this 
site is critical for the affinity improvements. These 
residues lie on adjacent positions on one face of the 
minihelix-1. Non-additive effects are well docu- 
mented for neighboring residues in other proteins 
(Wells, 1990) as well for hGH (Lowman et al., 
1991a). 

Affinity improvements caused by substitutions in 
the H1.4B/H4b.4A variant were tested by mutating 
them back to the wild-type residue either indivi- 
dually or in adjacent pairs (Table 6). This showed 
that seven of the nine substitutions contribute only 
small or virtually undetectable improvements in 
binding (1-1 to l*7-fold). Even the most dominant 
effect, F176Y, imparts on)y a 4-6-fold improvement 
in binding affinity. Nonetheless, the product of 



Table 6 

Importance of individtial side-chains in mrums hGH 
mutants (described in Tables 3 and 4) for binding to 

the hOUbp 

K^{pyi) Xd(hGH) /Ca(Kl.4B/K4b.4A} 



Point mutants in wild-type background 




hGH (wild-type) 


340±50 


1-0 




K4llt 


580±140 


1-7 




Y42Ht 


860±50 


2-5 




L46\Vt 


722 ±60 


21 




Q46\Vt 


780±100 


2-3 




Revertante in H1.4B/H4b.4A background 




(Hl.4B/H4b.4A) 


10±3 




l-O 


T)18H/N21H 


12 + 9 




M 


A10F/W14M 


13±5 




1-2 


AlOFt 


13±4 




1-3 


N167R/S17ID 


17±8 




1-6 


T179I 


18±9 




1-7 


YI76F 


49±21 




4-6 



Binding afinitieft were measured by BIAcore^ (f) or by RIA 
and normalized to the RIA value for hGH determined previously 
((Cunningham & Wells, 1989). The fold decrease in affinity is 
expressed as /f^ (re vertant)//rd( parent), where the parent is the 
background used for mutagenesis. 



# 



High A ffinity Variants of hGH by Phage Display 



thtse effects in the octamutant, F10A/M14W/ 
H 1 8D/H2 1 N/R 1 67 N/ D 1 7 1 S/F 1 76 Y/T 1 79T (not 
tested here), predicts a 16-fold improvement in affi- 
nity versus wild-type hGH. This compares to the 
34-fold enhancement measured for the H1-4B/ 
H4b.4A variant. This simple additivity calculation 
is in reasonable agreement with experiment 
considering that S174H], which was not tested in the 
H1.4B/H4b.4A context, was expected to enhance 
affinity an additional two- to threefold. Thus, the 
affinity improvements come by a series of small 
eff<^cts that combine to yield large affinity 
enhancements. 

(f) Effects of affinity maturation on the hitieiics 

of binding 

""^o better understand the molecular basis for 
affinity improvements selected here, we used the 
BTAcore^** to measure the kinetics of binding to the 
hGHp (Table 7). Tn general, as we increased 
the affinity from wild-type hGH, the off-rat-e 
decreased with little change in on-rate. In fact, in 
going from wild-type to the highest affinity mutant. 
H1.4B/H4b.4A/M1.5B/l.l.ttE, there was more than 
a BO'fold decrease in the off-rate and only a fourfold 
increase in the on-rate. (For this mutant the off-rate 
wa-.* too slow to measure accurately* but if we calcu- 
late* it from the on-rate and the measured hy 
RTA, the off-rate would be 100-fold slower than 
wild- type hGH.) The fact that off- rate is most 
affected among the phage selectants suggests that 
the sorting was performed under conditions 
ap])roaching equilibrium. We had previously 
recruited the hGH binding site into hPL which 
differs in sequence by ^^15% from hGH and binds 
-2000-fold weaker (Lowman et al.. 19916). The 
recruited hPL variant has kinetic parameters for 
binding that are similar to hGH (Table 7). Relative 
to wild-type hPL, this mutant shows much larger 
improvements in off-rate ('^100-fold) compared to 
on-rate (-«- 10-fold). 

Table 7 



Bindiny kinetics of hGH variantn 













.Mutint 


.M ' s - » 








hPL 


3-2 


cooot 


isoo 




hPL varuint 


43 


49 


II 


0-79 


hCF 


24 


34 


1-4 


1 




13 


60 


052 


2-7 




1\ 


6*6 


0-3 1 


4-5 


ML.'iK/ia.Uli: 




61 


0-14 


10 


HI.-.H/H4b.4A 


20 


30 


O-lf) 


9-3 


Hl.^U/H4b.4A/ 


98 




S 0-006 


^230 



MI.;%K/LI.6B 



Kinetic* (ronstants were det-ermined using a HL Acore™ in which 
the {S20lC)hCi!Hbp wom iinniobilizei] on the biosc^nijor chip in PBS 
huffrr+O-On^'o Tween-20. The is calculated from k^fjk^^. 
except for hPL. for whiuh and were measured and k^f 
r»lcti]ateti(t). The ratio of vahies indicates Lhtr fold increase in 
binding affinity ver^w/ \vt-hOH. hGH mutants are detvcribed in 
Table 4 and the hPl^ variaiH eonlain.«i V4l/D56K/M<S4H/'H:i74A/ 
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4. Discussion 

(a) Mmiovaieni phage display is a powerful 
selection procedure 

We randomly mutated regions of hGH which 
were thought to be important either because they 
were in contact with the receptor or because when 
converted to alanine they affected hinding affinity. 
Thus, an average random mutant from these librar- 
ies should be dramatically reduced in binding affi- 
nity from wild-type hGH. Yet clones isolat^ed after 
only a few rounds of selection bound with similar 
and often higher affinity than wild-type hGH. These 
usually exhibited consensus sequences that were 
different from the wild- type residue (Table 2). 

Small improvements in affinity led to rapid and 
almost exclusive convergence in these libraries. For 
example, the R64K mutant binds only three 
times tighter than wild-type hGH (Cunningham ei 
aL, 1990). After just three cycles of binding selec- 
tion, R64K dominated the library (Table 3). 
Similarly, T170T contributed only a 1-7-fold 
improvement in affinity (Table 6). However, when 
sorted separately in the H4b library (lx>wman et aL, 
1091a) or com binato rial ly with mutants in HI 
(Tables 2, 5), T179T was selected almost exclusively. 
Strong selection for these slight improvements in 
affinity emphasize the power of this technique for 
rescuing the highest affinity variants in the pool. 

The phage display method has important limita- 
tions also [for reviews, see Smith, 1991; Wells & 
lx>wman, 1992). For example, not all variants w^ill 
be displayed on the phage because they may be 
either digested by proteases, aggregated, miafolded 
or blocked in secretion or assembly on phage. 
Although a strong bias against particular DNA 
sequences is unlikely, there is a clear selection 
against C'ys- containing mutants. This has been 
previously noted for mutants of hOH (Lowman et 
at., 1991a) as well as other proteins (Matthews & 
Wells, 1993) and may be caused by an odd thiol 
fouling the disulfide bonding scheme in the selected 
protein binding domain, or in the gene III protein 
itself which also contains disulfides (van Wezenbeck 
et al,, 1980). Limits in Iransfection efficiency restrict 
the number of possible sequences one can screen so 
one should be selective about the codons randomly 
mutated. 

(h) Simple axiditivity dominates affinity maturation ^ 

Additivity principles (for a review, see Wells, 
U)90) suggest that when residues are in close 
contact, the sum of the free energ^*^ effects for the 
components may not equal the combined mutant 
because the mutant side-chains can interact intra- 
molecularly, ac<!ording to the following equation: 

AAr?AB = AA^A + AACb + A AG, ( 1 ) 

where AA6\b is the change in free energy for the 
double mutant, AA6\ and AAO^ are the changes in 
free energy for the two single mutants, respectively, 
and AA(?| is the change in the interaction free 
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energy between the two residues. When the residues 
do not contact, they act independently, and thus 
the component residues should contribute simple 
additive eflFects on the binding free energy (i.e. 
AAGi = 0). Studies of multiple mutants of hGH, 
hPRL and hPL have shown that this relationship 
holds for many substitutions throughout the hGH 
molecule (Cunningham et al., 1990; Cunningham & 
Wells, 1991; bowman et aL, 19916). 

Residues on the same face of an a-helix were 
mutated simultaneously because some side-chains 
displayed trom them couJd contact one or more of 
the others. In fact, several examples of non- 
additivity (i.e. AAGi not zero) were seen within a 
given library. The K41I/Y42H/Iyt5W/Q46W 
mutant isolated from the Ml library bound 4*5-fold 
tighter than wild-type hGH (Table 3), However, 
when each of these residues were installed into wild- 
type hGH they caused 1*7 to 2'6-fold reductions in 
affinity (Table 6). A similar finding was reported for 
the F176Y mutant in the context of E174S/FI76Y 
(Lowman et al., 1991a). Despite these examples, 
simple additivity within each library was usually 
found. For example. F54P/E56D/r58T/R(VIK 
bound 3'd-fo1d tighter than the triple mutant, 
F54P/E56D/158T (Table 3). The R64K mutant 
binds -^threefold tighter than wn*ld-type hGH 
(Cunningham et at,, 1990) which can account for the 
difference between the tetra-mutant and the triple 
mutant. As another exam pie » the F64P/R04K 
double mutant bound 2'5-foJd tighter than the triple 
mutant F54P/E56D/R64K (Table 3). The D56E 
mutant in hPI- caused a 2-4-fold improvement in 
binding to the hGH bp (Lowman et al,, 19916). 
Further evidence for simple additivity is that 
residues at many positions of each library showed a 
consensus that was independent of variation at the 
neighboring residues. This argues that their impact 
on affinity was generally independent of the other 
residues that were mutated. Lastly, the highest 
affinity variants selected from the HI and H4a 
libraries were virtually the same whether sorted 
independently or sorted together (Tabic 5), This 
occurred even though some side-chains from the 
helix- 1 library interact with those in helix-4b. For 
example, His21 makes a hydrogen bond with 
Aspl71. 

Combining the affinity enhanced variants from 
separate libraries generally produced simple addi- 
tive improvements in binding free energ^'^ (Table 4). 
Thus, the modest gains in affinity (3 to 6-foId) from 
each of the mutant libraries accumulated to produce 
a combined variant with a 380-fold improved affi- 
nity. Each of the 15 mutant side-chains imparts 
only a small gain in affinity as evidenced by the 
small or sometimes insignificant reductions in affi- 
nity when various side-chains in the H 1 .4J3/H4b.4A 
mutant are converted back to the wild-type 
(Table 6). 

The small gains in affinity produced by each 
mutant side-chain can sometimes by rationalized 
from the structure of the bound complex and 
modeled side-chain replacements. For example. 



T179T may form a hydrogen bond to the main -chain 
amide of Gly168 in the receptor. I'he salt-bridge 
between Arg64 of hGH and Aspl64 of the receptor 
may be improved by substituting the shorter Lys64 
side-chain. This could reduce a repulsive interaction 
between Arg64 (N^hi) receptor Lysl67 (N,,), less 
than 4*5 A away. Of course without the structure of 
the bound and free components we cannot attribute 
the enhanced affinities to improvements in the 
packing of the bound complex or dest^biliKation of 
the free components. R<icently, the structure of the 
free affinity optimized mutant containing 15 substi- 
tutions has been solved (Ultsch et of., 1993). The 
overall structui^ is very close to that of wild -type 
hGH in the bound hGH(hGHbp)2 complex. 

The affinity enhanc*ements that occurred for the 
hGH variants compared to wild-type hGH, or for 
the recruited hPL variant compared to wild -type 
hPL, were caused largely by decreasing the off- rate, 
not increasing the on-rate (Table 7). This is not 
surpri.sing given the previous finding that the 
largest reduction caused by an alanine substitution 
in on-rate is only threefold, whereas the largest 
reduction in off-rate is about 30-fol<l (Cunningham 
& Wells, 1993). We have suggested this is because 
there are many pathways leading to associati<m, but 
there is likely to be only one bound complex. 
Analysis of a mutant antibody that is improved in 
affinity similarly shows the off-rate is decreased 
with little effect upon on-rate (Marks et aL, 1992). 

Northrup & Erickson (1992) have suggested that 
association rate is controlled by diffusion and long- 
lived collisions that result from Brownian dynamic 
effects. While these factors arc largely independent 
of side-chain sub.stitutions, we estimate that posi- 
tively^ charged side-chains at the site 1 epitope can 
contribule up to a factor of 20 to the association 
rat« (Cunningham & Wells, 1993). Consistent with 
these electrostatic effects, the recruited hl'L variant 
(V41I/D56K/M64R/E1 74A/M 1 791). which associ- 
ates tenfold faster than wild-type hPL, ccmtains 
M64R and EI74A (Table 7). These make the 
hormone more electropositive at positions where 
alanine substitutions in hGH are known to afPect 
on-rate by about twofold each (Cunningham & 
Wells, 1993). Furthermore, the hGH mutant, K41I/ 
Y42H/L45W/Q46W, is reduced almost twofold in 
association rate which may be due to the K41 1 
substitution. The on-rate effects are more subtle and 
accumulate in a somewhat more complex fashion 
than the Qflf-rate effects. For example, most of the 
component mutants used to derive the H1.4B/ 
H4b.4A/Ml.5B/L1.6E variant (whose on-rate is 
about 4-fold higher than wild-type hGH) had on- 
rates that were leas than or equal to wild-tyfxs hGH. 



(c) Rekitianship of phage sorting to structural and 

functional epitopes 

Less than half of the side-chains that become 
buried at site 1 by the first receptor significantly 
affect binding affinity when converted to alanine 
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In (frequency of wt from phage) 

Figure 3. Relationship between the frequency that a 
wild- type residue is recovered after randomization and 
|>hag** wleetion ver^tit^ the extent to which the side-chain 
is buried upon binding the receptor (A), or is retlut^ in 
binding affinity when the residue is converted to alanine 
(B). or the frequency that the residue is (conserved in the 
natural family of growth hormones (C). The natural loga- 
rithm of the frequency with which wi!d-tyj>e h(JH 
rets duen were selected was calculated from the pha^e 
sorting data shown in Table 2 and Lowman et al. (i991a). 
Th*? log scale was chosen to scale the data and for com- 
parison with buried surface area which has been relat^id to 
binding free energy. Data for residues M14, H18, K4K 
Q4'?. R167 and El 74 could not be used in the graph 
\)e<ause the wild-tyj^ residue was not recovered at any of 
these positions. Wild-type residues from left to right 
across the X axis are F. Phe54: I, Tlel79; V, Tyr42; H, 
His2I: E. Glur)6; D, Aspl71; F, PhelO; R, Arge>4; F. 
Ph«-176: T, Ilc58: K. Lysl72; U Leu45; Argl78: and T, 
Th"l75. In A the change in solvent-accessible surface area 
of each side-chain upon binding was calculated as the 
accessible areas of the side-chain in free h(lrH minus that 
in I he hCU(hGHbp)2 complex (see Cunningham & Wells, 
1993). The eorrelaticm coefficient {F}) for this plot was 
0-022. fn B the relative change in binding affinity from 
wili-type hGH for alanine substitutions is plotted as 
In A'd(Ala mutant)//rj(hGH)J to scale to free energy. 
ThHsc parameters are mildly rrurrelated (/;^ = 0-7l), with a 
slope near unity (y= I'Ox -«-3-9). In C the conservation of 
residues among 1 7 natural growth hormone variants was 
determined from the amino acid sequences (CSenbank. v. 



(Cunningham & Wells, 1993; Fig. 1(a)). The mini- 
helix- 1 contact residues provide a good example of 
this (Table I). The aide-chains of Tyr42 and Gln46 
each make more van der Waals contacts and 
undergo more burial than Leu45. Yet, Y42A and 
Q46A have much smaller cflfects upon binding 
compared to L45A- These studies suggest that 
functionality is not easily assessed by the extent to 
which a side-chain makes contaet with the receptor. 

Another way to evaluate this is to correlate the 
conservation of wild-type residues after binding 
selection with the extent to which they arc buried 
hy the receptor. As shown in Figure 3A, there is no 
correlation between these parameters (^^<0-05). 
This is also evident by comparing the structural and 
phage conservation epitopes shown in Figure 1(b) 
and (c), respectively. Binding aifinity is determined 
by the difference in energy between the bound and 
fi-ee states. Clearly the X-ray structure of the 
complex (de Vos et al., 1992) only reveals the 
molecular details of the bound state. Solvation and 
intramolecular interaction energies in the free state, 
which we have little current information on, can 
dramatically affect the energetics of binding. 



(d) Selected residues at functionally important and 
buried sites terui to be identical or isosieric 
with the wild-type 

There is a moderate correlation (/^^=0"71) 
between the reduction in affinity' as a^ssessed by 
alanine-scanning and side-chain conservation 
following phage sorting (Fig. 3B; Lowman et al.^ 
1991a). This indicates that functionality determined 
by alanine-soanning is similar to that determined by 
sequence conservation after binding selection (also 
compare Fig. 1(a) and (c)). There is not a good 
correlation between the frequency of conservation 
of given residues among natural variant growth 
hormones and conservation following binding selec- 
tion (Fig. 3C). We have argued that natural varia- 
tion occurs in the corresponding hormone receptors 
to complement mutations in the hormone (L#owman 
et aL, 1991a). Also, hOH binds to the prolactin 
receptor which is coded for separately from the hGH 
receptor. Thus, in nature the functional constraints 
on growth hormone are not fixed by a single and 
non -adaptable target receptor as they are by the in 
vitro binding selection. 

We find that selected residues at the most func- 
tionally important and highly buried sites, either at 
the interface or in the hormone itself, tend to be 
retained a« the wild -type residue or a close homolog. 
For example, all five of the residues that are most 
conservecl as the wild-type after extensive phage 



75, Feb.. 1993) from monkey, pig, elephant, hamster, 
whale, alpaca, fox, horse, sheep, rat, turtle, chicken, 
mink, euw. salmon, frog and trf>ut, as well as human 
placental lactogen, hOH(20K) and hOH-V. The correla- 
tion cocflicient for this plot was /f^ = 0*005. 
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sorting (E56, 168, K172, T175 and 11178) are com- 
pletely buried in the complex; converting them to 
alanine caused 4 to 60-fold reductions in affinity 
(Cunningham & Wells, 1989). When substitutions 
were tolerated at these positions they were typically 
similar to the wild-type residue. For example, the 
highest affinity selectants contained either Asp or 
Glu at position 56, /^-branched residues at position 
68, Lys or Arg at 172, Thr or Ser at 176, and Lys or 
Arg at 178 (Table 3; lawman el a/., 1991a). 

There is another group of functional residues that 
become highly buried upon receptor binding (K41, 
L45, R64, D171 and 1179). When these were 
randomized we found improved substitutes that 
tended to be similar in character to the wild-type 
residue. For example, Lys41 was often replaced 
with Arg; Leu45 was substituted with large hydro- 
phobic side-chains; Arg64 was most frequently 
substituted by Lys; Aspl71 was optimally replaced 
by Asn and sometimes Ser; Ilel79 was usually 
substituted by ^-branched residues (Tables 2, 3, 6; 
Lowman et ai,, 1991a). Thus, improvements can bo 
made at functionally important residues buried at 
the interface. These substitutions tend to be toward 
an isosteric side-chain or one of similar chemical 
character. 

Two of the residues that were randomized (His! 8 
and Glul74) had enhanced binding affinity when 
converted to alanine and were completely buried in 
the complex. These almost always sorted to some- 
thing smaller than the wild -type residue. For 
example, the preferred substitution for Hisl8 was 
Asp or Asn, and for Glul74 was Ala, Ser or Thr 
(Lowman et aL, 1991a). This suggests that packing 
at these positions is energetically unfavorable. 

Another class of residues (H21, Y42, Q46 and 
R167) are highly buried at the interface, but have 
little or no effect on binding affinity when converted 
to alanine. These residues rarelv sort back to the 
wild-typo residue. For example, His21 tended to 
sort to Asn; Tyr42 often came back as Arg or Gin; 
Gln46 converted to Trp and Argl67 often sorted to 
Asn (Tables 2, 3, 5; Lowman et al., 1991a}. J>espite 
the consensus found at these buried residues, the. 
affinit}' enhancements made from them were very 
small (Table 6). Thus, it appeared more difficult to 
obtain improvements in affinity from contact 
residues that were functionally inert to start with. 

The last group of residues (FIO, Ml4 and F54) are 
virtually buried in the folded hormone and affected 
binding affinity by two- to fourfold when converted 
to alanine, presumablj' by indirect structural 
effects. Surprisingly radical substitutions were toler- 
ated here that showed consensus sorting (Tables 2, 
3, 5; Lowman et ai., 1991a). For example, F54P was 
almost the sole solution in the LI librarv. Phe54 is 
84% buried in the hormone and 10 A away from 
making contact with the receptor. We estimate the 
F54P mutant enhances affinity by a factor of 

I G-fo!d based on the fact that the double mutant 
{F54F/R64K) is improved in binding by 4-8-fold 
(Table 3) and the R64K mutant alone enhances 
binding by a factor of '^3 (Cunningham et at., 1990). 



Although it is possible to rationali2:e the general 
features of these mutants by combining the func- 
tional and structural data, there were always 
unusual mutants that came through the sorting. For 
example, Ilel79 was almost always conservatively 
replaced by a )?- branched side-chain (especial 1}' Tie 
or Thr), but I179S has appeared (Table 5). 
Similarly, Leu45 was almost always replaced by a 
large side-chain (Leu or Trp), but L45A was also 
found (Table 3). Some of these may appear because 
of a background of weaker affinity variants in the 
selected phage pools. 



(e) Phcuge sorting strategies 

These studies suggest guidelines for affinity matu- 
ration of binding interfaces using monovaJent phage 
display. One cannot easily search more than five or 
six codons exhaustively (Lowman Wells, 1991), 
therefore the library needs to be focused on residues 
where one can hope to improve afllinity. A good 
starting point for targeting residues for optimisa- 
tion of affinity is a complete alanine-scan of the 
relevant interface. Tt is also possible to limit the 
codon choices (see e.g, Arkin & You van, 1992), and 
this is especially reasonable if one has detailed 
knowledge of the structural interface to begin with. 

The residues where the most obvious improve- 
ments in affinitv occurred were those that were 
shown by alanine-scanning to significantly affect 
binding. For example, we estimate the largest 
improvements in affinity came from R64K, E174S 
and F176Y. El 74 A w^as known to enhance affinity, 
but R64A and F176A caused large reductions in 
affinity. Thus, despite the fact that the most highly 
conserved residues in the phage sorting were those 
that were most important by alanine-scanning, we 
still found improved variants here, ft is also note- 
worthy that all three of these residues are func- 
tionally important for binding to the prolactin 
receptor (Cunningham & Wells, 1991). Thus, nature 
may have selected these residues for their dual role, 
not simply for optimal binding to the hGH receptor. 

Ideallv, one should randomise residues that can 
contact each other in the same mutagenesis step so 
that they can covary. We saw covariance in the M I 
and H4a libraries when residues were- close enough 
to interact. Sorting libraries by combinatorial 
means is especially useful in situations where 
mutations may lead to non-additive effects. For 
example, if side-chain replacements cause large 
conformational changes, as they may in flexible 
loops of antibodies, combinatorial sorting would 
allow for improvements by searching randomly for 
the best combinations of mutant heavy and light 
chains (Huse et al., 1989; Clackson et a I., 1991 ; Collet 
et aL, 1992). 

In general the improvements in hGH tended to 
occur by simple additive effects both between librar- 
ies and within libraries. Practically, this means that 
one can randomize many residues independently 
and combine them in the end to obtain high affinity 
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variants. Fundamentally, it suggests that the inter- 
actions between side-chains on hGH often have 
little overall effect on the free energy of binding 
receptor. 
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